Gene Information

Name : pet (EC042_pAA035)
Accession : YP_006099165.1
Strain :
Genome accession: NC_017627
Putative virulence/resistance : Virulence
Product : serine protease (plasmid-encoded toxin Pet)
Function : -
COG functional category : -
COG ID : -
EC number : 3.4.21.-
Position : 28073 - 31960 bp
Length : 3888 bp
Strand : +
Note : -

DNA sequence :
ATGAATAAAATATACTCCATTAAATATAGTGCTGCCACTGGCGGACTCATTGCTGTTTCTGAATTAGCGAAAAAAGTCAT
ATGTAAAACAAACCGAAAAATTTCTGCTGCATTATTATCTCTGGCAGTTATTAGTTATACTAATATAATATATGCCGCCA
ATATGGATATATCTAAAGCATGGGCCCGGGATTATCTCGATCTGGCACAGAATAAAGGGGTGTTTCAACCAGGTTCAACA
CATGTAAAAATAAAACTGAAAGACGGGACTGATTTTTCATTTCCAGCACTTCCTGTTCCTGACTTTTCATCTGCAACCGC
AAATGGAGCTGCAACAAGTATTGGTGGTGCCTATGCCGTAACCGTTGCACACAATGCAAAAAATAAGTCATCAGCTAATT
ATCAAACATACGGTTCTACGCAATATACTCAAATAAACAGAATGACAACTGGAAACGATTTTTCCATTCAGCGATTAAAC
AAGTATGTCGTGGAAACAAGAGGGGCTGATACATCATTTAATTATAATGAGAACAACCAAAATATTATTGACAGATATGG
CGTAGACGTTGGAAATGGAAAAAAAGAAATCATTGGTTTTCGTGTTGGTTCAGGAAACACCACTTTTTCCGGAATAAAAA
CATCCCAAACATATCAGGCTGACCTGTTAAGTGCATCACTATTCCATATAACAAATTTACGAGCAAATACTGTCGGAGGT
AACAAAGTGGAATATGAAAATGACTCATATTTCACTAACTTAACCACTAATGGTGACAGTGGATCAGGCGTGTATGTATT
TGATAACAAAGAAGATAAATGGGTTCTACTTGGAACAACCCATGGAATAATAGGGAACGGAAAAACGCAAAAAACATATG
TAACACCATTTGACTCCAAAACCACCAATGAATTAAAGCAACTATTTATTCAAAATGTTAATATTGATAACAATACTGCT
ACCATTGGTGGTGGTAAGATAACTATTGGCAATACAACTCAAGATATCGAGAAAAATAAAAATAACCAGAATAAAGACCT
AGTGTTCTCTGGTGGTGGTAAAATCTCATTAAAAGAGAATCTTGATCTTGGATATGGTGGGTTTATTTTTGATGAAAATA
AAAAATATACTGTTAGCGCTGAAGGGAATAATAATGTCACCTTTAAAGGTGCAGGCATTGATATAGGTAAAGGCAGTACT
GTTGACTGGAACATCAAATATGCCTCAAATGATGCACTGCATAAAATTGGTGAAGGGAGCCTTAATGTCATACAGGCACA
GAATACGAATCTGAAAACCGGGAACGGGACCGTCATTCTTGGCGCACAGAAAACGTTCAACAATATCTATGTCGCCGGTG
GCCCGGGCACAGTACAACTCAATGCAGAGAACGCCCTGGGTGAGGGTGATTATGCTGGTATTTTTTTCACTGAAAACGGC
GGAAAACTCGACCTGAATGGTCATAACCAGACCTTCAAAAAAATTGCTGCAACAGATTCCGGAACCACCATCACTAACAG
TAACACCACTAAAGAGAGTGTACTGTCGGTCAATAACCAGAATAACTATATCTATCATGGTAATGTGGACGGCAATGTAC
GCCTTGAACATCACCTCGACACTAAGCAGGATAATGCCCGCCTGATACTGGATGGTGATATTCAGGCAAACAGTATCAGT
ATCAAAAATGCCCCTCTGGTAATGCAGGGCCATGCGACTGATCACGCCATTTTCAGAACAACAAAAACAAATAATTGTCC
TGAGTTCCTCTGTGGTGTTGACTGGGTCACCAGAATCAAAAATGCTGAGAATTCAGTAAATCAGAAGAATAAAACCACCT
ATAAATCGAATAACCAGGTTTCCGATTTGTCCCAGCCGGACTGGGAAACCAGAAAATTTAGATTCGACAATCTGAATATT
GAAGATTCATCATTATCCATTGCCAGAAATGCAGATGTTGAAGGTAACATCCAGGCTAAAAACTCTGTGATAAATATCGG
GGACAAAACGGCATATATTGATCTGTACTCAGGAAAAAATATTACCGGTGCCGGATTCACCTTTCGTCAGGACATAAAAA
GCGGTGACTCCATCGGTGAAAGTAAATTTACCGGGGGCATTATGGCAACAGATGGCTCCATCAGCATAGGGGATAAAGCC
ATTGTCACGCTGAACACGGTCTCGTCTCTGGACAGAACAGCGCTGACTATCCACAAGGGGGCGAATGTTACGGCCAGCAG
TTCCCTTTTCACCACCAGTAACATCAAATCCGGAGGCGACCTGACCCTGACTGGCGCAACAGAATCGACCGGGGAAATCA
CTCCGTCGATGTTCTATGCTGCAGGAGGATATGAACTGACGGAAGACGGGGCTAACTTTACCGCCAAAAATCAGGCCTCT
GTAACCGGTGATATTAAATCCGAAAAAGCAGCAAAACTTTCATTTGGCTCCGCTGACAAGGATAATTCTGCCACAAGATA
TTCGCAGTTTGCTCTCGCGATGCTGGATGGCTTTGATACGTCCTATCAGGGCAGCATTAAGGCTGCACAATCCAGCCTTG
CAATGAATAATGCGCTCTGGAAAGTGACCGGCAATTCCGAGTTGAAAAAACTGAACTCCACCGGCAGTATGGTGCTCTTC
AACGGAGGGAAAAACATCTTCAATACACTGACTGTCGATGAACTGACAACCAGTAACAGTGCCTTTGTGATGCGAACCAA
TACACAACAGGCAGACCAGTTAATTGTTAAAAACAAACTGGAAGGTGCAAACAACCTGCTGTTAGTCGATTTTATTGAGA
AAAAAGGAAACGACAAAAACGGTCTGAACATCGATCTGGTTAAGGCTCCTGAGAATACCAGTAAGGATGTCTTCAAAACT
GAAACACAGACCATTGGTTTCAGTGATGTAACCCCTGAAATTAAACAGCAGGAAAAAGATGGCAAATCTGTCTGGACGCT
GACCGGGTATAAAACGGTGGCAAATGCTGATGCTGCGAAAAAGGCAACATCACTGATGTCAGGCGGCTATAAAGCCTTCC
TTGCAGAGGTCAACAACCTCAACAAACGTATGGGTGATCTGCGTGACATTAACGGTGAGGCCGGTGCATGGGCCCGTATC
ATGAGTGGAACCGGGTCTGCCGGCGGTGGATTCAGTGACAACTACACCCACGTTCAGGTCGGTGCGGATAACAAACATGA
ACTCGATGGCCTTGACCTCTTCACCGGGGTGACCATGACCTATACCGACAGCCATGCAGGCAGTGATGCCTTCAGTGGTG
AAACGAAGTCTGTGGGTGCCGGTCTCTATGCCTCTGCCATGTTTGAGTCCGGAGCATATATCGACCTCATCGGTAAGTAC
GTTCACCATGACAACGAGTATACCGCAACTTTCGCCGGCCTTGGCACCAGAGACTACAGCTCCCACTCCTGGTATGCCGG
TGCGGAAGTCGGTTACCGTTACCATGTAACTGACTCTGCATGGATTGAGCCGCAGGCGGAACTTGTTTACGGTGCTGTAT
CCGGGAAACAGTTCTCCTGGAAGGACCAGGGAATGAACCTCACCATGAAGGATAAGGACTTTAATCCGCTGATTGGGCGT
ACCGGTGTTGATGTGGGTAAATCCTTCTCCGGTAAGGACTGGAAAGTCACAGCCCGCGCCGGCCTTGGCTACCAGTTTGA
CCTGTTTGCCAACGGTGAAACTGTACTGCGTGATGCGTCCGGTGAAAAACGTATCAAAGGTGAAAAAGACGGCCGTATGC
TCATGAATGTTGGTCTGAATGCTGAGATTCGTGACAACGTACGCTTTGGTCTTGAGTTTGAGAAATCGGCATTTGGTAAG
TACAACGTGGATAACGCCATCAACGCCAACTTCCGTTACTCCTTCTGA

Protein sequence :
MNKIYSIKYSAATGGLIAVSELAKKVICKTNRKISAALLSLAVISYTNIIYAANMDISKAWARDYLDLAQNKGVFQPGST
HVKIKLKDGTDFSFPALPVPDFSSATANGAATSIGGAYAVTVAHNAKNKSSANYQTYGSTQYTQINRMTTGNDFSIQRLN
KYVVETRGADTSFNYNENNQNIIDRYGVDVGNGKKEIIGFRVGSGNTTFSGIKTSQTYQADLLSASLFHITNLRANTVGG
NKVEYENDSYFTNLTTNGDSGSGVYVFDNKEDKWVLLGTTHGIIGNGKTQKTYVTPFDSKTTNELKQLFIQNVNIDNNTA
TIGGGKITIGNTTQDIEKNKNNQNKDLVFSGGGKISLKENLDLGYGGFIFDENKKYTVSAEGNNNVTFKGAGIDIGKGST
VDWNIKYASNDALHKIGEGSLNVIQAQNTNLKTGNGTVILGAQKTFNNIYVAGGPGTVQLNAENALGEGDYAGIFFTENG
GKLDLNGHNQTFKKIAATDSGTTITNSNTTKESVLSVNNQNNYIYHGNVDGNVRLEHHLDTKQDNARLILDGDIQANSIS
IKNAPLVMQGHATDHAIFRTTKTNNCPEFLCGVDWVTRIKNAENSVNQKNKTTYKSNNQVSDLSQPDWETRKFRFDNLNI
EDSSLSIARNADVEGNIQAKNSVINIGDKTAYIDLYSGKNITGAGFTFRQDIKSGDSIGESKFTGGIMATDGSISIGDKA
IVTLNTVSSLDRTALTIHKGANVTASSSLFTTSNIKSGGDLTLTGATESTGEITPSMFYAAGGYELTEDGANFTAKNQAS
VTGDIKSEKAAKLSFGSADKDNSATRYSQFALAMLDGFDTSYQGSIKAAQSSLAMNNALWKVTGNSELKKLNSTGSMVLF
NGGKNIFNTLTVDELTTSNSAFVMRTNTQQADQLIVKNKLEGANNLLLVDFIEKKGNDKNGLNIDLVKAPENTSKDVFKT
ETQTIGFSDVTPEIKQQEKDGKSVWTLTGYKTVANADAAKKATSLMSGGYKAFLAEVNNLNKRMGDLRDINGEAGAWARI
MSGTGSAGGGFSDNYTHVQVGADNKHELDGLDLFTGVTMTYTDSHAGSDAFSGETKSVGAGLYASAMFESGAYIDLIGKY
VHHDNEYTATFAGLGTRDYSSHSWYAGAEVGYRYHVTDSAWIEPQAELVYGAVSGKQFSWKDQGMNLTMKDKDFNPLIGR
TGVDVGKSFSGKDWKVTARAGLGYQFDLFANGETVLRDASGEKRIKGEKDGRMLMNVGLNAEIRDNVRFGLEFEKSAFGK
YNVDNAINANFRYSF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
sat YP_002414040.1 Serine protease Not tested Not named Protein 0.0 64
sigA NP_838462.1 serine protease Virulence SHI-1 Protein 0.0 57
sigA NP_708742.1 serine protease Virulence SHI-1 Protein 0.0 57
sigA AAF67320.1 exported serine protease SigA Virulence SHI-1 Protein 0.0 57
espC AAG37043.1 enterotoxin EspC Virulence espC PAI Protein 0.0 53
pic NP_838464.1 serine protease precurser Virulence SHI-1 Protein 2e-177 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
pet YP_006099165.1 serine protease (plasmid-encoded toxin Pet) VFG0862 Protein 0.0 100
pet YP_006099165.1 serine protease (plasmid-encoded toxin Pet) VFG0902 Protein 0.0 64
pet YP_006099165.1 serine protease (plasmid-encoded toxin Pet) VFG0630 Protein 0.0 57
pet YP_006099165.1 serine protease (plasmid-encoded toxin Pet) VFG0844 Protein 0.0 57
pet YP_006099165.1 serine protease (plasmid-encoded toxin Pet) VFG0772 Protein 0.0 53