Gene Information

Name : eatA (EcE24377A_E0051)
Accession : YP_001451588.1
Strain :
Genome accession: NC_009790
Putative virulence/resistance : Virulence
Product : secreted serine peptidase EatA
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3468
EC number : -
Position : 49306 - 53406 bp
Length : 4101 bp
Strand : +
Note : identified by similarity to SP:Q84GK0; match to protein family HMM PF02395; match to protein family HMM PF03212; match to protein family HMM PF03797; match to protein family HMM TIGR01414
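
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and only illustrate the check.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


# Values taken from the Position field of this record.
length_bp = gene_length_bp(49306, 53406)
protein_len = expected_protein_length(length_bp)
print(length_bp, protein_len)
```

Under these assumptions the computed length matches the Length field (4101 bp) and corresponds to 1366 residues, which agrees with the protein sequence listed in this record.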

DNA sequence :
ATGAATAAAGTGTTCTCTCTTAAGTATAGTTTTTTAGCCAAAGGTTTTATTGCTGTTTCTGAACTTGCCCGTCGCGTTTC
TGTTAAAGGGAAACTGAAGAGTGCTTCATCAATAATTATTTCACCAATAACAATTGCTATTGTTTCTTATGCCCCCCCAT
CTCTTGCTGCAACAGTTAATGCAGATATATCGTATCAAACATTTCGGGATTTTGCCGAAAATAAAGGAGCTTTTATAGTT
GGCGCATCAAATATAAATATCTACGATAAGAATGGAGTGTTAGTTGGAGTGCTTGATAAAGCTCCAATGCCTGATTTTAG
TAGTGCCACGATGAATACAGGGACATTACCACCAGGAGACCATACACTGTACTCACCTCAATATGTTGTCACAGCAAAGC
ATGTTAATGGATCAGATATAATGAGTTTTGGACATATTCAAAATAATTATACTGTAGTAGGAGAGAACAACCATAATAGC
CTTGATATTAAAACACGACGTTTAAATAAGATTGTCACGGAGGTCGCCCCTGCAGAAGTCTCCAGTGTTGGAGCTGTAAA
TGGCGCTTATCAAGAAGGGGGGCGTTTTACAGCCTTTTATAGGCTTGGAGGTGGATTGCAATATATAAAGGATAAAAATG
GAAATCTTACACCGGTATATACAAATGGTGGTTTCCTAACTGGAGGAACTATCAGTGCTTTAAGCTCATATAACAATGGG
CAAATGATTACTGCACCTACAGGAGATATTTTTAATCCTGCTAATGGGCCTCTTGCAAACTATCTAAATAAAGGTGATAG
TGGCTCTCCTTTATTTGCGTATGACTCTCTGGAAAAAAAATGGGTTTTAATTGGCGTTCTTTCATCAGGAAGTGAGTATG
GTAATAACTGGGTTGTCACAACTCAAGATTTTCTTAATCAGCAACAAAAGCATGATTTTGATAAAACAATATCATATGAC
TCTAAAAAGGGGAGCTTACAATGGAGATATGATAAAGACGCAGGAGTGGGAACATTAAGTCAAGAGGGAGTTGTGTGGGA
CATGCATGGAAAAAAAGGGGAAGACCTAAATGCAGGTAAAAATCTTCAATTTACAGGAAATAATGGAGAGGTTATTTTAC
ACGACTCTATAGATCAAGGGGCTGGCTATCTGCAGTTTTTTAACAACTACACAGTTACATCCTTAACTGACCAAACATGG
ACCGGAGGTGGTATCATTACTGAAAAAGGTGTAAATGTGCTTTGGCAGGTTAATGGTGTTAATAATGATAACTTACATAA
AGTTGGTGAAGGCACATTAACTGTTAATGGAAAAGGGGTTAATAATGGAGGACTGAAAGTCGGTGATGGAACCGTAATTC
TGAATCAACGCCCTGATGATAATGGACACAAGCAAGCCTTTAGCTCTATTAACATTTCCAGTGGTCGTGCAACAGTTATA
CTTTCAGATGCTAATCAAGTTAACCCAGATAAAATATCATGGGGATATAGAGGCGGCACTCTTGATTTAAATGGAAATAA
TGTAACCTTTACTCGTCTTCAGGCTGCAGATTATGGTGCTATTATTTCTAACAATAACAATAACAAAAACAAATCTGAAT
TAACACTTAAATTACAAACACTAAATGAAAATGACATTAGTGTTGATGTGAAGACATATGAAGTTTTTGGGGGGCGTGGT
AGTCCAGGTGACTTATATTATGTTTCTGCATCAAATACTTACTTTATCCTGAAATCAAAGGCGTACGGTCCATTTTTCAG
TGATTTAGATAATACCAATGTCTGGCAAAATGTTGGTCACGATCGTGATAAAGCGATTCAAATCGTGAAACAGCAGAAGA
TTGAGGAAAGCTCTCAACCTTATATGTTTCATGGACAACTTAATGGTTATATGGATGTAAATATACATCCACTCTCTGGT
AAGGATGTGCTGACTCTTGATGGTTCTGTTAATCTGCCTGAAGGGGTGATAACGAAAAAGTCAGGTACTCTGATATTCCA
GGGGCATCCGGTGATTCATGCTGGAATGACAACCTCAGCCGGCCAGAGTGATTGGGAAAACCGTCAGTTTACAATGGATA
AACTGAAGCTTGATGCAGCAACATTCCATCTCTCCAGAAATACTCGTATGCAAGGAGATATTAGTGCTGCCAACGGAAGT
ACCGTCATTCTGGGAAGTTCTCGGGTCTTTACTGACAAGAATGATGGAACCGGTAATGCAGTATCTTCTGTTGAAGGGAG
TTCCACTGCAACAACAGCTGCTGACCAAAGTTATTACAGTGGTAATGTGCTGCTCGAAAACCATTCGTCTCTGGAGGTCA
GGGAGAATTTTACTGGTGGTATTGAGGCTTATGACAGTTCTGTTAGTGTGACCTCTCAGAATGCTATTCTTGACCATGTT
GGTAGCTTTATTAATAGTAGTCTGCTTCTCGAAAAAGGAGCAAAACTGACTGCACAGAGTGGTATTTTCACAAATAACAC
TATGGAAATAAAAGAAAACGCCTCTCTGACTCTGACAGGGATCCCTTCTGTAGGAAAGCCGGGGTATTATTCACCTGTAA
TCTCGACTACAGAAGGAATTCATCTCGGTGAGCGGGCCAGCCTTTCAGTGAAAAATATGGGCTATCTGAGTTCAAATATT
ATAGCAGAGGACTCTGCAGCAATTATTAATCTGGGAGACAGTAATGCAACTATCGGGAAGACGGACTCTCCATTATTCAA
TACCTTAATGAGGGGATATAATGCTGTTTTGCAGGGCAATATTATGGGGCCACAGAGCTCAGTGAATATGAACAATGCTC
TGTGGCACTCTGATAGAAATTCGGAAATCAAAGAGCTGAAAGCCAACGACTCCCAAATAGAGTTAGGTGGAAGAGGGCAT
TTTGCAAAACTGCGGGTAAAAGAGCTTATTGCGTCTAACTCAGTGTTTCTTGTACATGTAAACAATGGCCAGGCTGACCA
GTTGAATGTTACCGGCAAACTGCAGGGCAGCAATAATACTATTCTTGTTAACTTTTTTAACAAAGCAGCCAATGGTACAA
ATGTGACGTTAATCACTGCACCAAAAGGCAGTGATGAAAATACATTCAAGGCCGGAACCCAGCAGATTGGATTCAGTAAT
ATCACGCCAGAAATCAGGACAGAAAATACGGATACAGCCACAAAGTGGGTGCTGACTGGATATCAGTCTGTCGCTGATGC
CAGAGCCTCGAAAATCGCAACGGACTTTATGGATTCAGGTTATAAATCCTTCCTGACGGAAGTCAATAATCTGAACAAAC
GTATGGGGGATTTACGGGATAGTCAGGGAGATGCTGGAGGGTGGGCGCGTATCATGAATGGTACCGGTTCAGGTGAGAGT
GGTTACAGAGATAACTATACCCACGTTCAGATTGGTGCAGACAGAAAGCATGAGCTGAACGGTCTAAATTTATTCACCGG
TGCATTACTGACCTATACAGACAGCAATGCCAGCAGCCAGGCTTTCAGCGGTAAAACAAAATCGCTAGGGGGAGGGGTGT
ATGCATCAGGTCTCTTTGAGTCTGGAGCTTATTTTGACCTGATTGGTAAATATCTCCATCATGATAATCGGTATACGTTG
AATTTTGCCTCCCTGGGGGAAAGAAGCTATACCACCCATTCTTTGTATGCTGGAGCTGAAATCGGGTATCGTTATCACAT
GTCAGAAAATACATGGGTGGAACCACAGATGGAACTGGTTTATGGTTCGGTATCAGGAAAGTCATTTAACTGGAAAGACC
AGGGAATGCAATTGAGTATGAAAGACAAAGACTATCACCCACTAATTGGTCGAACAGGCGTGGATGTAGGTAGAGTGTTC
TCTGGAGATACCTGGAAAGTAACAGCACGTGCAGGACTGAGTTACCAGTTCGATTTGCTGGCAAATGGAGAAACTGTTTT
ACAGGATGCTTCTGGTAAAAAACACTTCAAAGGTGAAAAAGACAGCAGGATGCTAATGAACGTGGGAACGAATGTGGAAG
TTAAAGACAATATGCGTTTTGGTCTGGAGTTGGAGAAGTCGGCGTTTGGGAGATATAATATAGACAACTCTATAAATGCT
AACTTCCGTTATTATTTCTGA

Protein sequence :
MNKVFSLKYSFLAKGFIAVSELARRVSVKGKLKSASSIIISPITIAIVSYAPPSLAATVNADISYQTFRDFAENKGAFIV
GASNINIYDKNGVLVGVLDKAPMPDFSSATMNTGTLPPGDHTLYSPQYVVTAKHVNGSDIMSFGHIQNNYTVVGENNHNS
LDIKTRRLNKIVTEVAPAEVSSVGAVNGAYQEGGRFTAFYRLGGGLQYIKDKNGNLTPVYTNGGFLTGGTISALSSYNNG
QMITAPTGDIFNPANGPLANYLNKGDSGSPLFAYDSLEKKWVLIGVLSSGSEYGNNWVVTTQDFLNQQQKHDFDKTISYD
SKKGSLQWRYDKDAGVGTLSQEGVVWDMHGKKGEDLNAGKNLQFTGNNGEVILHDSIDQGAGYLQFFNNYTVTSLTDQTW
TGGGIITEKGVNVLWQVNGVNNDNLHKVGEGTLTVNGKGVNNGGLKVGDGTVILNQRPDDNGHKQAFSSINISSGRATVI
LSDANQVNPDKISWGYRGGTLDLNGNNVTFTRLQAADYGAIISNNNNNKNKSELTLKLQTLNENDISVDVKTYEVFGGRG
SPGDLYYVSASNTYFILKSKAYGPFFSDLDNTNVWQNVGHDRDKAIQIVKQQKIEESSQPYMFHGQLNGYMDVNIHPLSG
KDVLTLDGSVNLPEGVITKKSGTLIFQGHPVIHAGMTTSAGQSDWENRQFTMDKLKLDAATFHLSRNTRMQGDISAANGS
TVILGSSRVFTDKNDGTGNAVSSVEGSSTATTAADQSYYSGNVLLENHSSLEVRENFTGGIEAYDSSVSVTSQNAILDHV
GSFINSSLLLEKGAKLTAQSGIFTNNTMEIKENASLTLTGIPSVGKPGYYSPVISTTEGIHLGERASLSVKNMGYLSSNI
IAEDSAAIINLGDSNATIGKTDSPLFNTLMRGYNAVLQGNIMGPQSSVNMNNALWHSDRNSEIKELKANDSQIELGGRGH
FAKLRVKELIASNSVFLVHVNNGQADQLNVTGKLQGSNNTILVNFFNKAANGTNVTLITAPKGSDENTFKAGTQQIGFSN
ITPEIRTENTDTATKWVLTGYQSVADARASKIATDFMDSGYKSFLTEVNNLNKRMGDLRDSQGDAGGWARIMNGTGSGES
GYRDNYTHVQIGADRKHELNGLNLFTGALLTYTDSNASSQAFSGKTKSLGGGVYASGLFESGAYFDLIGKYLHHDNRYTL
NFASLGERSYTTHSLYAGAEIGYRYHMSENTWVEPQMELVYGSVSGKSFNWKDQGMQLSMKDKDYHPLIGRTGVDVGRVF
SGDTWKVTARAGLSYQFDLLANGETVLQDASGKKHFKGEKDSRMLMNVGTNVEVKDNMRFGLELEKSAFGRYNIDNSINA
NFRYYF
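
The relationship between the DNA and protein sequences above can be checked by translating the coding sequence. This is a minimal sketch, assuming the standard genetic code and translation in frame from the first base; `translate` is a hypothetical helper, not part of any database tool. It is shown on the first 30 and last 21 nucleotides of the DNA sequence in this record.

```python
from itertools import product

# Standard genetic code, built in TCAG order (assumption: no alternative table is needed).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and whitespace
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 21 nt of the DNA sequence listed in this record.
print(translate("ATGAATAAAGTGTTCTCTCTTAAGTATAGT"))  # MNKVFSLKYS
print(translate("AACTTCCGTTATTATTTCTGA"))           # NFRYYF
```

The two outputs match the start (MNKVFSLKYS) and end (NFRYYF) of the protein sequence listed above. Passing the full DNA sequence to the same function would allow a complete comparison.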

• Homologs from PAI DB

Gene     GenBank Accn  Product                            Virulence or Resistance  PAI or REI       Alignment Type  E-val  Identity (%)
pic      NP_838464.1   serine protease precursor          Virulence                SHI-1            Protein         0.0    54
she      AAB58244.1    mucinase                           Virulence                SHI-1            Protein         0.0    53
pic      NP_708747.3   serine protease                    Not tested               SHI-1            Protein         0.0    53
pic      AAK00464.1    Pic                                Virulence                SHI-1            Protein         0.0    53
unnamed  CAC39286.1    hypothetical protein               Not tested               LPA              Protein         0.0    50
vat      YP_851472.1   vacuolating autotransporter        Not tested               PAI III APEC-O1  Protein         0.0    44
unnamed  CAD66214.1    putative hemoglobin protease       Not tested               PAI III 536      Protein         0.0    44
vat      AAO21903.1    vacuolating autotransporter toxin  Virulence                Not named        Protein         0.0    43

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn    Product                         ID of source DB  Alignment Type  E-val  Identity (%)
eatA  YP_001451588.1  secreted serine peptidase EatA  VFG0861          Protein         0.0    53
eatA  YP_001451588.1  secreted serine peptidase EatA  VFG0903          Protein         0.0    53
eatA  YP_001451588.1  secreted serine peptidase EatA  VFG0635          Protein         0.0    53
eatA  YP_001451588.1  secreted serine peptidase EatA  VFG0904          Protein         0.0    44
eatA  YP_001451588.1  secreted serine peptidase EatA  VFG1689          Protein         0.0    44