Gene Information

Name : eatA (ETEC_p948_0020)
Accession : YP_006203824.1
Strain :
Genome accession: NC_017724
Putative virulence/resistance : Virulence
Product : serine protease EatA
Function : -
COG functional category : -
COG ID : -
EC number : 3.4.21.-
Position : 1202 - 5296 bp
Length : 4095 bp
Strand : -
Note : -
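
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(nt_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_length // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(1202, 5296)
print(length_bp)                  # 4095
print(protein_length(length_bp))  # 1364
```

Under these assumptions the 4095 bp gene encodes a 1364-residue product, which matches the protein sequence listed in this record.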

DNA sequence :
ATGAATAAAGTGTTCTCTCTTAAGTATAGTTTTTTAGCCAAAGGTTTTATTGCTGTTTCTGAACTTGCCCGTCGCGTTTC
TGTTAAAGGGAAACTGAAGAGTGCTTCATCAATAATTATTTCACCAATAACAATTGCTATTGTTTCTTATGCACCCCCAT
CTCTTGCTGCAACAGTTAATGCAGATATATCGTATCAAACATTTCGGGATTTTGCCGAAAATAAAGGAGCTTTTATAGTT
GGCGCATCAAATATAAATATCTACGATAAGAATGGAGTGTTAGTTGGAGTGCTTGATAAAGCTCCAATGCCTGATTTTAG
TAGCGCCACGATGAATACAGGGACATTACCACCAGGAGACCATACACTGTACTCACCTCAATATGTTGTCACAGCAAAGC
ATGTTAATGGATCAGATATAATGAGTTTTGGACATATTCAAAATAATTATACTGTAGTAGGAGAGAACAACCATAATAGC
CTTGATATTAAAATACGGCGTTTAAATAAGATTGTCACGGAGGTCGCCCCTGCAGAAATCTCCAGTGTTGGAGCTGTAAA
TGGCGCTTATCAAGAAGGGGGGCGTTTTAAAGCCTTTTATAGGCTTGGAGGTGGATTGCAATATATAAAGGATAAAAATG
GGAATCTTACACCGGTATATACAAATGGTGGTTTCCTAACCGGAGGTACTATCAGTGCTTTAAGCTCATATAACAATGGC
CAAATGATTACCGCACCTACAGGCGATATATTTAATCCAGCTAATGGGCCTCTTGCAAACTATCTAAATAAAGGTGATAG
TGGCTCTCCTTTATTTGCGTATGACTCTCTGGACAAAAAATGGGTTTTAGTTGGCGTTCTGTCATCAGGAAGCGAGCATG
GTAATAACTGGGTTGTCACAACTCAAGATTTTCTTCATCAGCAACCAAAACATGATTTTGATAAAACAATATCATATGAC
TCTGAAAAGGGTAGCTTACAATGGAGATATAATAAAAATTCAGGAGTGGGAACATTAAGTCAAGAGAGCGTTGTGTGGGA
CATGCATGGAAAAAAAGGGGGAGATCTAAACGCAGGTAAAAATCTTCAATTTACAGGAAATAATGGAGAGATTATTTTAC
ACGACTCTATAGATCAAGGGGCTGGCTATTTGCAGTTTTTTGACAACTACACAGTTACATCCTTAACTGACCAAACATGG
ACCGGAGGTGGTATCATTACTGAAAAAGGTGTAAATGTGCTTTGGCAGGTTAATGGTGTTAATGATGATAACTTACATAA
AGTTGGTGAAGGCACATTAACTGTTAATGGAAAAGGGGTTAATAATGGAGGACTGAAAGTCGGTGATGGAACCGTAATTC
TGAATCAACGCCCTGATGATAATGGACACAAGCAAGCCTTTAGCTCTATTAACATTTCCAGTGGTCGTGCAACAGTTATA
CTTTCAGATGCTAATCAAGTTAACCCAGATAAAATATCATGGGGATATAGAGGCGGTACTCTTGATTTAAATGGAAATAA
TGTAAACTTTACTCGTCTTCAGGCTGCAGATTATGGTGCTATTGTTTCTAACAATAACAAAAACAAATCTGAATTAACAC
TTAAATTACAAACACTAAATGAAAATGACATTAGTGTTGATGTGAAGACATATGAAGTTTTTGGGGGGCATGGTAGTCCA
GGTGACTTATATTATGTTCCTGCATCAAATACTTACTTTATCCTGAAATCAAAGGCGTACGGTCCATTTTTCAGTGATTT
AGATAATACCAATGTCTGGCAAAATGTTGGTCACGATCGTGATAAAGCGATTCAAATCGTGAAACAGCAGAAGATTGGGG
AAAGCTCTCAACCTTATATGTTTCATGGACAACTTAATGGTTATATGGATGTAAATATACATCCACTCTCTGGTAAGGAT
GTGCTGACTCTTGATGGTTCTGTTAATCTGCCTGAAGGGGTGATAACGAAAAAGTCAGGTACTCTGATATTTCAAGGGCA
TCCGGTGATTCATGCTGGAATGACAACCTCAGCCGGCCAGAGTGATTGGGAAAATCGTCAGTTTACAATGGATAAACTGA
GGCTTGATGCAGCAACATTCCATCTCTCCAGAAATGCTCATATGCAGGGAGATATTAGTGCTGCCAACGGAAGCACCGTC
ATTCTGGGAAGTTCTCGGGTCTTTACTGACAAGAATGACGGAACCGGTAATGCGGTATCTTCTGTTGAAGGGAGTTCCAT
TGCAACAACAGCTGGTGACCAAAGTTATTACAGCGGTAATGTGCTGCTGGAAAACCATTCGTCTCTAGAGGTCAGGGAGA
ATTTTACTGGTGGTATTGAGGCTTATGACAGTTCTGTTAGTGTGACCTCTCAGAATGCTATTTTTGACCATGTTGGTAGC
TTTGTTAATAGTAGTCTGCTTCTCGAAAAAGGAGCAAAACTGACAGCACAGAGTGGTATTTTCACAAATAACACTATGAA
AATAAAAGAAAACGCCTCCCTGACTCTGACAGGGATACCTTCTGTAGGAAAGCCAGGGTATTATTCACCTGTGACCTCGA
CTACTGAAGGAATTCATCTCGGTGAGCGAGCCAGCCTTTCAGTGAAAAATATGGGCTATCTGAGTTCAAATATTACAGCA
GAGAACTCTGCAGCAATTATTAATCTGGGAGACAGTAATGCAACTATCGGGAAGACGGACTCTCCATTATTCAGTACCTT
AATGAGGGGATATAATGCTGTTTTGCAGGGCAATATTATGGGGCCCCAGAGCTCAGTGAATATGAACAATGCTCTGTGGC
ACTCTGATAGAAATTCGGAACTCAAAGAGCTGAAAGCCAACGACTCCCAAATAGAGTTGGGTGTAAGAGGGCATTTTGCA
AAACTGCGGGTAAAAGAGCTTATTGCGTCTAACTCAGTGTTTCTTGTACATGCAAACAATAGCCAGGCTGACCAGTTGAA
CGTTACCGACAAACTGCAGGGCAGCAACAATACTATTCTTGTTGACTTTTTTAACAAAGCAGCCAATGGTACAAATGTGA
CGTTAATTACTGCACCAAAAGGCAGTGATGAAAATACATTCAAAGCCGGAACCCAGCAGATTGGATTCAGTAATATCACG
CCAGAAATCAGGACAGAAAATACGGATACAGCCACACAGTGGGTGCTGACTGGATATCAGTCTGTCGCTGATGCCAGAGC
CTCGAAAATCGCAACGGACTTTATGGATTCAGGTTATAAATCTTTCCTGACGGAAGTCAATAATCTGAACAAACGTATGG
GAGATTTACGGGATAGTCAGGGAGATGCTGGAGGGTGGGCGCGTATCATGAATGGTACCGGTTCAGGTGAGAGTGGTTAC
AGAGATAACTATACCCACGTTCAGATTGGTGCAGACAGAAAGCATGAGCTGAACGGTATAGATTTATTCACCGGTGCATT
ACTGACTTATACAGACAACAATGCTAGCAGCCAGGCTTTCAGCGGTAAAACAAAATCGCTAGGGGGAGGGGTGTATGCAT
CAGGTCTCTTTGAGTCTGGAGCTTATTTTGACCTGATTGGTAAATATCTCCATCATGATAATCGGTATACGTTGAATTTT
GCCTCCTTGGGGGAAAGAAGCTACACCTCCCATTCTTTGTATGCTGGAGCTGAAATCGGGTATCGTTATCACATGTCAGA
AAATACATGGGTGGAACCACAGATGGAACTGGTTTATGGTTCGGTATCAGGAAAGTCATTTAACTGGAAAGACCAGGGAA
TGCAACTGAGTATGAAAGACAAAGACTATCACCCACTAATTGGTCGAACAGGTGTGGATGTAGGTAGAGCGTTCTCTGGA
GATACCTGGAAAGTAACAGTACGTGCAGGACTGGGTTACCAGTTCGATTTGCTGGCAAATGGAGAAACTGTTTTACAGGA
TGCTTCTGGTAAAAAACACTTCAAAGGTGAAAAAGACAGCAGGATGCTAATGAACGTGGGGACGAATGTGGAAGTTAAAG
ACAATATGCGTTTTGGTCTGGAGTTGGAGAAGTCGGCGTTTGGGAGATATAACATAGACAACTCTATAAATGCTAACTTC
CGTTATTATTTCTGA

Protein sequence :
MNKVFSLKYSFLAKGFIAVSELARRVSVKGKLKSASSIIISPITIAIVSYAPPSLAATVNADISYQTFRDFAENKGAFIV
GASNINIYDKNGVLVGVLDKAPMPDFSSATMNTGTLPPGDHTLYSPQYVVTAKHVNGSDIMSFGHIQNNYTVVGENNHNS
LDIKIRRLNKIVTEVAPAEISSVGAVNGAYQEGGRFKAFYRLGGGLQYIKDKNGNLTPVYTNGGFLTGGTISALSSYNNG
QMITAPTGDIFNPANGPLANYLNKGDSGSPLFAYDSLDKKWVLVGVLSSGSEHGNNWVVTTQDFLHQQPKHDFDKTISYD
SEKGSLQWRYNKNSGVGTLSQESVVWDMHGKKGGDLNAGKNLQFTGNNGEIILHDSIDQGAGYLQFFDNYTVTSLTDQTW
TGGGIITEKGVNVLWQVNGVNDDNLHKVGEGTLTVNGKGVNNGGLKVGDGTVILNQRPDDNGHKQAFSSINISSGRATVI
LSDANQVNPDKISWGYRGGTLDLNGNNVNFTRLQAADYGAIVSNNNKNKSELTLKLQTLNENDISVDVKTYEVFGGHGSP
GDLYYVPASNTYFILKSKAYGPFFSDLDNTNVWQNVGHDRDKAIQIVKQQKIGESSQPYMFHGQLNGYMDVNIHPLSGKD
VLTLDGSVNLPEGVITKKSGTLIFQGHPVIHAGMTTSAGQSDWENRQFTMDKLRLDAATFHLSRNAHMQGDISAANGSTV
ILGSSRVFTDKNDGTGNAVSSVEGSSIATTAGDQSYYSGNVLLENHSSLEVRENFTGGIEAYDSSVSVTSQNAIFDHVGS
FVNSSLLLEKGAKLTAQSGIFTNNTMKIKENASLTLTGIPSVGKPGYYSPVTSTTEGIHLGERASLSVKNMGYLSSNITA
ENSAAIINLGDSNATIGKTDSPLFSTLMRGYNAVLQGNIMGPQSSVNMNNALWHSDRNSELKELKANDSQIELGVRGHFA
KLRVKELIASNSVFLVHANNSQADQLNVTDKLQGSNNTILVDFFNKAANGTNVTLITAPKGSDENTFKAGTQQIGFSNIT
PEIRTENTDTATQWVLTGYQSVADARASKIATDFMDSGYKSFLTEVNNLNKRMGDLRDSQGDAGGWARIMNGTGSGESGY
RDNYTHVQIGADRKHELNGIDLFTGALLTYTDNNASSQAFSGKTKSLGGGVYASGLFESGAYFDLIGKYLHHDNRYTLNF
ASLGERSYTSHSLYAGAEIGYRYHMSENTWVEPQMELVYGSVSGKSFNWKDQGMQLSMKDKDYHPLIGRTGVDVGRAFSG
DTWKVTVRAGLGYQFDLLANGETVLQDASGKKHFKGEKDSRMLMNVGTNVEVKDNMRFGLELEKSAFGRYNIDNSINANF
RYYF
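
The protein sequence can be reproduced from the DNA sequence by codon-wise translation. The sketch below is a minimal, dependency-free example that assumes the listed DNA is already given in coding orientation (5' to 3'), and uses the standard codon assignments, which are the same as those of the bacterial code (NCBI translation table 11). The helper name `translate` is illustrative.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove line breaks and spaces
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons and last five codons of the eatA sequence in this record.
print(translate("ATGAATAAAGTGTTCTCTCTT"))  # MNKVFSL
print(translate("CGTTATTATTTCTGA"))        # RYYF
```

The two fragments translate to the first seven and last four residues of the protein sequence above, with the final TGA read as the stop codon.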

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
pic | NP_838464.1 | serine protease precursor | Virulence | SHI-1 | Protein | 0.0 | 54
pic | AAK00464.1 | Pic | Virulence | SHI-1 | Protein | 0.0 | 53
she | AAB58244.1 | mucinase | Virulence | SHI-1 | Protein | 0.0 | 53
pic | NP_708747.3 | serine protease | Not tested | SHI-1 | Protein | 0.0 | 53
unnamed | CAC39286.1 | hypothetical protein | Not tested | LPA | Protein | 0.0 | 51
vat | YP_851472.1 | vacuolating autotransporter | Not tested | PAI III APEC-O1 | Protein | 0.0 | 44
unnamed | CAD66214.1 | putative hemoglobin protease | Not tested | PAI III 536 | Protein | 0.0 | 44
vat | AAO21903.1 | vacuolating autotransporter toxin | Virulence | Not named | Protein | 0.0 | 43

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
eatA | YP_006203824.1 | serine protease EatA | VFG0903 | Protein | 0.0 | 53
eatA | YP_006203824.1 | serine protease EatA | VFG0635 | Protein | 0.0 | 53
eatA | YP_006203824.1 | serine protease EatA | VFG0861 | Protein | 0.0 | 53
eatA | YP_006203824.1 | serine protease EatA | VFG1689 | Protein | 0.0 | 44
eatA | YP_006203824.1 | serine protease EatA | VFG0904 | Protein | 0.0 | 44