Gene Information

Name : EcE24377A_3432 (EcE24377A_3432)
Accession : YP_001464429.1
Strain : Escherichia coli E24377A
Genome accession: NC_009801
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG0810
EC number : -
Position : 3439337 - 3443857 bp
Length : 4521 bp
Strand : -
Note : identified by glimmer

DNA sequence :
TTGAGTGCAACCCTGTTAGCCGGTTGTGATGGCGGTGGCTCCGGATCTTCCTCCGATACGCCGCCTGTAGATTCTGGAAC
AGGGTCTTTGCCGGAAGTGAAACCTGATCCAACACCAAACCCGGAGCCGACGCCTGAGCCAACGCCGGACCCAGAACCTA
CGCCGGAACCGACACCTGATCCTGAGCCAACACCAGAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACT
CTGGGCGGAAGCCTGCGGGTAACTGGTGATATCACCTGTAATGATGAATCCAGCGATGGCTTTACCTTTACACCAGGCGA
CAAAGTCACCTGTGTGGCAGGGAACAACACGACAATTGCTACCTTCGACACCCAGTCAGAAGCTGCGCGTAGCCTGCGTG
CGGTTGAAAAAGTGTCGTTTAGTCTTGAGGACGCGCAAGAACTGGCGGGTTCCGACAACAAGAAAAGCAATGCGCTCTCG
CTGGTCACCTCCATGAACAGTTGCCCGGCGAATACAGAACAGGTGTGCCTGGAGTTCTCCTCGGTGATCGAGAGTAAACG
TTTCGACTCGCTGTATAAGCAAATCGATCTGGCACCGGAAGAATTCAAAAAGCTGGTCAATGAAGAGGTGGAAAACAATG
CCGCGACCGATAAAGCGCCATCCACTCATACTTCACCGGTCGTGCCCGCCACCACTCCGGGAACAAAACCGGATCTAAAC
GCTTCCTTCGTGTCGGCTAACGCGGAACAGTTTTATCAGTATCAACCCACTGAAATCATTCTCTCTGAAGGTCGACTGGT
CGATAGTCAGGGGGATGGTGTTGTTGGTGTCAACTATTACACCAATTCCGGCCGTGGTGTAACCGGAGAAAACGGGGAAT
TTTCCTTTAGTTGGGGGGAAACCATCTCCTTTGGCATCGACACTTTTGAGCTTGGTTCTGTGCGTGGTAACAAGTCGACT
ATTGCATTGACTGAACTGGGTGATGAAGTTCGCGGGGCAAATATCGATCAGTTGATTCACCGCTATTCGAAGGCTGGACA
AAATCACACGCGTGTAGTTCCGGATGAAGTGCGCAAGGTTTTTGCTGAATATCCCAACGTGATTAACGAGATTATCAATC
TCTCGTTATCCAATGGTGCGACGCTGGGGGAAGGTGAGCAAGTCGTTAATCTGCCTAACGAATTTATCGAGCAGTTTAAG
ACGGGTCAGGCCAAAGAGATCGATACCGCGATTTGTGCGAAAACCGATGGTTGTAACGAGGCTCGCTGGTTCTCGCTGAC
AACGCGCAATGTTAATGACGGCAAGATTCAGGGCGTTATCAACAAGCTGTGGGGCGTGGATACGAACTACAAATCTGTCA
GTAAGTTCCATGTATTCCATGACTCTACCAACTTCTATGGCAGCACGGGTAATGCGCGCGGTCAGGCGGTGGTGAATATC
TCCAACGCGGCCTTCCCGATTCTGATGGCGCGTAATGATAAAAACTACTGGCTGGCGTTTGGCGAAAAACGCGCCTGGGA
TAAAAATGAGCTGGCGTACATTACTGAAGCGCCTTCCATTGTGCGACCAGAGAACGTGACACGCGAAACCGCCACCTTCA
ACCTGCCGTTTATCTCGCTGGGGCAAGTGGGCGATGGCAAGCTGATGGTTATCGGTAACCCACACTACAACAGCATCCTG
CGTTGCCCGAACGGCTACAGCTGGAACGGGGGCGTTAATAAAGACGGACAGTGTACGCTCAACAGCGACCCGGATGACAT
GAAGAACTTCATGGAGAACGTGCTGCGCTATCTGTCCGACGATAAATGGACGCCGGACGCGAAAGCCAGCATGACCGTAG
GCACCAACCTGGATACTGTCTATTTTAAACGTCATGGTCAGGTTACAGGAAACAGCGCTGCGTTCGACTTCCATCCGGAT
TTTGCGGGCATCTCTGTTGAGCATTTAAGTAGCTATGGCGATCTCGATCCGCAGGAAATGCCGCTGCTGATCCTTAACGG
CTTTGAATATGTGACTCAGGTGGGTAACGATCCTTATGCAATCCCGCTGCGTGCAGATACCAGCAAACCGAAGCTGACTC
AGCAGGATGTGACCGATCTGATCGCCTATCTGAACAAAGGTGGATCGGTGCTGATCATGGAAAACGTGATGAGCAATCTT
AAGGAAGAGAGCGCGTCTGGTTTTGTGCGTCTGTTGGATGCCGCAGGTCTGTCGATGGCACTGAACAAGTCGGTAGTAAA
TAACGATCCGCAAGGGTATCCGAACCGCGTTCGTCAACGACGTTCAACGCCAATTTGGGTCTATGAGCGTTATCCGGCTG
TCGATGGTAAACCACCGTATACCATTGATGACACCACGAAAGAAGTTATCTGGAAATATCAGCAAGAAAACAAACCTGAT
GACAAACCGAAGCTGGAAGTTGCCAGCTGGCAGGAAGAAGTTGAGGGTAAACAGGTAACCCAATTCGCCTTTATTGATGA
AGCCGACCACAAAACGCCTGAGTCACTGGCTGCGGCAAAACAGAGAATTCTGGACGCGTTCCCAGGGCTGGAAGTGTGTA
AGGATTCTGACTACCACTATGAGGTCAACTGTCTGGAATACCGCCCAGGCACGGATGTGCCGGTAACCGGTGGCATGTAT
GTTCCGCAGTATACGCAGCTGGATCTGAGTGCCGACACCGCGAAAGCGATGCTGCAGGCTGCGGATTTAGGCACCAATAT
TCAGCGCCTGTATCAGCATGAGCTTTATTTCCGTACCAATGGCCGCCAGGGTGAGCGTCTCAACAGCGTTGATCTGGAAC
GTTTATACCAAAACATGTCCGTCTGGCTGTGGAACGAGACGAAATATCGTTATGAAGAGGGTAAAGAAGACGAGCTGGGC
TTTAAAACGTTCACTGAGTTTCTGAACTGCTACACCAACAATGCATACGTTGGCACGCAGTGTTCTGCTGAGCTGAAAAA
ATCGCTGATCGATAACAAGATGATTTATGGTGAAGAAAGCAGCAAAGCGGGCATGATGAACCCGAGCTACCCGCTCAACT
ATATGGAAAAACCGCTGACACGCCTGATGCTGGGCCGTTCCTGGTGGGATCTGAACATCAAAGTTGATGTTGAGAAGTAT
CCGGGAGTGGTGAATACAAACGGCGAAACCGTCACACAAAACATTAACTTGTACTCAGCTCCAACCAAATGGTTTGCAGG
TAACATGCAGTCAACTGGCCTGTGGGCACCTGCCCAGCAGGAAGTCAGCATTGAGTCAAAGGCGACAGTTCCTGTGACCG
TGACTGTTGCGCTGGCCGACGACCTGACAGGACGAGAGAAGCATGAAGTTAGCCTGAATCGTCCACCCAGAGTGACAAAA
ACCTATGACCTGAAAGCCAATGATAAGGTGACGTTCAAAGTCCCTTACGGTGGTCTGATTTACATCAAGGGCGACAGCAA
AGAGGTGCAATCAGCTGACTTCACCTTTACCGGTGTAGTAAAAGCGCCGTTCTATAAAGACGGTAAGTGGCAACACGATC
TGAACTCCCCTGCCCCGCTGGGCGAACTGGAGTCTGCCTCGTTCGTCTATACCACACCGAAGAAGAACCTGAATGCCAGC
AATTACACTGGCGGACTGGAGCAATTCGCTAACGATCTGGATACCTTTGCCAGCTCGATGAATGACTTCTACGGCCGTGA
TAGCGAAGACGGTAAGCACCGGATGTTTACCTATAAAAACTTGCCGGGCCACAAACATCGTTTCGCCAACGATGTGCAGA
TCTCCATCGGTGATGCGCATTCGGGTTATCCGGTAATGAACAGCAGCTTCTCGCCGAACAGCACCACGCTGCCGACGACG
CCGCTGAACGACTGGCTGATCTGGCATGAAGTCGGTCATAACGCCGCAGAAACGCCGTTGACTGTACCGGGTGCAACTGA
AGTCGCTAACAACGTGCTGGCGCTGTACATGCAGGATCGTTATCTCGGCAAGATGAACCGTGTCGCTGACGATATTACCG
TCGCACCGGAATATCTGGAGGAAAGCAACGGTCAGGCATGGGCGCGCGGCGGTGCGGGTGACCGTCTGCTGATGTACGCA
CAGCTGAAGGAATGGGCAGAGAAAAACTTTGATATCAAGAAATGGTATCCAGATGGCACTCCTCTGCCAGAGTTTTACAG
CGAGCGTGAAGGGATGAAAGGCTGGAACCTGTTCCAGTTGATGCATCGTAAAGCACGCGGCGATGAGGTCAGCAATGACA
AGTTTGGCGGCAAGAATTACTGTGCTGAATCCAACGGTAACGCAGCGGACACGCTGATGCTGTGTGCCTCCTGGGTCGCC
CAGACGGATCTTTCGGAGTTCTTTAAGAAATGGAATCCGGGTGCGAATGCTTACCAGCTGCCGGGGGCGAGCGAGATGAG
CTTCGAGGGCGGTGTGAGCCAGTCGGCGTACAACACGCTGGCGTCACTCAAGCTGCCGAAACCGGAACAGGGACCGGAAA
CCATTAACAAGGTTACCGAGCATAAGATGTCTGTCGAGTAA

Protein sequence :
MSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPTPDPEPTPEPEPEPVPTKTGYLT
LGGSLRVTGDITCNDESSDGFTFTPGDKVTCVAGNNTTIATFDTQSEAARSLRAVEKVSFSLEDAQELAGSDNKKSNALS
LVTSMNSCPANTEQVCLEFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPVVPATTPGTKPDLN
ASFVSANAEQFYQYQPTEIILSEGRLVDSQGDGVVGVNYYTNSGRGVTGENGEFSFSWGETISFGIDTFELGSVRGNKST
IALTELGDEVRGANIDQLIHRYSKAGQNHTRVVPDEVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFK
TGQAKEIDTAICAKTDGCNEARWFSLTTRNVNDGKIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYGSTGNARGQAVVNI
SNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSIVRPENVTRETATFNLPFISLGQVGDGKLMVIGNPHYNSIL
RCPNGYSWNGGVNKDGQCTLNSDPDDMKNFMENVLRYLSDDKWTPDAKASMTVGTNLDTVYFKRHGQVTGNSAAFDFHPD
FAGISVEHLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNKGGSVLIMENVMSNL
KEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQRRSTPIWVYERYPAVDGKPPYTIDDTTKEVIWKYQQENKPD
DKPKLEVASWQEEVEGKQVTQFAFIDEADHKTPESLAAAKQRILDAFPGLEVCKDSDYHYEVNCLEYRPGTDVPVTGGMY
VPQYTQLDLSADTAKAMLQAADLGTNIQRLYQHELYFRTNGRQGERLNSVDLERLYQNMSVWLWNETKYRYEEGKEDELG
FKTFTEFLNCYTNNAYVGTQCSAELKKSLIDNKMIYGEESSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKY
PGVVNTNGETVTQNINLYSAPTKWFAGNMQSTGLWAPAQQEVSIESKATVPVTVTVALADDLTGREKHEVSLNRPPRVTK
TYDLKANDKVTFKVPYGGLIYIKGDSKEVQSADFTFTGVVKAPFYKDGKWQHDLNSPAPLGELESASFVYTTPKKNLNAS
NYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFANDVQISIGDAHSGYPVMNSSFSPNSTTLPTT
PLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNGQAWARGGAGDRLLMYA
QLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVA
QTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLKLPKPEQGPETINKVTEHKMSVE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed CAE85238.1 hypothetical protein Not tested PAI V 536 Protein 0.0 88
VC0395_A0370 YP_001216326.1 lipoprotein Not tested VPI-1 Protein 0.0 50
acfD AAK20802.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
VC0845 NP_230493.1 hypothetical protein Not tested VPI-1 Protein 0.0 49
acfD ACK75655.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75652.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75649.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75646.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75664.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75670.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75667.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75658.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75661.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
EcE24377A_3432 YP_001464429.1 hypothetical protein VFG0106 Protein 0.0 49