Gene Information

Name : EcHS_A3142 (EcHS_A3142)
Accession : YP_001459760.1
Strain : Escherichia coli HS
Genome accession: NC_009800
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG0810
EC number : -
Position : 3151413 - 3155924 bp
Length : 4512 bp
Strand : -
Note : identified by similarity to GB:BAC62719.1

DNA sequence :
TTGAGCGCAACCCTGTTAGCCGGTTGTGATGGCGGTGGTTCCGGATCTTCCTCCGATACGCCGCCTGTAGATTCTGGAAC
AGGATCTTTGCCGGAAGTGAAACCTGATCCAACACCAAACCCGGAGCCGACGCCTGAGCCAACGCCGGACCCAGAGCCTA
CGCCAGAACCGATACCTGATCCTGAACCAACACCAGAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACC
CTGGGCGGAAGCCAGCGGGTAACTGGTGCTACCTGTAATGGTGAATCCAGCGATGGCTTTACATTTAAACCTGGCGAGGA
CGTTACTTGCGTGGCGGGTAACACGACAATTGCCACCTTCAACACTCAGTCAGAAGCTGCGCGTAGCTTGCGTGCGGTTG
AAAAAGTGTCGTTTAGCCTTGAGGACGCGCAAGAACTGGCGGGCTCCGATGACAAGAAAAGCAATGCGGTTTCGCTGGTA
ACGTCCAGTAACAGCTGTCCGGCGAATACAGAACAGGTTTGTCTGACGTTCTCCTCGGTGATCGAGAGTAAACGCTTCGA
CTCGCTGTATAAGCAAATCGATCTGGCACCGGAAGAGTTCAAAAAGCTGGTCAATGAAGAGGTGGAAAACAATGCTGCGA
CCGATAAAGCGCCATCCACTCATACTTCACCGGTCGTGCCCGTCACCACGCCGGGAACAAAACCGGATCTGAACGCTTCC
TTCGTGTCGGCTAACGCGGAACAGTTTTATCAGTATCAACCCACTGAAATCATTCTCTCTGAAGGTCGACTGGTCGATAG
CCAGGGATATGGTGTTGCTGGCGTCAACTACTACACCAATTCAGGCCGTGGCGTGACAGGGGAAAATGGTGAATTTTCCT
TTAGCTGGGGCGAAACCATCTCCTTTGGTATCGATACCTTTGAACTGGGTTCAGTGCGCGGCAATAAGTCGACCATTGCG
CTGACTGAACTGGGTGATGAAGTTCGCGGGGCGAATATTGATCAGCTTATTCATCGCTATTCGACGACCGGGCAAAATAA
TACCCGTGTTGTTCCGGACGATGTACGCAAGGTCTTTGCCGAATATCCCAACGTGATCAACGAGATTATCAATCTCTCGT
TATCCAACGGTGCGACGCTGGGGGAAGGTGAGCAAGTCGTTAATCTGCCTAACGAATTTATTGAGCAGTTTAATACGGGT
CAGGCCAAAGAGATCGATACCGCGATTTGTGCGAAAACCGATGGTTGTAACGAGGCTCGCTGGTTCTCGCTGACGACGCG
CAATGTTAATGACGGCCAGATTCAGGGCGTTATCAACAAGCTGTGGGGCGTGGATACGAACTACAAATCTGTCAGCAAGT
TCCATGTATTCCATGACTCCACCAACTTCTATGGCAGCACGGGTAATGCGCGCGGTCAGGCGGTGGTGAATATCTCCAAC
GCGGCCTTCCCGATTCTGATGGCGCGTAATGATAAAAACTACTGGCTGGCCTTCGGCGAGAAACGGGCCTGGGATAAAAA
TGAGCTGGCGTACATTACTGAAGCGCCTTCCATTGTGCGACCAGAGAACGTGACACGCGAAACCGCCACCTTCAACCTGC
CGTTTATTTCGCTGGGGCAAGTGGGCGATGGCAAGCTGATGGTTATCGGTAACCCACACTACAACAGCATCCTGCGTTGC
CCGAACGGTTACAGCTGGAACGGGGGCGTTAATAAAGATGGGCAGTGTACGCTCAACAGCGACCCGGATGACATGAAGAA
CTTCATGGAGAACGTGCTGCGCTATCTGTCAAATGATCGCTGGTTGCCGGATGCAAAATCCAATATGACCGTGGGTACTA
ACCTGGACACGGTGTATTTCAAAAAACACGGGCAGGTTACAGGAAATAGTGCTGCGTTCGGCTTTCATCCGGATTTTGCG
GGTATCTCTGTTGAGCATTTAAGTAGCTATGGCGATCTCGACCCGCAGGAAATGCCGCTGCTGATCCTCAACGGCTTTGA
GTATGTGACTCAGGTTGGTAACGATCCTTATGCAATCCCGCTGCGTGCAGATACCAGCAAACCGAAGCTGACCCAGCAGG
ATGTGACCGATTTGATCGCCTATATGAACAAAGGTGGATCGGTGCTGATCATGGAAAACGTGATGAGCAATCTTAAGGAA
GAGAGCGCATCTGGCTTTGTACGTCTGCTTGATGCCGCAGGTTTGTCGATGGCGCTTAACAAGTCGGTAGTAAATAACGA
TCCGCAAGGCTACCCGGACCGCGTTCGTCAACGACGTTCAACGCCAATTTGGGTCTATGAGCGTTATCCGGCTGTCGATG
GTAAACCACCGTATACCATTGATGACACCACGAAAGAAGTTATCTGGAAATATCAGCAAGAAAACAAACCTGATGACAAA
CCGAAGCTGGAAGTTGCCAGCTGGCAGGAAGAAGTTGAGGGTAAACAGGTAACTCAATTCGCCTTTATCGATGAAGCCGA
CCACAAAACGCCTGAGTCACTGGCTGCGGCGAAGAAGAGAATTCTGGACGCGTTCCCAGGGCTGGAAGAGTGTAAGGATT
CTGACTACCACTATGAGGTCAACTGTCTGGAATATCGTCCTGGCACGGGGGTTCCGGTTACTGGTGGCATGTATGTTCCA
CAGTATACGCAACTAAGCCTTAACGCCGACACTGCGAAAGCGATGGTGCAGGCTGCGGATTTAGGCACCAACATTCAGCG
TCTGTATCAGCATGAGCTTTACTTCCGTACCAATGGTCGCAAAGGTGAGCGTCTGAGCAGCGTCGATCTGGAACGTCTGT
ACCAGAACATGTCGGTCTGGCTGTGGAATAAAATTGAATATCGCTATGAAAACGACAAGGATGACGAGCTGGGCTTTAAA
ACGTTCACCGAGTTCCTGAACTGTTACGCCAACAATGCTTATGATGGTGGCACGCAGTGCTCCGCAGAGCTGAAACAATC
GCTGATCGATAACAAGATGATCTACGGTGAAGGCAGCAAAGCGGGCATGATGAACCCGAGCTACCCGCTTAACTATATGG
AAAAACCGCTGACGCGCCTGATGCTGGGGCGTTCCTGGTGGGATCTGAACATCAAGGTTGATGTCGAGAAGTATCCGGGG
GCGGTATCGGCTGAAGGTGAGGAGGTTACTGAAACCATCAACCTGTACTCGAATCCGACCAAATGGTTTGCGGGTAACAT
GCAGTCTACTGGCCTGTGGGCTCCGGCTCAGCAGGAAGTCAGCATTAAGTCCAATGCGAAAGTCCCTGTGACTGTTACCG
TGGCGCTGGCTGACGACCTGACCGGGCGTGAGAAGCATGAGGTTGCGCTGAACCGTCCGCCAAGAGTGACTAAAACATAC
TCTCTGGATGCTAGCGGCACGGTGAAGTTCAAGGTTCCTTACGGTGGTCTGATTTATATCAAGAGCGACAGTAAAGAGGA
GAAATCAGCCAACTTCACCTTTACTGGCGTGGTAAAAGCGCCGTTCTATAAAGACGGTAAATGGAAAAACGACCTGAAAT
CCCCTGCGCCGTTGGGTGAGCTGGAGTCTGCGTCGTTCGTCTATACCACGCCGAAGAAGAACCTTGAGGCCAGCAATTAC
AAGGGCGGTCTGAAACAATTCGCTGAGGATCTGGATACCTTTGCCAGCTCGATGAATGACTTCTACGGTCGTGATGGCGA
AAGCGGTAAGCACCGGATGTTTACCTATGAAGCATTGACGGGGCACAAACATCGTTTCACCAACGATGTGCAGATCTCCA
TCGGTGATGCGCACTCTGGTTATCCGGTGATGAACAGCAGCTTCTCGCCGAACAGCACCACGCTGCCGACGACGCCGCTG
AACGACTGGCTGATCTGGCACGAAGTAGGGCACAACGCTGCAGAAACGCCGCTGACTGTACCGGGCGCAACTGAAGTGGC
GAACAACGTGCTGGCGCTGTACATGCAGGATCGTTATCTCGGCAAGATGAACCGTGTCGCTGACGATATTACCGTTGCGC
CGGAATATCTGGAGGAGAGCAACGGTCAGGCATGGGCGCGTGGCGGTGCGGGTGACCGTCTGCTGATGTACGCGCAGCTG
AAGGAATGGGCAGAGAAAAACTTTGATATCAAACAGTGGTATCCAGAAGGCTCTCTGCCAGCGTTCTACAGCGAGCGTGA
AGGGATGAAAGGCTGGAACCTGTTCCAGTTGATGCACCGTAAAGCACGCGGCGATGATGTTGGCAATGACAAATTTGGCA
ACAGAAACTACTGTGCCGAATCCAACGGTAACGCTGCCGACACGCTGATGCTGTGTGCATCCTGGGTCGCTCAGACGGAC
CTTTCCGCATTCTTTAAGAAATGGAATCCGGGCGCGAATGCTTACCAGTTGCCGGGAGCGACAGAGATGAGCTTCGAGGG
CGGTGTGAGCCAGTCGGCTTACAACACGCTCGCGTCACTCGATCTGCCGAAACCGAAGCAAGGGCCGGAAACCATTAACA
AGGTTACCGAGTATTCGATGCCTGCTGAATAA

Protein sequence :
MSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTPEPEPEPVPTKTGYLT
LGGSQRVTGATCNGESSDGFTFKPGEDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQELAGSDDKKSNAVSLV
TSSNSCPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPVVPVTTPGTKPDLNAS
FVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGVTGENGEFSFSWGETISFGIDTFELGSVRGNKSTIA
LTELGDEVRGANIDQLIHRYSTTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFNTG
QAKEIDTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYGSTGNARGQAVVNISN
AAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSIVRPENVTRETATFNLPFISLGQVGDGKLMVIGNPHYNSILRC
PNGYSWNGGVNKDGQCTLNSDPDDMKNFMENVLRYLSNDRWLPDAKSNMTVGTNLDTVYFKKHGQVTGNSAAFGFHPDFA
GISVEHLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYMNKGGSVLIMENVMSNLKE
ESASGFVRLLDAAGLSMALNKSVVNNDPQGYPDRVRQRRSTPIWVYERYPAVDGKPPYTIDDTTKEVIWKYQQENKPDDK
PKLEVASWQEEVEGKQVTQFAFIDEADHKTPESLAAAKKRILDAFPGLEECKDSDYHYEVNCLEYRPGTGVPVTGGMYVP
QYTQLSLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNKIEYRYENDKDDELGFK
TFTEFLNCYANNAYDGGTQCSAELKQSLIDNKMIYGEGSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPG
AVSAEGEEVTETINLYSNPTKWFAGNMQSTGLWAPAQQEVSIKSNAKVPVTVTVALADDLTGREKHEVALNRPPRVTKTY
SLDASGTVKFKVPYGGLIYIKSDSKEEKSANFTFTGVVKAPFYKDGKWKNDLKSPAPLGELESASFVYTTPKKNLEASNY
KGGLKQFAEDLDTFASSMNDFYGRDGESGKHRMFTYEALTGHKHRFTNDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPL
NDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNGQAWARGGAGDRLLMYAQL
KEWAEKNFDIKQWYPEGSLPAFYSEREGMKGWNLFQLMHRKARGDDVGNDKFGNRNYCAESNGNAADTLMLCASWVAQTD
LSAFFKKWNPGANAYQLPGATEMSFEGGVSQSAYNTLASLDLPKPKQGPETINKVTEYSMPAE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed CAE85238.1 hypothetical protein Not tested PAI V 536 Protein 0.0 90
VC0395_A0370 YP_001216326.1 lipoprotein Not tested VPI-1 Protein 0.0 49
VC0845 NP_230493.1 hypothetical protein Not tested VPI-1 Protein 0.0 49
acfD AAK20802.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75667.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75646.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75664.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75670.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75649.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75655.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75652.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75661.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75658.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
EcHS_A3142 YP_001459760.1 hypothetical protein VFG0106 Protein 0.0 49