Gene Information

Name : LY180_15365 (LY180_15365)
Accession : YP_008566137.1
Strain : Escherichia coli LY180
Genome accession: NC_022364
Putative virulence/resistance : Virulence
Product : Accessory colonization factor AcfD
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3229811 - 3234241 bp
Length : 4431 bp
Strand : -
Note : Derived by automated computational analysis using gene prediction method: GeneMarkS+.

DNA sequence :
TTGCCGGAAGTGAAACCTGATCCAACACCAAACCCGGAGCCGACGCCTGAGCCAACGCCGGACCCAGAACCTACGCCGGA
ACCGACACCTGATCCTGAGCCAACACCAGAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACTCTGGGCG
GAAGCCTGCGGGTAACTGGTGATATCACCTGTAATGATGAATCCAGCGATGGCTTTACCTTTACACCAGGCGACAAAGTC
ACCTGTGTGGCAGGGAACAACACGACAATTGCTACCTTCGACACCCAGTCAGAAGCTGCGCGTAGCCTGCGTGCGGTTGA
AAAAGTGTCGTTTAGTCTTGAGGACGCGCAAGAACTGGCGGGTTCCGACAACAAGAAAAGCAATGCGCTCTCGCTGGTCA
CCTCCATGAACAGTTGCCCGGCGAATACAGAACAGGTGTGCCTGGAGTTCTCCTCGGTGATCGAGAGTAAACGTTTCGAC
TCGCTGTATAAGCAAATCGATCTGGCACCGGAAGAATTCAAAAAGCTGGTCAATGAAGAGGTGGAAAACAATGCCGCGAC
CGATAAAGCGCCATCCACTCATACTTCACCGGTCGTGCCCGCCACCACTCCGGGAACAAAACCGGATCTAAACGCTTCCT
TCGTGTCGGCTAACGCGGAACAGTTTTATCAGTATCAACCCACTGAAATCATTCTCTCTGAAGGTCGACTGGTCGATAGT
CAGGGGGATGGTGTTGTTGGTGTCAACTATTACACCAATTCCGGCCGTGGTGTAACCGGAGAAAACGGGGAATTTTCCTT
TAGTTGGGGGGAAACCATCTCCTTTGGCATCGACACTTTTGAGCTTGGTTCTGTGCGTGGTAACAAGTCGACTATTGCAT
TGACTGAACTGGGTGATGAAGTTCGCGGGGCAAATATCGATCAGTTGATTCACCGCTATTCGAAGGCTGGACAAAATCAC
ACGCGTGTAGTTCCGGATGAAGTGCGCAAGGTTTTTGCTGAATATCCCAACGTGATTAACGAGATTATCAATCTCTCGTT
ATCCAATGGTGCGACGCTGGGGGAAGGTGAGCAAGTCGTTAATCTGCCTAACGAATTTATCGAGCAGTTTAAGACGGGTC
AGGCCAAAGAGATCGATACCGCGATTTGTGCGAAAACCGATGGTTGTAACGAGGCTCGCTGGTTCTCGCTGACAACGCGC
AATGTTAATGACGGCCAGATTCAGGGCGTTATCAACAAGCTGTGGGGCGTGGATACGAACTACAAATCTGTCAGTAAGTT
CCATGTATTCCATGACTCTACCAACTTCTATGGCAGCACGGGTAATGCGCGCGGTCAGGCGGTGGTGAATATCTCCAACG
CGGCCTTCCCGATTCTGATGGCGCGTAATGATAAAAACTACTGGCTGGCGTTTGGCGAAAAACGCGCCTGGGATAAAAAT
GAGCTGGCGTACATTACTGAAGCGCCTTCCATTGTGCGACCAGAGAACGTGACACGCGAAACCGCCACCTTCAACCTGCC
GTTTATCTCGCTGGGGCAAGTGGGCGATGGCAAGCTGATGGTTATCGGTAACCCACACTACAACAGCATCCTGCGTTGCC
CGAACGGCTACAGCTGGAACGGGGGCGTTAATAAAGACGGACAGTGTACGCTCAACAGCGACCCGGATGACATGAAGAAC
TTCATGGAGAACGTGCTGCGCTATCTGTCAAATGATCGCTGGTTGCCGGATGCAAAATCCAGTATGACCGTGGGTACTAA
CCTGGACACGGTATATTTCAAAAAACATGGTCAGGTGCTGGGAAATAGCGCACCGTTTGCGTTCCACAAGGATTTCACTG
GCATCACGGTCAAACCAATGACCAGCTATGGCAATCTGAATCCAGATGAAGTTCCTCTGTTGATCCTCAATGGCTTTGAA
TACGTCACACAATGGGGTAGCGATCCTTACTCCATTCCTCTGCGCGCAGATACCAGCAAACCGAAGCTGACCCAGCAGGA
TGTGACCGATTTGATCGCCTATATGAACAAAGGTGGATCGGTGCTGATCATGGAAAACGTGATGAGCAATCTTAAGGAAG
AGAGCGCATCTGGCTTTGTACGTCTGCTTGATGCCGCAGGTTTGTCGATGGCGCTTAACAAGTCGGTAGTAAATAACGAT
CCGCAAGGCTACCCTGACCGTGTTCGTCAACGACGTTCAACGCCAATTTGGGTCTATGAGCGTTATCCGGCTGTCGATGG
TAAACCACCGTATACCATTGATGACACCACGAAAGAAGTTATCTGGAAATATCAGCAAGAAAACAAACCTGATGACAAAC
CGAAGCTGGAAGTTGCCAGCTGGCAGGAAGAAGTTGAGGGTAAACAGGTAACCCAATTCGCCTTTATTGATGAAGCCGAC
CACAAAACGCCTGAGTCACTGGCTGCGGCAAAACAGAGAATTCTGGACGCGTTCCCAGGGCTGGAAGTGTGTAAGGATTC
TGACTATCACTATGAGGTCAACTGTCTGGAATACCGCCCAGGCACGGGTGTGCCGGTAACCGGTGGCATGTATGTTCCGC
AGTATACGCAGCTGGATCTTGGAGCTGACACTGCGAAAGCGATGCTGCAGGCTGCGGATTTAGGCACCAATATTCAGCGC
CTGTATCAGCATGAGCTTTATTTCCGTACCAATGGCCGCCAGGGTGAGCGTCTCAACAGCGTCGATCTGGAACGTTTATA
CCAAAACATGTCCGTCTGGCTGTGGAACGAGACGAAATATCGTTATGAAGAGGGTAAAGAAGACGAGCTGGGCTTTAAAA
CGTTCACTGAGTTTCTGAACTGCTACACCAACAATGCATACGTTGGCACGCAGTGTTCCGCTGAGCTGAAAAAATCGCTG
ATCGATAACAAGATGATTTACGGTGAAGAAAGCAGCAAAGCGGGCATGATGAACCCGAGCTACCCGCTCAACTATATGGA
AAAACCGCTGACACGCCTGATGCTGGGCCGTTCCTGGTGGGATCTGAACATCAAAGTTGATGTTGAGAAGTATCCGGGAG
CAGTATCGGAAGAGGGACAGAACGTTACTGAAACCATCAGCCTGTACTCGAATCCGACCAAATGGTTTGCAGGTAACATG
CAGTCAACTGGCCTGTGGGCACCGGCTCAGAAAGAGGTCACCATTAAGTCCAATGCGAACGTTCCTGTGACCGTCACCGT
GGCGCTGGCTGACGACCTGACCGGAAGAGAGAAGCATGAAGTCGCGCTGAACCGTCCGCCAAGAGTAACCAAAACGTACT
CTCTGGACGCTAGTGGTACGGTGAAATTCAAGGTGCCTTACGGTGGTCTGATTTATATCAAAGGCGACAGCAAAGACAAT
GAATCTGCCAGCTTCACCTTTACTGGTGTGGTAAAAGCACCGTTCTATAAAGACGGCGCATGGAAAAACGATCTGAACTC
ACCGGCTCCGCTGGGTGAGCTGGAATCAGACGCTTTCGTTTATACCGCACCGAAGAAGAACCTGAATGCCAGCAATTACA
CTGGCGGACTGAAGCAATTCGCTAACGATCTGGATACCTTTGCCAGCTCGATGAATGATTTCTACGGTCGTAATGAGGAA
GACGGTAAGCACCGGATGTTTACCTATAAAAACTTGACGGGCCACAAGCATCGTTTCGCCAACGATGTGCAGATCTCCAT
CGGTGATGCGCATTCGGGTTATCCGGTAATGAACAGCAGCTTCTCGACGAACAGCACCACGCTGCCGACGACGCCGCTGA
ACGACTGGCTGATCTGGCACGAAGTAGGGCACAACGCCGCAGAAACGCCGCTGACTGTTCCAGGCGCGACCGAAGTGGCG
AACAACGTGCTGGCGCTGTACATGCAGGATCGTTATCTCGGCAAGATGAACCGTGTCGCTGACGATATTACCGTCGCACC
GGAATATCTGGAGGAGAGCAACGGTCAGGCATGGGCGCGCGGCGGTGCGGGTGACCGTCTGCTGATGTACGCGCAGCTGA
AGGAATGGGCAGAGAAAAACTTTGATATCAAGAAATGGTATCCAGAAGGTGAATTGCCGAAGTTCTTCAGCGATCGTGAA
GGGATGAAAGGCTGGAACCTGTTCCAGTTGATGCACCGTAAAGCGCGCGGCGATGACGTTGGTGACAAGACCTTTGGTGG
CAAGAATTACTGTGCTGAATCCAACGGTAACGCTGCGGACACGCTGATGCTGTGTGCCTCCTGGGTCGCCCAGACGGATC
TTTCGGAGTTCTTTAAGAAATGGAATCCGGGTGCGAATGCTTACCAGCTGCCGGGGGCGAGCGAGATGAGCTTCGAGGGC
GGTGTGAGCCAGTCGGCGTACAACACGCTGGCGTCACTCAAGCTGCCGAAACCGGAACAGGGACCGGAAACCATTAACAA
GGTTACCGAGCATAAGATGTCTGTCGAGTAA

Protein sequence :
MPEVKPDPTPNPEPTPEPTPDPEPTPEPTPDPEPTPEPEPEPVPTKTGYLTLGGSLRVTGDITCNDESSDGFTFTPGDKV
TCVAGNNTTIATFDTQSEAARSLRAVEKVSFSLEDAQELAGSDNKKSNALSLVTSMNSCPANTEQVCLEFSSVIESKRFD
SLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPVVPATTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDS
QGDGVVGVNYYTNSGRGVTGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSKAGQNH
TRVVPDEVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFKTGQAKEIDTAICAKTDGCNEARWFSLTTR
NVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKN
ELAYITEAPSIVRPENVTRETATFNLPFISLGQVGDGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDPDDMKN
FMENVLRYLSNDRWLPDAKSSMTVGTNLDTVYFKKHGQVLGNSAPFAFHKDFTGITVKPMTSYGNLNPDEVPLLILNGFE
YVTQWGSDPYSIPLRADTSKPKLTQQDVTDLIAYMNKGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNND
PQGYPDRVRQRRSTPIWVYERYPAVDGKPPYTIDDTTKEVIWKYQQENKPDDKPKLEVASWQEEVEGKQVTQFAFIDEAD
HKTPESLAAAKQRILDAFPGLEVCKDSDYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLDLGADTAKAMLQAADLGTNIQR
LYQHELYFRTNGRQGERLNSVDLERLYQNMSVWLWNETKYRYEEGKEDELGFKTFTEFLNCYTNNAYVGTQCSAELKKSL
IDNKMIYGEESSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNM
QSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGDSKDN
ESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESDAFVYTAPKKNLNASNYTGGLKQFANDLDTFASSMNDFYGRNEE
DGKHRMFTYKNLTGHKHRFANDVQISIGDAHSGYPVMNSSFSTNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVA
NNVLALYMQDRYLGKMNRVADDITVAPEYLEESNGQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPEGELPKFFSDRE
GMKGWNLFQLMHRKARGDDVGDKTFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEG
GVSQSAYNTLASLKLPKPEQGPETINKVTEHKMSVE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed CAE85238.1 hypothetical protein Not tested PAI V 536 Protein 0.0 88
VC0395_A0370 YP_001216326.1 lipoprotein Not tested VPI-1 Protein 0.0 49
VC0845 NP_230493.1 hypothetical protein Not tested VPI-1 Protein 0.0 49
acfD AAK20802.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75646.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75664.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75670.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75661.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75658.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75667.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75652.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75649.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75655.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
LY180_15365 YP_008566137.1 Accessory colonization factor AcfD VFG0106 Protein 0.0 49