Gene Information

Name : EC55989_1588 (EC55989_1588)
Accession : YP_002402664.1
Strain : Escherichia coli 55989
Genome accession: NC_011748
Putative virulence/resistance : Unknown
Product : VgrG protein, Encoded within repeats that are hotspots for chromosomal duplication formation; Function of protein is unknown
Function : -
COG functional category : S : Function unknown
COG ID : COG3501
EC number : -
Position : 1637768 - 1639912 bp
Length : 2145 bp
Strand : +
Note : Evidence 4 : Homologs of previously reported genes of unknown function

DNA sequence :
ATGTGCGGATTATTATTCATTTTCACGGAGGTTGCTATGTCAACCGGATTACGTTTCACACTGGAAGTGGACGGCCTGCC
GCCGGATGCTTTTGCGGTGGTTTCCTTTCATCTGAACCAGTCACTCTCTTCGCTCTTTTCCCTCGATCTCTCCCTGGTCA
GCCAGCAGTTTCTCTCCCTTGAATTTGCGCAGGTGCTGGACAAAATGGCCTACCTGACGATATGGCAGGGCGATGAAGTA
CAGCGCCGGGTGAAAGGCGTGGTGACCTGGTTTGAACTCGGGGAGAACGACAAAAACCAGATGCTGTACAGCATGAAGGT
GCACCCGCCGCTGTGGCGTGCCGGTCTGCGCCAGAACTTCCGTATCTTCCAGAACGAGGACATCAAAAGCATCCTCGGCA
CGATGTTGCAGGAAAACGGGGTGACCGAATGGAGTCCGCTGTTCAGCGAGCCGCATCCTTCCCGTGAGTTTTGTGTCCAG
TACGGTGAGACTGATTACGATTTCCTGTGCCGGATGGCGGCGGAGGAAGGCATCTTCTTTTATGAGGAGCATGCTTACAA
AAGTACCGACCAGAGCCTGGTGCTGTGCGACACAGTCCGCCATCTGCCCGAATCTTTTGAAATCCCATGGAACCCGAACA
CCCGTACCGAGGTGAGCACCCTCTGCATCAGCCAGTTCCGCTACAGCGCACAAATCCGCCCTTCTTCCGTGGTGACCAAA
GACTACACCTTTAAACGCCCCGGCTGGGCCGGACGTTTTGATCAGGAAGGCCAGCACCAGGATTACCAGCGCACACAGTA
TGAAGTGTATGACTACCCCGGACGTTTCAAGGGGGCCCACGGGCAGAACTTTGCCCGCTGGCAGATGGATGGCTGGCGAA
ACAACGCCGAAGTGGCGCGCGGAACAAGCCGTTCTCCGGAGATATGGCCGGGACGGCGAATTGCGCTGACGGGGCATCCG
CAGGCGAACCTGAACCGGGAATGGCAGGTGGTGGCGAGCGATCTTCACGGCGAACAGCCGCAGGCGGTACCGGGACGCAG
TGGTTCAGGCACCACGCTGAATAATCACTTTGCGGTGATCCCGGCAGACAGAACATGGCGACCACAGCCGTTGCTGAAAC
CGCTGGTGGACGGCCCGCAGAGCGCTGTCGTGACGGGACCGGCAGGCGAGGAAATCTTCTGTGATGAACATGGCCGCGTG
CGGGTGAAATTTAACTGGGACCGTTATAACCCGTCAAACCAGGACAGTTCATGCTGGAGCCGTGTGGCACAGGCGTGGGC
AGGCACCGGATTCGGTAACCTGGCGATACCGCGTGTGGGTCAGGAGGTGATTGTGGACTTCCTCAACGGCGATCCGGACC
AGCCGATCATTTTGGGGCGCACCTACCACCAGGAAAACCGCACACCCGGCAGCCTGCCGGGAACAAAGACGCAGATGACC
ATTCGTTCGAAAACCTATAAGGGCAGCGGGTTTAATGAACTGAAGTTTGACGATGCGACAGGGAAAGAACAGGTCTACAA
CCACGCGCAGAAGAACATGAACACCGAGGTGCTGAATAACCGCACCACTGATGTGATAAACAACCATGCTGAAACCATTG
GCAACAATCAGATGATTGCGGTTACCAACAATCAGATACAGACGGTGGGTGTTAACCAGATAGAGACGGTGGGCAGTAAC
CAGATAATTAAAGTGGGTTCTGTTCAGGTGGAAACGATTGGACTTGTTCGTGCGCTGACCGTGGGCGTGGCGTATCAGAC
GACGGTAGGTGGCATTATGAACACCTCGGTGGCACTGATGCAGTCCTCGCAGATGGGTTTGCATAAATCGTTGAGGGTCG
GGCTGGGTTATGACGTCAAAGTCGGAAATAACGTTACCTTCACCGTTGGTAAAACGAAAAAGGATGATACCGGGCAGACC
GCGATTTACTCCGCCGGTGAGCATCTGGAGCTCTGCTGTGGTAAGGCAAGGCTGGTGCTGACGAAGGACGGACAAATTTT
TCTCAACGGCACAAAAATTCATTTGCAGGGTAAGGAGCAGGTTAATGGTGACTCACTGTTGATTAACTGGAACTGTGCAG
CCTCGAAATCCCCACCGAAGACCCCTGATGAAAAGCAGGATACGCCGGATATGAGAGAGTACTGA

Protein sequence :
MCGLLFIFTEVAMSTGLRFTLEVDGLPPDAFAVVSFHLNQSLSSLFSLDLSLVSQQFLSLEFAQVLDKMAYLTIWQGDEV
QRRVKGVVTWFELGENDKNQMLYSMKVHPPLWRAGLRQNFRIFQNEDIKSILGTMLQENGVTEWSPLFSEPHPSREFCVQ
YGETDYDFLCRMAAEEGIFFYEEHAYKSTDQSLVLCDTVRHLPESFEIPWNPNTRTEVSTLCISQFRYSAQIRPSSVVTK
DYTFKRPGWAGRFDQEGQHQDYQRTQYEVYDYPGRFKGAHGQNFARWQMDGWRNNAEVARGTSRSPEIWPGRRIALTGHP
QANLNREWQVVASDLHGEQPQAVPGRSGSGTTLNNHFAVIPADRTWRPQPLLKPLVDGPQSAVVTGPAGEEIFCDEHGRV
RVKFNWDRYNPSNQDSSCWSRVAQAWAGTGFGNLAIPRVGQEVIVDFLNGDPDQPIILGRTYHQENRTPGSLPGTKTQMT
IRSKTYKGSGFNELKFDDATGKEQVYNHAQKNMNTEVLNNRTTDVINNHAETIGNNQMIAVTNNQIQTVGVNQIETVGSN
QIIKVGSVQVETIGLVRALTVGVAYQTTVGGIMNTSVALMQSSQMGLHKSLRVGLGYDVKVGNNVTFTVGKTKKDDTGQT
AIYSAGEHLELCCGKARLVLTKDGQIFLNGTKIHLQGKEQVNGDSLLINWNCAASKSPPKTPDEKQDTPDMREY

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec15 YP_851429.1 hypothetical protein Not tested PAI II APEC-O1 Protein 5e-165 55
aec15 AAQ96709.1 Aec15 Not tested AGI-1 Protein 3e-165 55
vgrG AAN64196.1 VgrG Not tested macrophage toxin pathogenicity island Protein 5e-131 48