Gene Information

Name : c5215 (c5215)
Accession : NP_757063.1
Strain : Escherichia coli CFT073
Genome accession: NC_004431
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : T : Signal transduction mechanisms
COG ID : COG0467
EC number : -
Position : 4967922 - 4969685 bp
Length : 1764 bp
Strand : -
Note : Residues 5 to 587 of 587 are 96.39 pct identical to residues 1 to 583 of 774 from GenPept.129 : >gb|AAK00457.1| (AF200692) unknown [Shigella flexneri 2a]

DNA sequence :
ATGGGACTGGTAATGGATGAAAATGCTTTAGGGTTTGCCTCATACTGGCGCAACTCGCTTGCGGATGCTGAGTCAGGAAA
GGGCAGTTTTGAACGGAAAGACGCCAAAAATTTCACTCACTGGCATGGGATAGCGGCGGGACGTCTTGACGAAGCGATTG
TCAGTAAATTTTTTGAGGGAGAAAAAGACGATGTCGAAACGGTCGATGTCGTCTTGCGCCCAAAAGTTTATTTCCGGTTA
CTGCAGCATGGTAAGGACCGTTCCGCAGGCGCACCTGATATTGTTACCCCGCTAGTGACGCCAGCCTTGCTAAGCCGTGA
GGGTTTTTTATATCCGACGCCAGCGACCTCCATTCCCAGAGACCTGCTTGAACCTTTGCCAAAAGGAGCATTTTCGATTG
GTGAGATTGGGCAGTATGACAAATACAAGACAATCCATACCTCGTTCTCTATCAACTTTGATGACAGCATTGATAAGACT
GCCGAAACGGATGAAGAACGGGAAGCACGATATGCAGCCTTGCAGCAGGAGTGGCGTCAATATCTGGATGATTCAGAGAG
GCTGCTGAAGAACGTTGCCGGCGACTGGATTAAAAATCCTGAGCAATATGAACTCGCTGAGCACGGTTATATTGTTAAAA
CGGCGCAATCTGGTGGTGCCAGTTTCCATATCCTTTCGCTTTATGATCACCTGCTTGTTTGCAAGAAGGATGTGCCGCTC
TTCAATCGCTTCGCCTCGCGAGAGGTTCATGCTGCAGAGTCTTTGCTGGCCCCAGGAGCAAAATTCAGCGACAGGCTTGG
ACACTCCGGAGATAAGTTTCCGCTGGCAAAGGCTCAGCGCGATGCCTTAAGCCATTTTCTGGATGCAAGACATGGCGATA
TCCTTGCTGTTAATGGACCTCCGGGAACCGGAAAAACCACGCTGGTGCTTTCTATCATCGCCACGCAGTGGGCCAGAGCG
GCTCTCGAAAAATCTGAGCCTCCGGTTATTATCGCGACTTCAACGAACAACCAGGCTGTAACGAACATTATCGAAGCGTT
CGGGAAAGATTTTTCACAGGGCACTGGTGCAATGGCCGGACGATGGTTACCTGAGCTGAAAAGCTTCGGCGCTTATTTTC
CCTCAAGCACTCGTAAAGCTGAGGCAGCCAAAAAATATCAAACTGAAGATTTCTTCAACCAGGTTGAGTCAAAAGAGTAT
GTAGAGGATGCACTGCTGTTTTATCTCGAGAAAGCTAAGGCAGCTTTTCCTGAAAAAGAGTGTTCATCCCCTGAAAAGGT
CATTGAACTCCTGCATGGTCAGTTGGTAGCAAAATCCGAGCAATTGAAAAGACTGAACGCAACATGGCAAACGTTAAGCC
AGGTACGGGCTGCGCGTGAGCTTATTGCTAATGATATTGAGCAATATCTCGATAATTTAAATAAATTACTTTCCGGACAA
GAACAAAAAGTCACTCTACTGAAGAGTGCTAAAACGGAATGGAAAAAATATCGCGCCGGTGAATCACTGATCTATTCATT
ATTTTCCTGGCTCCCAGCGGTTCGTAGTAAGCGACAGTACCAAATACAGCTGTTTCTCGAAGATAAATTAGGCGCGCTGA
TTGCAGGAAATCAGTGGTCTGATCCTGAAACTATCGAACGTAATATTGATGGGCTGCTCAATTCCGCTGAGCGCGAGCAA
ACAACATACCGGCAGCAGATTGACTCCGCCCATGAAATCGTTCTTAAAGAACAGCAGGCGGTTCAGGAGTGGCAGAGGCT
GTGA

Protein sequence :
MGLVMDENALGFASYWRNSLADAESGKGSFERKDAKNFTHWHGIAAGRLDEAIVSKFFEGEKDDVETVDVVLRPKVYFRL
LQHGKDRSAGAPDIVTPLVTPALLSREGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTIHTSFSINFDDSIDKT
AETDEEREARYAALQQEWRQYLDDSERLLKNVAGDWIKNPEQYELAEHGYIVKTAQSGGASFHILSLYDHLLVCKKDVPL
FNRFASREVHAAESLLAPGAKFSDRLGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGPPGTGKTTLVLSIIATQWARA
ALEKSEPPVIIATSTNNQAVTNIIEAFGKDFSQGTGAMAGRWLPELKSFGAYFPSSTRKAEAAKKYQTEDFFNQVESKEY
VEDALLFYLEKAKAAFPEKECSSPEKVIELLHGQLVAKSEQLKRLNATWQTLSQVRAARELIANDIEQYLDNLNKLLSGQ
EQKVTLLKSAKTEWKKYRAGESLIYSLFSWLPAVRSKRQYQIQLFLEDKLGALIAGNQWSDPETIERNIDGLLNSAEREQ
TTYRQQIDSAHEIVLKEQQAVQEWQRL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
c5215 NP_757063.1 hypothetical protein Not tested PAI II CFT073 Protein 0.0 100
unnamed AAK00457.1 unknown Not tested SHI-1 Protein 0.0 97