Gene Information

Name : c5145 (c5145)
Accession : NP_756993.1
Strain : Escherichia coli CFT073
Genome accession: NC_004431
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG1484
EC number : -
Position : 4921500 - 4922255 bp
Length : 756 bp
Strand : +
Note : Residues 1 to 244 of 251 are 41.60 pct identical to residues 1 to 250 of 298 from SwissProt.40 : >sp|P55500|Y4IQ_RHISN INSERTION SEQUENCE ATP-BINDING PROTEIN Y4IQ/Y4ND/Y4SD
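
The position, length, and protein size in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the annotated region is a complete coding sequence ending in a single stop codon. Neither assumption is stated in the record, and the function names `feature_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count, assuming a complete CDS that ends in one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # the stop codon encodes no residue


length_bp = feature_length(4921500, 4922255)
print(length_bp)                  # 756
print(protein_length(length_bp))  # 251
```

Under these assumptions the results agree with the stated length of 756 bp and with the 251 residues mentioned in the Note field.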

DNA sequence :
GTGAGCAATATCCATCACCTTGAACGCAGCCTGCGTAAACTACGCCTGACACGAGTTGGAGCTGAATGGCACGCTCTGGA
AAAACGAGCACTGGCAGAAGGCTGGACACCATCGCGCTATCTTCTGACGCTATGCAATGAAGAACTCCTGTGGCGCGAGA
GTGAAAAACTGCGTCGTTATAAAAAGGAGGCCCGGTTGCCAGTTGCCAAAACGCTAAGCGAATACGACTTCAGTCAGGTG
CCGGAACTGAATGGAGCTCAGTTCCGGCAACTCTGTGAAACGACAGACTGGGTTGATGCAGGAGAAAACGTTCTGCTGTT
CGGAGCCAGCGGGTTGGGGAAAAGCCATCTGGCGGCAGCGATCGTGGATGGCGTAGTAGGCCAGGGCTACCGGGCCCGGT
TCTACAGCGCAGGAGAGTTGTTGCAGGAACTACGTAAAGCCAGAGCGCAGTTGAAACTGAATGAGCTGCTACTGAAACTG
GATCGCTACCGGGTGATAGTGGTGGATGATCTTGGCTATGTCAAACGCGACAGCGCCGAAACGGGAGTACTGTTCGAGTT
AATAGCGCATCGCTATGAACGTGGGAGCCTGGTGATAACCAGTAACCATCCGTTCAGCATGTGGGGCAGCATCTTCGTGG
ATGAGACTATGGCGGTGGCGGCGGCAGACCGGCTGATCCATCACGGATATATGTTCGAACTGAAAGGTGAAAGCTACAGG
AAAAAGACAGCGAAGGCAGTAACAAGCGCGACTTGA

Protein sequence :
MSNIHHLERSLRKLRLTRVGAEWHALEKRALAEGWTPSRYLLTLCNEELLWRESEKLRRYKKEARLPVAKTLSEYDFSQV
PELNGAQFRQLCETTDWVDAGENVLLFGASGLGKSHLAAAIVDGVVGQGYRARFYSAGELLQELRKARAQLKLNELLLKL
DRYRVIVVDDLGYVKRDSAETGVLFELIAHRYERGSLVITSNHPFSMWGSIFVDETMAVAAADRLIHHGYMFELKGESYR
KKTAKAVTSAT
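
The relationship between the DNA and protein sequences above can be illustrated with a minimal translation sketch. It uses the standard codon table and assumes that the first codon is an initiation codon; in bacteria the alternative start codons GTG and TTG are read as methionine at that position, which is why the protein begins with M although the DNA begins with GTG. The function name `translate_cds` is a hypothetical helper, not part of any database tool, and only the first 30 bases and the final line of the DNA sequence are used here.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
START_CODONS = {"ATG", "GTG", "TTG"}  # common bacterial initiation codons


def translate_cds(dna: str, is_start: bool = True) -> str:
    """Translate an in-frame DNA string; drop one trailing stop codon."""
    dna = "".join(dna.split()).upper()
    if not dna or len(dna) % 3 != 0:
        raise ValueError("sequence must be non-empty and a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    # Assumption: an initiation codon is decoded as Met regardless of identity.
    if is_start and codons[0] in START_CODONS:
        protein[0] = "M"
    if protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases of the record (starts with GTG).
print(translate_cds("GTGAGCAATATCCATCACCTTGAACGCAGC"))  # MSNIHHLERS
# Final line of the DNA sequence (in frame, not a start, ends with TGA).
print(translate_cds("AAAAAGACAGCGAAGGCAGTAACAAGCGCGACTTGA", is_start=False))  # KKTAKAVTSAT
```

Both fragments reproduce the corresponding ends of the protein sequence listed in the record.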

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
c3608 | NP_755483.1 | hypothetical protein | Not tested | PAI I CFT073 | Protein | 2e-108 | 100
c5145 | NP_756993.1 | hypothetical protein | Not tested | PAI II CFT073 | Protein | 2e-108 | 100
orfB | AAC61729.1 | OrfB | Not tested | PAI I CFT073 | Protein | 2e-81 | 99

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
c5145 | NP_756993.1 | hypothetical protein | VFG1729 | Protein | 1e-108 | 100