Gene Information

Name : ECO103_1875 (ECO103_1875)
Accession : YP_003221814.1
Strain : Escherichia coli 12009
Genome accession: NC_013353
Putative virulence/resistance : Unknown
Product : terminase large subunit
Function : -
COG functional category : S : Function unknown
COG ID : COG5484
EC number : -
Position : 1966448 - 1968199 bp
Length : 1752 bp
Strand : -
Note : Prophage ECO103_P07

DNA sequence :
ATGATTCAGGACGCTTTTGTGCGCCAGCGTGCGCGGCAACTTTACTGGCAGGGTTATCCGCCCGCAGAAATATCACGTCT
GATGGGAATAAACCCGAACACGATTTATGCGTGGAAAAAACGCGACCAGTGGGATGAAACGCCACCCGTGCAGCGTGTCA
CGCAGTCCATCGATGCGCGCCTCATCCAGCTTACTGAAAAACAGAATAAAACAGGCGGTGACTTTAAGGAAATAGACCTG
CTGACCCGGCAGCTTAAAAAACTGCATGATGGCCAGCCGGATGCGACGGCCACAGGAAAGAAAGGCCGGGCGAAAAAACT
CAAAAATCATTTCACGCCGGAACAGATTGCCGCACTGCGGGAAAAAATCATCAGCAGGCTGGAGTGGCATCAGCGGGGCT
GGTTTGACTCCCTGACCCTTTGCAGGGAAGCCGGGATACGTAACAGGATGATCCTGAAATCCCGACAGATTGGGGCGACC
TGGTATTTTGCACAGGAAGCACTGCTGATGGCGCTGCGTGACGATGTGGCGCAACCTTACCAGCGTAACCAGATTTTTTT
GTCTGCGTCGCGTCGTCAGGCGTTCCAGTTTAAAAGCATTATTCAGAAGGCCGCGGCTGAAGTTGATGTGGAGCTGAAAG
GGGGCGATAAAATCATCCTCTCCAACGGCGCAGAGCTGCATTTTCTCGGCACTTCTGCTGCGTCGGCACAGTCCTATACG
GGCAATTTTTATTTTGATGAATTTTTCTGGGTCAGTCGCTTTGCTGAACTGCGCAAGGTGGCTGGCGCTATGGCAACCCT
CAGCGGACTGCGGCGCACCTACTTCTCCACGCCATCCACCGAAACGCACGAGGCATACGCCTACTGGAACGGCGACCGCT
GGAACGAGAAAAAGGCCTCGCATAAACGCCAGCGTTTTTCTGTGGACTGGAAAACGCTGCATAACGGGCTTATCTGCCCT
GACCGGACGTGGCGGCAAATTGTCACGCTGGAAGATGTGGTTAATCACGGCTGGAAACACACCGATATCGACGAAATTCG
TGATGAAAACACCGAAGACGAGTTCCTCAATCTCTATATGTGTGAGTTTGTCCGCGAAGGGGAATCGGCATTTAACCTGA
ATATCCTGATTGGCTGCGGTGTTGACGGATACGACGACTGGAAAGACTGGAAACCTTTTGCTCCCCGCCCGATGGGGAAT
CGTCCGGTATGGATTGGGTATGACGCAAACGGAAGCAGTGGCAACGGCGACAGCGGCGCTGTGTCCGTGGTGGTTCCTCC
GGCTGTTCCTGGTGGCCGTTTTCGAACGGTGGAGACGCGACGCGTTCAGGGGCTGGAGTTTGAAGAACAGGCCAGAGTCA
TTGAAGAGTTCACGTGTCGCTACAACGTGGAACACATCGGCATTGATGTGACGGGCGGGAACGGGGAGGCTGTTTATCAG
ATAGTGAAACGGTTTTTCCCTGCCGCTATTCCGTACACCTTCACGCTGTCATCAAAACGGTCGCTGGTACTGAAAATGCT
GCAAATAATGCGTGCCGGGCGGTGGGAATACGATCGCGCCGAACGCGAGCTGGTCGCGGCCTTTAACGCCGTGCGTAAGG
TGAAAACACCGGGCGGCTTTATCACTTACGAAACGGACCGCGCGAGGGGGATCAGCCACGGCGACCTTGCGTGGGCAACC
ATGCTTGCTGTCATTAACGAACCAATTGGCGGCGAAGGAGAAAACGAGCGTTTCACGGTTATGGAGTTCTGA

Protein sequence :
MIQDAFVRQRARQLYWQGYPPAEISRLMGINPNTIYAWKKRDQWDETPPVQRVTQSIDARLIQLTEKQNKTGGDFKEIDL
LTRQLKKLHDGQPDATATGKKGRAKKLKNHFTPEQIAALREKIISRLEWHQRGWFDSLTLCREAGIRNRMILKSRQIGAT
WYFAQEALLMALRDDVAQPYQRNQIFLSASRRQAFQFKSIIQKAAAEVDVELKGGDKIILSNGAELHFLGTSAASAQSYT
GNFYFDEFFWVSRFAELRKVAGAMATLSGLRRTYFSTPSTETHEAYAYWNGDRWNEKKASHKRQRFSVDWKTLHNGLICP
DRTWRQIVTLEDVVNHGWKHTDIDEIRDENTEDEFLNLYMCEFVREGESAFNLNILIGCGVDGYDDWKDWKPFAPRPMGN
RPVWIGYDANGSSGNGDSGAVSVVVPPAVPGGRFRTVETRRVQGLEFEEQARVIEEFTCRYNVEHIGIDVTGGNGEAVYQ
IVKRFFPAAIPYTFTLSSKRSLVLKMLQIMRAGRWEYDRAERELVAAFNAVRKVKTPGGFITYETDRARGISHGDLAWAT
MLAVINEPIGGEGENERFTVMEF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
t4322 NP_807916.1 terminase subunit Not tested SPI-7 Protein 5e-91 45
STY4627 NP_458709.1 probable terminase subunit Not tested SPI-7 Protein 2e-91 45
unnamed ABR13464.1 predicted ATPase terminase subunit Not tested PAGI-6 Protein 1e-92 45