Gene Information

Name : ECO103_1870 (ECO103_1870)
Accession : YP_003221809.1
Strain : Escherichia coli 12009
Genome accession: NC_013353
Putative virulence/resistance : Unknown
Product : replication protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1960149 - 1962971 bp
Length : 2823 bp
Strand : +
Note : Prophage ECO103_P07

DNA sequence :
ATGACGGCAGAGTACATCAGGGACTGGCAACAACCGCGCCACGCAGTGGGGCGTGAAGGAACGGGGATCCCCGCTCCTAA
ATCCGCGCTTTCCTCCTGGCTGGATGCCTACCGGGTAGAGAACGAGCGCCGCCAGGAAATGGCTGATGCGGCGTTCTCCG
CCACGCCGCTGGGCAACCTGATTAATAAAAACCTGGACGCACAGGAAAAACAGGACAAAACCATCACACTGGCAGGAGAC
GCCAGAAAACAGGCACGCGGCGCGGTGGATGAAGCCATGGCCTCGCTGCGCCTGCTGCCGTCCTATCTGCGCGATCCGCT
TATTCGCCACCTCTCCTTCCTGCGCAAAAAACAGGAAGCCGATCGCCGGAAAGGCAAAAAGAGCTGGCAGGCGGAACGCT
ATGCACGCGGAACCCTGCGCAAAATATTCGAACGTCTGGACCGCACCGACCACCGCTGGCTGACACCGGGTTATCGCTCC
CTTGCCGGACGCGAACGCCTGGATGATTTGCTTTACCTGCCGCAGCTCAACAAACACCAGATACAGACGCTGGCCACCAT
GACGGCGGCGATGTTCAGCAGCACTTTCGAAAAACTCTGCGATGGCTTTGGCGCGACCGATGGCGAGCTGACCATGGATG
TAACGCTGAAGGCGTATCAGATGCTGGCCCGCATGGCGTTACACCTGCACGCCATGCCTCCACATTATGACGCACTGACA
ACAGACAAAGACCGGAGGAACGAACCGGACACGGAGCTGCTGCCGGGTGCAATCCTTCGCCTGACCTGTGCGGAATGGTG
GAAACGCAAACTGTGGCTGTTACGTTGCGAGTGGCGGGAAGAACAACTCCGCGCCGCCTGTCTGGTTTCCAGAAAAACAT
CGCCCTATCTGAGTCAGGACGCATTAAGCGAGTTTCGCGCGCAGCGCGAGAAAACACGCGATTTCCTGAAAAGTTTCATG
CTGGAAAACGAAGACGGGTTCACGATTGATCTCGAGACAGTGTATTACGCGGGAGTAAGTAACCCGGTTCACCGTAAGGC
AGAAATGATGGCCACCATGAAGGGGCTGGAACTTCTGGCCGAAGCCCGTGGCGACAGAGCGGTGTTTCTGACTGTCACCT
GCCCGTCAAAATACCACGCCACAACAGAGAACGGTAATCCGAATCCCAAATGGAACGGGGCCACCATGCGCGACTCCAGC
GATTACCTGGTTAACACGTTTTTTGCGGCAGTCCGCAAAAAACTGAACCGCGACGGTCTGCGCTGGTATGGCATCCGCAC
GGTGGAGCCTCACCATGACGGCACCGTGCACTGGCATATGATGGTCTTTGCTCATCCGGAAGAAATCGACACCATTGTGT
CCCACACCCGCGATATTGCCATTCAGGAAGATCGTCACGAGCTGGGTGATGATATTACCCCACGCTTTAAGGCAGAGTAC
GTCGACGGTTCGAAAGGTACGCCGACCAGCTACATCGCCACCTACATCGGAAAGAACCTGGACAGCCGCGCCGTGGGTGG
CATTGACCCGAAAACAAGCAAGCCACGCGTTGATCACGAAACCGGAAAATCAATGGCCGAGAGCGTGGAACGCGCCATCG
GCTGGGCGCGCCTTCACCGCGTCCGCCAGTTCCAGTTCTTTGGTATCCCCTCCCGTCAGGTATGGCGTGAACTCCGCCGC
CTTGCCAGTCAGATGGCCCGCAACCCGGAAGGTCCACAACGTCTGGAAAATGACGCAATGGATGCGGTACTCGCTGCCGC
TGATGCCGGGTGTTTTGCCACCTACATTGAGAAACAGGGTGGCGTACTTGTTCCACGCAAGGATTACCTGATTCGCACCG
CCTACGACCTCGCAGAAGAGCTGAACGATTACGGCGAGCAAAGCGTACAGATTTACGGGATCTGGTCGCCACAAATCGGG
GAATCTTCCCGCGTGTGCACGCACCCGGATAACTGGAAGCTGGTAAGACGTAAACCGGAAGCGGAAGACAGCGCCCGCGA
AAATGGTTTTGACCTTCAGGGCGGCCCTGCCGCCCCTTGGACTCGTGGCAATAACTGTCCCCGTGTACAGGAAACGAACA
ACAACGGGACAGAACAGCCGGAAGAACGGCCAGCACCGTGGCCGCAGCTCCCTGATGGCGTTGAAGTGAACGAATGGATG
CGCTCACTGAAACGGCACGAACGCCGGGCGCTGATGCGTTCGCTTCGTGACAAACAGGCAAAAAACAGCAGTGATGAAAT
GCAGAGCTGGACACAGAGCCGCAAACAGCAGCGGCCTTTGCCTGATAACCACGAATTACTCGCTAAAGAATGGCGGGAAT
CTGCCGAATCTCTCGGCCTGCATATCGGTGAACAGCAGATGCAGCACCTGTTACGGGGCGGCAGCCTGTACGTTGACGGC
AGCATCATTGCACCGCAGGGATTTGAAATTGTACGCAAACCAGATACCCGCCCGGACAGCCGAATCACGCAGCTCTGGCA
GCGCCTGAGCCGTAATCACGGCGTAAGCAGCACAGAGATCCGCCATAACCCGGTCGCCAGCTATCTGGAACAACTGGGGG
CATCAGACCCCGAAGCCGCCGCACGTCTGGCATCCACACTTCAGCAGGACCAGAACACCATGAAAACCCCCGTTACCGTG
CTTTCTGACATGCTGCGCGCCATCCGTGACGCAGAGCACGCACAGAGAATCAGTGAAACCACTGAACGCGCCCACCGCAA
AGCAGACCTGCTGCGGGGTAGCCTGACCAGTGGAAACAAAAAACAGACAGAAACGGGACTCACAAATCCCGTAAATGAGC
AAAAAACGCGCCGCGATATATGA

Protein sequence :
MTAEYIRDWQQPRHAVGREGTGIPAPKSALSSWLDAYRVENERRQEMADAAFSATPLGNLINKNLDAQEKQDKTITLAGD
ARKQARGAVDEAMASLRLLPSYLRDPLIRHLSFLRKKQEADRRKGKKSWQAERYARGTLRKIFERLDRTDHRWLTPGYRS
LAGRERLDDLLYLPQLNKHQIQTLATMTAAMFSSTFEKLCDGFGATDGELTMDVTLKAYQMLARMALHLHAMPPHYDALT
TDKDRRNEPDTELLPGAILRLTCAEWWKRKLWLLRCEWREEQLRAACLVSRKTSPYLSQDALSEFRAQREKTRDFLKSFM
LENEDGFTIDLETVYYAGVSNPVHRKAEMMATMKGLELLAEARGDRAVFLTVTCPSKYHATTENGNPNPKWNGATMRDSS
DYLVNTFFAAVRKKLNRDGLRWYGIRTVEPHHDGTVHWHMMVFAHPEEIDTIVSHTRDIAIQEDRHELGDDITPRFKAEY
VDGSKGTPTSYIATYIGKNLDSRAVGGIDPKTSKPRVDHETGKSMAESVERAIGWARLHRVRQFQFFGIPSRQVWRELRR
LASQMARNPEGPQRLENDAMDAVLAAADAGCFATYIEKQGGVLVPRKDYLIRTAYDLAEELNDYGEQSVQIYGIWSPQIG
ESSRVCTHPDNWKLVRRKPEAEDSARENGFDLQGGPAAPWTRGNNCPRVQETNNNGTEQPEERPAPWPQLPDGVEVNEWM
RSLKRHERRALMRSLRDKQAKNSSDEMQSWTQSRKQQRPLPDNHELLAKEWRESAESLGLHIGEQQMQHLLRGGSLYVDG
SIIAPQGFEIVRKPDTRPDSRITQLWQRLSRNHGVSSTEIRHNPVASYLEQLGASDPEAAARLASTLQQDQNTMKTPVTV
LSDMLRAIRDAEHAQRISETTERAHRKADLLRGSLTSGNKKQTETGLTNPVNEQKTRRDI

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
t4328 NP_807922.1 hypothetical protein Not tested SPI-7 Protein 3e-134 43
STY4635 NP_458715.1 conserved hypothetical protein Not tested SPI-7 Protein 8e-134 42