Gene Information

Name : EC55989_2098 (EC55989_2098)
Accession : YP_002403152.1
Strain : Escherichia coli 55989
Genome accession: NC_011748
Putative virulence/resistance : Unknown
Product : phage replication protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2147285 - 2150107 bp
Length : 2823 bp
Strand : +
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; Product type h : extrachromosomal origin
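
The Position and Length fields can be cross-checked with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the listed length; the function name `feature_length` is illustrative and not part of any database tool.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(2147285, 2150107)

# A complete coding sequence should be a whole number of codons,
# the last of which is the stop codon.
codons, remainder = divmod(length_bp, 3)
residues = codons - 1  # exclude the stop codon

print(length_bp)   # 2823
print(remainder)   # 0
print(residues)    # 940
```

The result of 940 residues matches the length of the protein sequence listed in this record.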

DNA sequence :
ATGACGGCAGAGTACATCAGGGACTGGCAACAACCGCGCCACGCAGTGGGGCGTGAAGGAACGGGGATCCCCGCTCCTGA
ATCCGCGCTTTCCTCCTGGCTGGATGCCTACCGGGCAGAGAACGAGCGCCGCCAGGAAATGGCTGATGCGGCGTTCTCCG
CCACGCCGCTGGGCAACCTGATTAATAAAAGCCTGGACGCACAGGAAAAACAGGACAAAACAATCACACTGGCAGGAGAC
GCCAGAAAACAGGCACGCGGCGCGGTAGATGAGGCCATGGCCTCACTGCGCCTGCTGCCGTCCTATCTGCGCGATCCGCT
TATTCGCCACCTCTCCTTCCTGCGCAAAAAACAGGAAGCCGATCGTCAGAAAGGAAAAAACGCCAGGCAGGCAGAACGCT
ATGCGCGTGGGACCCTGCGCAAAATATTCGAACGTCTGGAGCGCACCGATCACCGCTGGCTGACATCGGGTTATCGCTCC
CTTGCCGGACGTGAACGCCTGGACGATTTGCTTTACCTGCCGCAACTCAACAAACACCAGATACAGACGCTGGCCACCAT
GACGGCGGCGATGTTCAGCAGCACCTTCGAAAAACTCTGCGATGGTTTTGGCGCGACTGATGGCGAGCTGACCATGGATG
TAACGCTGAAGGCGTATCAGATGCTGGCCCGCATGGCGTTACACTTACACGCCATGCCTCCGCATTATGACGCACTGACA
ACAGACAAAGACCGGAGGCACGAACCGGACACAGAACTGCTGCCGGGCGCAATCCTTCGCCTGACCTGTGCGGAATGGTG
GAAACGCAAACTGTGGCTGTTACGTTGCGAGTGGAGAGAAGAACAACTCCGCGCCGCCTGTCTGGTTTCCAGAAAAACAT
CGCCCTATCTGAGCCAGGACGCGTTAAGCGAGTTTCGCGCACAGCGCGAGAAAACACGCGATTTCCTGAAAAGTTTCATG
CTGGAAAATGAAGACGGGTTCACGATTGATCTCGAGACGGTGTATTACGCGGGAGTAAGTAACCCGGTTCACCGTAAGGC
AGAAATGATGGCCACCATGAAGGGGCTGGAACTTCTGGCCGAAGCCCGTGGCGACAAAGCGGTGTTTCTGACTGTCACCT
GCCCGTCAAAATACCACGCCACAACGGAGAACGGTCATCCGAATCCCAAATGGAACGGGGCCACAATGCGCGACTCCAGC
GATTACCTGGTTAACACGTTTTTTGCGGCGGTCCGCAAAAAACTGAACCGCGACGACCTGCGCTGGTATGGCATCCGCAC
GGTGGAGCCTCATCATGACGGCACCGTGCACTGGCATATGATGGTCTTTGCACATCCGGAAGAAATCGACACCATTGTGT
CCCATACCCGCGATATTGCCATTCAGGAAGACCGCCACGAGCTGGGCAATGATATTACTCCGCGCTTTAAGGTGGAGTAT
GTCGACGGCTCAAAAGGCACGCCAACCAGCTACATCGCCACCTACATCGGAAAGAACCTGGACAGCCGCGCCGTGGATGG
CATCGACCCGAAAACGGACAAGCCACGCGTGGACCACGAAACCGGAAAATCAATGGCCGAGAGCGTGGAACGCGCCATCG
GCTGGGCGCGTCTTCACCGCGTCCGCCAGTTCCAGTTCTTTGGTATCCCCTCCCGTCAGGTATGGCGTGAACTCCGCCGC
CTTGCCAGTCAGATGGCCCGCAACCCGGAAGGTCCACAACGTCTGGAAAATGACGCAATGGATGCGGTACTCGCTGCCGC
TGATGCCGGGTGTTTTGCCACCTACATTGAGAAACAGGGTGGCGTACTTGTTCCACGCAAAGACTACCTGATTCGCACCG
CCTACGACCTCGCAGAAGAGCTGAACGATTACGGCGAGCAAAGCGTACAGATTTACGGGATCTGGTCGCCACAAATCGGG
GAATCTTCCCGCGTGTGCACACACCCGGATAACTGGAAGCTGGTAAGACGTAAACCGGAAGCGGAAGACAGCGCCCGCGA
AAATGGTTTTGACCTTCAGGGCGGCCCTGCCGCCCCTTGGACTCGTGGCAATAACTGTCCCCGTGTACAGGAAACGGACA
ACAACGGGACAGAACAGCCGGAAGAACGGCCAGCACCGTGGCCGCAGCTCCCTGACGGCGTTGAAGTGAACGAATGGATG
CGCTCACTGAAACGGCACGAACGCCGGGCGCTGATGCGTTCGCTTCGTGACAAACAGGCAAAAAACAGCAGTGATGAAGC
GCAGAGCTGGACACAGAGCCGCAAACAGCAGCGGCCTTTGCCTGATAACCACGAATTACTCGCTAAAGAATGGCGGGAGT
CTGCTGAATCTCTCGGCCTGCATATCGGTGAACAACAGATGCAGCACCTGTTACGGGGCGGCAGTCTGTACGTTGACGGC
AGCATCATTGCACCGCAGGGATTTGAAATTGTACGCAAACCGGATACCCGCCTGGACAGCCGAATCACGCAGCTCTGGCA
GCGCCTGAGCCGTAATCATGGCGTAAGCAGCACGGAGATCCGCCATAACCCGGTCGCCAGCTATCTGGCACAGCTGGGGG
CATCAGACCCTGAAGCCGCCGCACGCCTGGCATCCACACTTCAGCAGGACCAGAACACCATGAAAACACCCGTTACCGTG
CTTTCTGACATGCTGCGCGCCATCCGCGACGCAGAGCACGCACAGAGAATCAGTGAAACCACTGAACGCGCCAGCCGCAA
AGCAGACCTGCTGCGGGGTGGCCTAACCAGTGGAAACAAAAAACAGACAGAAACGGGACTCACAAATCCCGTAAATGAGC
AAAAAACGCGCCGCGATATATGA

Protein sequence :
MTAEYIRDWQQPRHAVGREGTGIPAPESALSSWLDAYRAENERRQEMADAAFSATPLGNLINKSLDAQEKQDKTITLAGD
ARKQARGAVDEAMASLRLLPSYLRDPLIRHLSFLRKKQEADRQKGKNARQAERYARGTLRKIFERLERTDHRWLTSGYRS
LAGRERLDDLLYLPQLNKHQIQTLATMTAAMFSSTFEKLCDGFGATDGELTMDVTLKAYQMLARMALHLHAMPPHYDALT
TDKDRRHEPDTELLPGAILRLTCAEWWKRKLWLLRCEWREEQLRAACLVSRKTSPYLSQDALSEFRAQREKTRDFLKSFM
LENEDGFTIDLETVYYAGVSNPVHRKAEMMATMKGLELLAEARGDKAVFLTVTCPSKYHATTENGHPNPKWNGATMRDSS
DYLVNTFFAAVRKKLNRDDLRWYGIRTVEPHHDGTVHWHMMVFAHPEEIDTIVSHTRDIAIQEDRHELGNDITPRFKVEY
VDGSKGTPTSYIATYIGKNLDSRAVDGIDPKTDKPRVDHETGKSMAESVERAIGWARLHRVRQFQFFGIPSRQVWRELRR
LASQMARNPEGPQRLENDAMDAVLAAADAGCFATYIEKQGGVLVPRKDYLIRTAYDLAEELNDYGEQSVQIYGIWSPQIG
ESSRVCTHPDNWKLVRRKPEAEDSARENGFDLQGGPAAPWTRGNNCPRVQETDNNGTEQPEERPAPWPQLPDGVEVNEWM
RSLKRHERRALMRSLRDKQAKNSSDEAQSWTQSRKQQRPLPDNHELLAKEWRESAESLGLHIGEQQMQHLLRGGSLYVDG
SIIAPQGFEIVRKPDTRLDSRITQLWQRLSRNHGVSSTEIRHNPVASYLAQLGASDPEAAARLASTLQQDQNTMKTPVTV
LSDMLRAIRDAEHAQRISETTERASRKADLLRGGLTSGNKKQTETGLTNPVNEQKTRRDI

• Homologs from PAI DB

Gene     GenBank Accn  Product                         Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity
t4328    NP_807922.1   hypothetical protein            Not tested               SPI-7       Protein         3e-133  43
STY4635  NP_458715.1   conserved hypothetical protein  Not tested               SPI-7       Protein         5e-133  42