Gene Information

Name : ECH74115_2663 (ECH74115_2663)
Accession : YP_002271008.1
Strain : Escherichia coli EC4115
Genome accession: NC_011353
Putative virulence/resistance : Unknown
Product : bacteriophage replication gene A protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2505696 - 2508518 bp
Length : 2823 bp
Strand : +
Note : identified by match to protein family HMM PF05840
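
The Position and Length fields above agree if the coordinates are read as 1-based and inclusive, which is an assumption about this record's convention rather than something it states. A minimal sketch of that check follows; the helper name feature_length is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Position field of this record.
length_bp = feature_length(2505696, 2508518)

# A complete coding sequence is a whole number of codons.
codons, remainder = divmod(length_bp, 3)

# Assumption: the final codon is a stop codon and encodes no residue.
residues = codons - 1

print(length_bp, codons, remainder, residues)
```

Under these assumptions the span is 2823 bp, that is 941 codons with no remainder, which corresponds to 940 residues once the stop codon is excluded.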

DNA sequence :
ATGACGGCAGAGTACATCAGGGACTGGCAACAACCGCGCCACGCAGTGGGGCGTGAAGGAACGGGGATCCCCGCTCCTGA
ATCCGCGCTTTCCTCCTGGCTGGATGCCTACCGGGTAGAGAACGAGCGCCGCCAGGAAATGGCTGATGCGGCGTTCTCCG
CAACGCCGCTGGGCAACCTGATTAATAAAAGCCTGGACGCACAGGAAAAACAGGACAAAACCATCACACTGGCAGGAGAC
GCCAGAAAACAGGCACGCGGTGCGGTGGATGAAGCCATGGCCTCGCTGCGCCTGCTGCCGTCCTATCTGCGCGATCCGCT
TATTCGCCACCTCTCCTTCCTGCGCAAAAAACAGGAAGCCGATCGTCAGAAAGGCAAAAAGAGCTGGCAGGCTGAACGCT
ACGCGCGCGGAAACCTGCGCAAAATATTCGAACGTCTGGAGCGCACCGATCACCGCTGGCTGACACAGGGTTATCGCTCC
CTTGCCGGACGCGAACGCCTGGACGATTTGCTTTACCTGCCGCAGCTCAACAAACACCAGATACAGACGCTGGCCACCAT
GACGGCGGCGATGTTCAGCAGCACCTTCGAAAAACTCTGCGATGGCTTTGGCGCGACCGATGGCGAACTGACCATGGATG
TAACGCTGAAGGCGTATCAGATGCTGGCCCGCATGGCGTTACACCTGCACGCCATGCCTCCACATTATGACGCACTGACA
ACAGACAAAGACCGGAGGAACGAACCGGACACGGAGCTGCTGCCGGGCGCAATCCTTCGCCTGACCTGTGCGGAATGGTG
GAAACGCAAACTGTGGCTGTTACGTTGCGAGTGGAGAGAAGAACAACTCCGCGCCGCCTGTCTGGTTTCCAGAAAAACAT
CGCCCTATCTGAGCCAGGACGCGTTAAGCGAGTTTCGCGCACAGCGCGAGAAAACACGCGATTTCCTGAAAAGTTTCATG
CTGGAAAACGAAGACGGGTTCACGATTGATCTCGAGACAGTGTATTACGCGGGAGTAAGTAACCCGGTTCACCGTAAGGC
AGAAATGATGGCCACCATGAAGGGGCTGGAACTTCTGGCCGAAGCCCGTGGCGACAAAGCGGTGTTTCTGACTGTCACCT
GCCCGTCAAAATACCACGCTACAACAGAGAACGGTCATCCGAATCCCAAATGGAACGGGGCCACCATGCGCGACTCCAGC
GATTACCTGGTTAACACGTTTTTTGCGGCGGTCCGCAAGAAACTGAACCGCGACGGCCTGCGCTGGTATGGCATCCGCAC
GGTGGAGCCTCACCATGACGGCACCGTGCACTGGCATATGATGGTCTTTGCTCATCCGGAAGAAATCGACACCATTGTGT
CCCACACCCGCGATATTGCCATTCAGGAAGATCGTCACGAGCTGGGCGATGATATTACTCCGCGCTTTAAGGCGGAGTAT
GTCGACGGCTCAAAAGGCACGCCAACCAGCTACATCGCCACCTACATCGGAAAAAACCTGGACAGCCGCGCCGTGGATGG
CATCGACCCGAAAACAGGCAAACCACGCGTTGACCACGAAACCGGAAAATCAATGACCGAGAGCGTGGAACGCGCCATTG
GCTGGGCGCGCCTTCACCGGGTCCGCCAGTTCCAGTTCTTTGGCATCCCCTCCCGTCAGGTGTGGCGTGAACTGCGTCGC
CTTGCCAGCCAGATGGCACGCAACCCGGAAGGCCCGCAACGGCTGAAGGATGACGCAATGGATGCGGTTCTTGCTGCCGC
TGATGCCGGATGTTTTACCACCTACATTGAGAAACAGGGAGGCGTACTTGTTCCACGCAAGGACTACCTGATTCGCACCG
CCTACGACCTCGCAGATGAGCTGAACGATTACGGCGAACAGAGCGTACAGATTTACGGGATCTGGTCACCACTCATCGGG
GAATCCTCCCGTGTGTGCACGCACCCGGATAACTGGAAGCTGGTAAGACGTAAACCGGGAGTAGAAGACAGCGCCCGCGA
AAATGGTTTTGACCTTCAGGGCGGCCCTGCCGCCCCTTGGACTCGTGGCAATAACTGTCCCCGTGTACAGGAAACGGACA
ACAACGGGACAGAACAGCCGGAAGAACGGCCAGCACCGTGGCCGCAGCTTCCTGACGGCGTTGACGTGAACGAATGGATG
CGCTCACTGAAACGGCACGAACGCCGGGCGCTGATGCGTTCGCTTCGTGACAAACAGGCAAAAAACAGCAGTGATGAAAT
GCAGAGCTGGACACAGAGCCGCAAACAGCAGCGGCCTTTGCCTGATAACCACGAATTACTCGCTAAAGAATGGCGGGAGT
CTGCTGAATCTCTCGGCCTGCATATCGGTGAACAACAGATGCAGCACCTGTTACGGGGCGGCAGTCTGTACGTTGACGGC
AGCATCATTGCACCGCAGGGATTTGAAATTGTACGCAAACCGGATACCCGCCCGGACAGCCGAATCACGCAGCTCTGGCA
GCGCCTGAGCCGTAATCATGGCGTAAGCAGCACGGAGATCCGCCATAACCCGGTCGCCAGCTATCTGGCACAGCTGGGGG
CATCAGACCCTGAAGCCGCCGCACGCCTGGCATCCACACTTCAGCAGGACCAGAACACCATGAAAACACCCGTTACCGTG
CTTTCTGACATGCTGCGCGCCATCCGCGACGCAGAGCACGCACAGAGAATCAGTGAAACCACTGAACGCGCCAGCCGCAA
AGCAGACCTGCTGCGGGGTGGCCTGACCAGTGGAAACAAAAAACAGACAGAAACGGGACTCACAAATCCCGTAAATGAGC
AAAAAACGCGCCGCGATATATGA

Protein sequence :
MTAEYIRDWQQPRHAVGREGTGIPAPESALSSWLDAYRVENERRQEMADAAFSATPLGNLINKSLDAQEKQDKTITLAGD
ARKQARGAVDEAMASLRLLPSYLRDPLIRHLSFLRKKQEADRQKGKKSWQAERYARGNLRKIFERLERTDHRWLTQGYRS
LAGRERLDDLLYLPQLNKHQIQTLATMTAAMFSSTFEKLCDGFGATDGELTMDVTLKAYQMLARMALHLHAMPPHYDALT
TDKDRRNEPDTELLPGAILRLTCAEWWKRKLWLLRCEWREEQLRAACLVSRKTSPYLSQDALSEFRAQREKTRDFLKSFM
LENEDGFTIDLETVYYAGVSNPVHRKAEMMATMKGLELLAEARGDKAVFLTVTCPSKYHATTENGHPNPKWNGATMRDSS
DYLVNTFFAAVRKKLNRDGLRWYGIRTVEPHHDGTVHWHMMVFAHPEEIDTIVSHTRDIAIQEDRHELGDDITPRFKAEY
VDGSKGTPTSYIATYIGKNLDSRAVDGIDPKTGKPRVDHETGKSMTESVERAIGWARLHRVRQFQFFGIPSRQVWRELRR
LASQMARNPEGPQRLKDDAMDAVLAAADAGCFTTYIEKQGGVLVPRKDYLIRTAYDLADELNDYGEQSVQIYGIWSPLIG
ESSRVCTHPDNWKLVRRKPGVEDSARENGFDLQGGPAAPWTRGNNCPRVQETDNNGTEQPEERPAPWPQLPDGVDVNEWM
RSLKRHERRALMRSLRDKQAKNSSDEMQSWTQSRKQQRPLPDNHELLAKEWRESAESLGLHIGEQQMQHLLRGGSLYVDG
SIIAPQGFEIVRKPDTRPDSRITQLWQRLSRNHGVSSTEIRHNPVASYLAQLGASDPEAAARLASTLQQDQNTMKTPVTV
LSDMLRAIRDAEHAQRISETTERASRKADLLRGGLTSGNKKQTETGLTNPVNEQKTRRDI
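
The protein sequence above can be related to the DNA sequence by translating it codon by codon. The sketch below uses the standard genetic code, whose codon assignments are assumed to apply to this bacterial gene; the function name translate and the two short fragments are illustrative only, with the fragments copied from the start and end of the DNA sequence in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 24 nt of the DNA sequence in this record.
head = "ATGACGGCAGAGTACATCAGGGACTGGCAA"
tail = "CAAAAAACGCGCCGCGATATATGA"

print(translate(head))  # first ten residues
print(translate(tail))  # last seven residues, stop codon dropped
```

Passing the full DNA sequence, with its line breaks included, to translate should give a string that can be compared directly with the protein sequence listed above.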

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
STY4635 | NP_458715.1 | conserved hypothetical protein | Not tested | SPI-7 | Protein | 4e-137 | 42
t4328 | NP_807922.1 | hypothetical protein | Not tested | SPI-7 | Protein | 2e-137 | 42