Gene Information

Name : ETEC_1982 (ETEC_1982)
Accession : YP_006115550.1
Strain : Escherichia coli ETEC H10407
Genome accession : NC_017633
Putative virulence/resistance : Unknown
Product : putative bacteriophage replication gene A
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2137038 - 2139845 bp
Length : 2808 bp
Strand : +
Note : -
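
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed values) and that the coding sequence ends in a single stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Residues encoded, assuming a complete CDS that ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(2137038, 2139845)
print(length_bp)                           # 2808
print(expected_protein_length(length_bp))  # 935
```

The second value can be compared with the number of residues in the protein sequence listed in this record.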

DNA sequence :
ATGACGGCAGAGTACATCAGGGACTGGCAGCAACCGCGCCACGCAGTGGGGCGTGAAGGAACGGGGATCCCCGCTCCTGA
ATCCGCGCTTTCCTCCTGGCTGGATGCCTACCGGGCAGAGAACGAGCGCTGCCAGGAAATGGCTGATGCGGCGTTCTCCG
CCACGCCGCTGGGCAACCTGATTAATAAAAGCCTGGACGCACAGGAAAAACAGGACAAAACCATCACACTGGCAGGAGAC
GCCAGAAAGCAGGTACGCGGCGCGGTGGATGAAGCCATGGCCTCGCTGCGCCTGCTGCCATCCTATCTGCGCGATCCGCT
TATTCGCCACCTCTCCTTCCTGCGCAAAAAACAGGAATCTGACCGCCAGAAAGGTAAAAAGAACCAGCAGGCTGAACGCT
ACGCGCGCGGGACCCTGCGCAAAATATTCGAACGTCTGGATCGCACTGACGGGCGCTGGCTGACACCGGGTTATCGCTCC
CTTGCCGGGCGCGAACGCCTGGACGATTTGCTTTACCTGCCGCAGCTCAACAAGCACCAGATACAGACGCTGGCCACCAT
GACGGCGGCGATGTTCAGCAGCACCTTCGAAACACTCTGCGATGGCTTTGGTGCCAGAGATGGCGAGCTGACCATGGATG
TAATGTTGAAGGCTTACCGGATGCTGGCCCGTATCGCATTACGCCTGCACATCATGCCGCCACATTACGAAGCCCTGAAC
AAGAGCGATCCGGATACGGAACTGTTACCGGGCGCAATCCTTCGCCTGACCTGTGCGGAATGGTGGAAACGCAAATTGTG
GCTGTTACGTTGCGAGTGGAGAGAAGAACAACTCCGCGCCGCCTGTCTGGTTTCCAGAAAAACATCACCCTATCTGAGCC
AGGACGCGTTAAGCGAGTTTCGCGCACAGCGCGAGAAAACACGCGATTTCCTGAAAAGTTTCATGCTGGAAAACGAAGAC
GGGTTCACGATTGATCTCGAGACAGTGTATTACGCGGGAGTAAGTAACCCGGTTCACCGTAAGGCAGAAATGATGGCCAC
CATGAAGGGGCTGGAACTTCTGGCCGAAGCCCGTGGCGACAAAGCGGTGTTTCTGACTGTCACCTGCCCGTCAAAATACC
ACGCCACAACAGAGAACGGTCATCCGAATCCCAAATGGAACGGGGCCACCATGCGCGACTCCAGCGATTACCTGGTTAAC
ACGTTTTTTGCGGCGGTCCGCAAGAAACTGAACCGCGACGGCCTGCGCTGGTATGGCATCCGCACGGTGGAGCCTCACCA
TGACGGCACCGTGCACTGGCATATGATGGTCTTTGCACATCCGGACGAGATTGAAACCATCGTGTCCCACGTCTGCGATA
TTGCCATTCAGGAAGACCGCCACGAGCTGGGCGATGACATAACTCCGCGTTTTAAGGCGGAGTACGTAGACGGCTCAAAA
GGCACACCAACCAGCTACATCGCCACCTACATCGGAAAGAACCTGGACAGCCGCGCCGTGGATGGCATCGACCAGAAAAC
GGGCAAGCCACGCGTTGACCACGAAACCGGAAAATCAATGGCCGAGAGCGTGGAACGCGCCATCGGCTGGGCGCGCCTTC
ACCGGGTCCGCCAGTTCCAGTTCTTTGGCATCCCCTCCCGTCAGGTGTGGCGTGAACTCCGCCGCCTTGCCAGCCAGATG
GCACGCAACCCGGAAGGCCCGCAACGGCTGAAGGATGACGCAATGGATGCGGTACTCGCTGCCGCTGATGCAGGATGTTT
TGCCACCTACATAGAGAAACAGGGCGGCGTACTTGTTCCACGCAAAGACTACCTGATTCGCACCGCCTACGACCTCGCAG
ATGAGCTGAACGATTACGGCGAACAGAGCGTACAGATTTACGGGATCTGGTCACCACTCATCGGGGAGTCTTCCCGTGTG
TGCACGCATCCGGATAACTGGAAGCTGGTAAGACGCAAACCGGAAGCGGAAGACAGCGCCCGCGAAAATGGTTTTGACCT
TCAGGGCGGCCCTGCCGCCCCTTGGACTCGTGGCAATAACTGTCCCCGTGTACAGGAAACAGGCAACAGCGGGACAGAAC
AGTCGAAAGAACGGCCAGCACCGTGGCCGCAGCTTCCTGACGGCGTTGAAGTGAACGAATGGATGCGCTCACTGAAACGG
CACGAACGCCGGGCGCTGATGCGTTCGCTTCGTGACAAACAGGCAAAAAACAGCAGTGATGAAATGCAGAGCTGGACACA
GAGCCGCAAACAGCAGCGGCCTTTGCCTGATAACCACGAATTACTCGCTAAAGAATGGCGGGAGTCTGCCGAATCTCTCG
GCCTGCATATCGGTGAACAGCAGATGCAGCACCTGCTACGGGGCGGCAGCCTGTACGTTGACGGCAGCATCATTGCACCG
CAGGGATATGAAATTGTACGCAAACCGGATACCCGCCCGGACAGCCGAATCACGCAGCTCTGGCAGCACCTGAGCCGTAA
TCACGGCGTAAGCAGCACGGAGATCCGCCATAACCCGGTCGCCAGCTATCTGGCACAGCTGGGGGCATCAGACCCCGAAG
CCGCCGCACGCCTGGCATCCGCACTTCAGCAGGATCAGAACACCATGAAAACACCCGTTACCGTGCTTTCTGACATGCTG
CGCGCCATCCGCGACGCAGAGCACGCACAGAGAATCAGTGAAACCACTGAACGCGCCAGCCGCAAAGCAGACCTGCTGCA
GGGTGGCCTGACCAGTGGAAACAAAAAACAGACAGAAACGGGATCCACAAATCCCGTAAATGAGCAAAAAACGCGCCGCG
ATATATGA

Protein sequence :
MTAEYIRDWQQPRHAVGREGTGIPAPESALSSWLDAYRAENERCQEMADAAFSATPLGNLINKSLDAQEKQDKTITLAGD
ARKQVRGAVDEAMASLRLLPSYLRDPLIRHLSFLRKKQESDRQKGKKNQQAERYARGTLRKIFERLDRTDGRWLTPGYRS
LAGRERLDDLLYLPQLNKHQIQTLATMTAAMFSSTFETLCDGFGARDGELTMDVMLKAYRMLARIALRLHIMPPHYEALN
KSDPDTELLPGAILRLTCAEWWKRKLWLLRCEWREEQLRAACLVSRKTSPYLSQDALSEFRAQREKTRDFLKSFMLENED
GFTIDLETVYYAGVSNPVHRKAEMMATMKGLELLAEARGDKAVFLTVTCPSKYHATTENGHPNPKWNGATMRDSSDYLVN
TFFAAVRKKLNRDGLRWYGIRTVEPHHDGTVHWHMMVFAHPDEIETIVSHVCDIAIQEDRHELGDDITPRFKAEYVDGSK
GTPTSYIATYIGKNLDSRAVDGIDQKTGKPRVDHETGKSMAESVERAIGWARLHRVRQFQFFGIPSRQVWRELRRLASQM
ARNPEGPQRLKDDAMDAVLAAADAGCFATYIEKQGGVLVPRKDYLIRTAYDLADELNDYGEQSVQIYGIWSPLIGESSRV
CTHPDNWKLVRRKPEAEDSARENGFDLQGGPAAPWTRGNNCPRVQETGNSGTEQSKERPAPWPQLPDGVEVNEWMRSLKR
HERRALMRSLRDKQAKNSSDEMQSWTQSRKQQRPLPDNHELLAKEWRESAESLGLHIGEQQMQHLLRGGSLYVDGSIIAP
QGYEIVRKPDTRPDSRITQLWQHLSRNHGVSSTEIRHNPVASYLAQLGASDPEAAARLASALQQDQNTMKTPVTVLSDML
RAIRDAEHAQRISETTERASRKADLLQGGLTSGNKKQTETGSTNPVNEQKTRRDI
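
The protein sequence above corresponds to a translation of the DNA sequence in frame 1 on the given strand. A minimal sketch of that translation follows, assuming the standard genetic code (the bacterial table uses the same codon assignments and differs only in permitted start codons). The `translate` helper is hypothetical and handles only unambiguous A/C/G/T input.

```python
# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate frame 1 of a DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove line breaks and spaces
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the DNA sequence in this record.
print(translate("ATGACGGCAGAGTACATCAGGGACTGGCAG"))  # MTAEYIRDWQ
print(translate("GATATATGA"))                       # DI
```

Passing the full DNA sequence (with or without its line breaks) to `translate` would allow a direct comparison with the listed protein sequence.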

• Homologs from PAI DB

Gene     GenBank Accn  Product                         Virulence or Resistance  PAI or REI  Alignment Type  E-value  Identity (%)
STY4635  NP_458715.1   conserved hypothetical protein  Not tested               SPI-7       Protein         5e-133   42
t4328    NP_807922.1   hypothetical protein            Not tested               SPI-7       Protein         2e-133   42