Gene Information

Name : EcSMS35_1186 (EcSMS35_1186)
Accession : YP_001743247.1
Strain : Escherichia coli SMS-3-5
Genome accession : NC_010498
Putative virulence/resistance : Unknown
Product : putative phage terminase, large subunit
Function : -
COG functional category : R : General function prediction only
COG ID : COG4626
EC number : -
Position : 1192092 - 1193849 bp
Length : 1758 bp
Strand : +
Note : identified by match to protein family HMM PF03354
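
The coordinates and length in this record can be cross-checked with a short calculation. The sketch below assumes the positions are 1-based and inclusive, which matches the stated length of 1758 bp; the function names are illustrative and not part of any database API.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def expected_residues(length_bp: int) -> int:
    """Number of amino acids encoded, assuming a complete ORF
    whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(1192092, 1193849)
print(length_bp)                     # 1758
print(expected_residues(length_bp))  # 585
```

The result of 585 residues agrees with the length of the protein sequence listed in this record.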

DNA sequence :
ATGGCAAAAGTGGCTGACGGGATCCGCTACGCCGAACGTGTTGTTGCAGGAGAAATTGTTGCTGGCGAATTTGTCCGCCT
GGCCTGCCAGCGTTTTCTTGATGATCTGAAGTACGGCGAAGAGCGGGGGATTTATTTCAGTGAACCCCGTGCGCAGCACA
TCCTGAATTTCTACAAATTTGTGCCTCATGTAAAAGGGGCGCTGGCAGGCCAGCCCATTGAGTTGATGGACTGGCATGTA
TTTATCCTCATTAATATTTTTGGTTTTGTCATTCCGCTGGTCAATGAAGAGACCGGGGAAGTTGTCATGCGCAGCGATGG
CAGCGGACGTCCGGTGATGGTGCGCCGGTTCCGGACGGCGTACAACGAAGTCGCCCGTAAAAACGCAAAATCAACTCTGT
CATCGGGTATCGGCCTGTATATGACGGGGGCAGATGGTGAAGGCGGAGCTGAGGTGTATTCAGCCGCAACCACGCGTGAC
CAGGCCAGAATCGTGTTTGAAGACGCCAAAAATATGGTCAGAAAAGCCCGGTCGACACTCGGGCGGTTGTTTGATTTCAA
CAAGCTGGCGATTTACCAGGAGCAGAGCGCATCAAAATTTGAACCGCTTTCTTCGGATGCAAACAACCTGGATGGTCTGA
ACATCCACTGCGCCATTATTGATGAGCTGCATGCACATAAAACTCGTGACGTGTGGGACGTTCTGGAAACGGCAACCGGT
GCCCGTCTGCAGTCCCTTTTATTTGGTATCACCACGGCAGGGTTTAACAAGGAAGGGATTTGTTACGAGCAGCGTGATTA
CGCCATCAAGGTATTGCGTGGCTATAACAGCGACGTGGAGGGCGCGGTAAAAGACGACTCCTACTTTGCGATTATTTACA
CCCTCGATGAGGGAGATGATCCGTTTGATGAAACGGTCTGGCAGAAAGCGAATCCCGGCCTGGGCATCTGTAAACGCTGG
GATGATCTGCGTCGCCTGGCGAAAAAAGCGAAAGAACAGGTCTCTGCGCGGGTGAATTTTTTTACCAAACACATGAATGT
GTGGGTAACAGCAGAGTCTGCCTGGATGGACATGATTAAGTGGGATAAGTGCGAATACATTGCCCCACGACATGAGCTGA
AAACGTATCCCATGTGGGTCGGCGTTGACCTTGCTCATAAGATTGATATCTGTGCGGCGGCAAAACTCTGGCGAACGGAT
AACGGGCATGTTCATGCCGATTTTAAATTCTGGCTTCCGGAAGGACGGCTGGAACGATGCTCGCGGCAGCAGGCAGAACT
TTACCGGAAGTGGGCGGAGATGGATAAGCTGATTCTGACGGATGGTGATGTTATCGATCATGCTCAGATAAAAAGTGACT
TACTGGAATGGATTGGTGGTGAAAACCTCAGGGAACTGGGATTTGACCCGTGGAGCGCGATGCAGTTCAGCCTGGCACTG
GCTGAAGAAGGGATACCGCTGGTGGAGGTTCCGCAGACGGTTCGCAATCTGTCAGAGGCCATGAAGGAAACGGAATCACT
GGTCTATGCCGGGCGTTTCCATCACAGCAATCATCCGGTCATGAACTGGATGATGTCTAACGTTACGGTAAAACCGGACA
AAAACGACAATATCTTCCCGAATAAATCCACGCTGGAAGCCAAAATCGACGGCCCTGTTGCGATGTTTACAGCAATGAGC
CGGATGCTGGTCAATGGTGGTGAACCGGAGCTGGATCTGTCTGAACATCTGGTCAGCGTGGGCATCCGCTCGCTTTAA

Protein sequence :
MAKVADGIRYAERVVAGEIVAGEFVRLACQRFLDDLKYGEERGIYFSEPRAQHILNFYKFVPHVKGALAGQPIELMDWHV
FILINIFGFVIPLVNEETGEVVMRSDGSGRPVMVRRFRTAYNEVARKNAKSTLSSGIGLYMTGADGEGGAEVYSAATTRD
QARIVFEDAKNMVRKARSTLGRLFDFNKLAIYQEQSASKFEPLSSDANNLDGLNIHCAIIDELHAHKTRDVWDVLETATG
ARLQSLLFGITTAGFNKEGICYEQRDYAIKVLRGYNSDVEGAVKDDSYFAIIYTLDEGDDPFDETVWQKANPGLGICKRW
DDLRRLAKKAKEQVSARVNFFTKHMNVWVTAESAWMDMIKWDKCEYIAPRHELKTYPMWVGVDLAHKIDICAAAKLWRTD
NGHVHADFKFWLPEGRLERCSRQQAELYRKWAEMDKLILTDGDVIDHAQIKSDLLEWIGGENLRELGFDPWSAMQFSLAL
AEEGIPLVEVPQTVRNLSEAMKETESLVYAGRFHHSNHPVMNWMMSNVTVKPDKNDNIFPNKSTLEAKIDGPVAMFTAMS
RMLVNGGEPELDLSEHLVSVGIRSL
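
The protein sequence above corresponds to a direct translation of the DNA sequence on the + strand. A minimal sketch of that translation follows, assuming the standard codon assignments (which the bacterial genetic code shares for all sense and stop codons); the `translate` helper is illustrative and uses only the Python standard library.

```python
# Build the standard codon table in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = dna.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the DNA sequence in this record.
print(translate("ATGGCAAAAGTGGCTGACGGGATCCGCTAC"))  # MAKVADGIRY
print(translate("CGCTCGCTTTAA"))                    # RSL
```

Passing the full 1758 bp sequence to `translate` should reproduce the listed protein in the same way.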

Homologs from PAI DB

Gene       GenBank Accn    Product               Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity
ESA_01025  YP_001437130.1  hypothetical protein  Not tested               Not named   Protein         0.0    89