Gene Information

Name : EcSMS35_A0124 (EcSMS35_A0124)
Accession : YP_001740006.1
Strain :
Genome accession: NC_010488
Putative virulence/resistance : Unknown
Product : Tn3 family transposase
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG4644
EC number : -
Position : 93016 - 95982 bp
Length : 2967 bp
Strand : +
Note : identified by match to protein family HMM PF01526

DNA sequence :
ATGCCACGTCGTTCAATCCTGTCCGCCGCCGAGCGCGAAAGCCTGCTGGCGTTGCCGGACACCAAGGATGAGTTGATCCG
TCACTACACGTTCAGCGAAACCGACCTCTCCATCATCCGGCAGCGGCGCGGCCCGGCCAACCGGCTGGGCTTCGCCGTGC
AGCTCTGTTACCTGCGCTTTCCTGGTGTCATCCTGGGCGTCGATGAGCCGCCGTTTCCGCCCTTGTTGAAACTGGTCGCC
GACCAGCTCAAGGTCAGCGTCGAAAGCTGGGACGAATACGGGCAGCGGGAGCAGACCCGGCGCGAGCACCTGGTCGAACT
GCAAACGGTGTTCGGCTTCCAGCCCTTTACCATGGGCCACTACCGGCAGGCCGTCCAGTTGCTGACCGAGATGGCCTTGC
AGACCGACAAGGGCATCGTGCTGGCCAGCACCTTGATCGAGCACCTGCGGCAGCAGTCGGTCATTCTGCCTGCCCTCAAC
GCCGTCGAGCGGGCGAGCGCCGAAGCAATCACCCGCGCCAACCGGCGCATCTACGATGCCTTGGCCGAACCGCTGTCGGA
CGCGCATCGCCGCCGCCTCGACGATCTGCTCAAGCGTCGGGACAACGGCAAAACGACCTGGCTGGCCTGGCTGCGCCAAT
CGCCCGTCAAACCGAATTCGCGGCACATGCTGGAACACATCGAACGCCTCAAAGCGTGGCAGGCGCTCGACCTGCCTTCT
GGCATCGAGCGGTCGGTGCACCAGAACCGCCTGCTCAAGATCGCCCGTGAGGGTGGCCAGATGACGCCCGCCGACCTGGC
CAAGTTCGAGGCGCAGCGACGCTATGCCACCCTGGTGGCGCTTGCCATCGAGGGCATGGCCACCGTCACCGACGAAATCA
TCGACCTGCACGACCGCATCCTGGGCAAGCTGTTCAACGCCGCCAAGAACAAGCATCAGCAGCAATTCCAGGCGTCCGGC
AAGGCGATCAACGCCAAGGTGCGGCTGTTCGGCCGCATCGGCCAGGCGCTGATCGAGGCCAAGCAGGCGGGCCGCGATCC
GTTCGCCGCCATCGAGGCCGTCATGTCCTGGGATGCCTTCGCCGAGAGCGTCACCGAAGCGCAGAAGCTTGCGCAGCCCG
AGGACTTCGATTTCCTGCACCGCATCGGCGAAAGCTACGCCACGCTGCGCCGCTACGCGCCGGAATTCCTTGCCGTGCTC
AAGCTGCGGGCCGCTCCCGCCGCGAAGGACGTGCTCGACGCCATCGAGGTGCTGCGCGGCATGAACAGCGACAACGCCCG
CAAGGTGCCCGCCGACGCGCCGACCGAGTTCATCAAGCCGCGCTGGCAGAAGCTGGTCATGACCGACACCGGCATCGACC
GGCGCTACTACGAACTGTGCGCGCTGTCGGAGATGAAGAACGCGTTGCGTTCCGGCGACATCTGGGTGCAGGGGTCGCGC
CAGTTCAAGGACTTCGAGGACTACCTGGTGCCGCCCGCGAAATTCGCCAGCCTCAAGCAGGCCAGCGAATTGCCGCTGGC
CGTGGCCACCGACTGCAACCGGTACCTGAACGACCGGCTGACGCTGCTGGAAACACAGCTTGCCACCGTCAACCGTATGG
CGACGGCCAACGAGCTGCCGGACGCCATCATCACCGAGTCAGGCTTGAAGATCACGCCGCTCGACGCGGCGGTACCCGAC
ACCGCCCAAGCGCTGATCGACCAGACGGCAATGATCCTGCCGCACGTCAAGATCACCGAACTGCTGCTGGAGGTGGACGA
ATGGACGGGCTTCACTCGGCATTTCGCGCATCTGAAATCGGGCGACCCGGCCAAAGACAAGAACCTGTTGCTGACCACGA
TCCTCGCCGACGCGATCAACCTGGGCCTGACCAAGATGGCGGAGTCTTGCCCCGGCACGACCTACGCCAAGCTGGCTTGG
CTGCAAGCCTGGCACATCCGCGACGAAACCTACGGGGCGGCGCTGGCCGATCTGGTCAACGCACAGTTCCGCCATCCCTT
CGCCGAGCACTGGGGCGACGGCACCACCTCATCGTCGGACGGCCAGAACTTCCGCACCGGCAGCAAGGCCGAGAGCACCG
GCCACATCAACCCGAAATACGGGAGCAGCCCAGGGCGGACGTTCTACACCCACATTTCTGACCAGTACGCGCCATTTCAC
ACCAAGGTCGTGAACGTCGGCGTGCGCGATTCGACCTACGTGCTCGACGGCCTGCTGTACCACGAGTCCGACTTGCGGAT
CGAGGAGCATTACACCGACACGGCGGGCTTCACCGATCACGTCTTCGCCCTGATGCACCTCCTGGGCTTCCGCTTCGCGC
CGCGCATCCGCGACCTGGGCGACACCAAGCTCTACATCCCGAAGGGCGACGCCGCCTATGACGCGCTGAAACCCATGATC
GGCGGCACGCTCAACATCAAGCACGTCCGCGCCCATTGGGACGAAATCCTGCGGCTGGCCACCTCGATCAAGCAGGGCAC
GGTGACGGCCTCCCTGATGCTCCGAAAGCTCGGCAGCTACCCACGCCAGAACGGCCTGGCCGTGGCGCTCCGCGAGCTGG
GCCGCATCGAGCGCACGCTGTTCATCCTGGACTGGCTGCAAAGCGTGGAACTGCGCCGCCGCGTGCATGCCGGCCTGAAC
AAGGGCGAGGCGCGCAATGCGCTGGCCAGGGCAGTGTTTTTCAACCGCCTGGGTGAAATCCGCGACCGCAGTTTCGAGCA
GCAGCGCTACCGGGCTAGCGGCCTCAATCTGGTAACGGCTGCCGTCGTGTTGTGGAACACGGTCTATCTGGAACGGGCTG
CGCACGCGCTGCGTGGCAACGGCCATGCCGTTGATGACGCGCTGTTGCAGTACCTGTCGCCGCTCGGTTGGGAGCACATC
AACCTCACCGGCGATTACCTCTGGCGCAGCAGCGCCAAGATCGGCGCGGGCAAGTTCAGGCCGCTACGACCGCTGCAACC
GGCTTAG

Protein sequence :
MPRRSILSAAERESLLALPDTKDELIRHYTFSETDLSIIRQRRGPANRLGFAVQLCYLRFPGVILGVDEPPFPPLLKLVA
DQLKVSVESWDEYGQREQTRREHLVELQTVFGFQPFTMGHYRQAVQLLTEMALQTDKGIVLASTLIEHLRQQSVILPALN
AVERASAEAITRANRRIYDALAEPLSDAHRRRLDDLLKRRDNGKTTWLAWLRQSPVKPNSRHMLEHIERLKAWQALDLPS
GIERSVHQNRLLKIAREGGQMTPADLAKFEAQRRYATLVALAIEGMATVTDEIIDLHDRILGKLFNAAKNKHQQQFQASG
KAINAKVRLFGRIGQALIEAKQAGRDPFAAIEAVMSWDAFAESVTEAQKLAQPEDFDFLHRIGESYATLRRYAPEFLAVL
KLRAAPAAKDVLDAIEVLRGMNSDNARKVPADAPTEFIKPRWQKLVMTDTGIDRRYYELCALSEMKNALRSGDIWVQGSR
QFKDFEDYLVPPAKFASLKQASELPLAVATDCNRYLNDRLTLLETQLATVNRMATANELPDAIITESGLKITPLDAAVPD
TAQALIDQTAMILPHVKITELLLEVDEWTGFTRHFAHLKSGDPAKDKNLLLTTILADAINLGLTKMAESCPGTTYAKLAW
LQAWHIRDETYGAALADLVNAQFRHPFAEHWGDGTTSSSDGQNFRTGSKAESTGHINPKYGSSPGRTFYTHISDQYAPFH
TKVVNVGVRDSTYVLDGLLYHESDLRIEEHYTDTAGFTDHVFALMHLLGFRFAPRIRDLGDTKLYIPKGDAAYDALKPMI
GGTLNIKHVRAHWDEILRLATSIKQGTVTASLMLRKLGSYPRQNGLAVALRELGRIERTLFILDWLQSVELRRRVHAGLN
KGEARNALARAVFFNRLGEIRDRSFEQQRYRASGLNLVTAAVVLWNTVYLERAAHALRGNGHAVDDALLQYLSPLGWEHI
NLTGDYLWRSSAKIGAGKFRPLRPLQPA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
EC042_4089 YP_006098372.1 transposase Not tested Tn2411 Protein 0.0 95
tnpA AAL08440.1 transposase TnpA Not tested SRL Protein 0.0 93
tnpA ACF06153.1 transposase Not tested Tn5036-like Protein 0.0 88
tnpA ACY75538.1 TnpA Not tested Tn6060 Protein 0.0 73
EXA24 ABD94633.1 truncated Tn1721 tranposase Not tested ExoU island A Protein 0.0 73

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
EcSMS35_A0124 YP_001740006.1 Tn3 family transposase VFG1031 Protein 0.0 93