Gene Information

Name : EcSMS35_2272 (EcSMS35_2272)
Accession : YP_001744320.1
Strain : Escherichia coli SMS-3-5
Genome accession : NC_010498
Putative virulence/resistance : Unknown
Product : ISL3 family transposase
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG3464
EC number : -
Position : 2292586 - 2294205 bp
Length : 1620 bp
Strand : +
Note : identified by match to protein family HMM PF01610
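
The Position and Length fields can be cross-checked with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly but which agrees with the listed length of 1620 bp. The function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(2292586, 2294205)
codons = length_bp // 3          # includes the terminal stop codon
residues = codons - 1            # amino acids expected in the translated product

print(length_bp, codons, residues)  # 1620 540 539
```

Under that assumption, a 1620 bp coding sequence corresponds to 540 codons, that is, 539 residues plus one stop codon.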

DNA sequence :
ATGGGCAACGCAATGCACTCTCTTAAGACACTTCTACAGTTACCTTGCGGATGGCGATGCAGTCGACAAATTATTAGCTC
TGACGGTATCACCCTCCATCTCCACGGAAAACGCAAAACAGCACAATGTCCTGAATGCTCTAAGCGTAGCGACTCTGTTC
ATAGTTCTCGTCGGCGCCGGATACAGCATCTACCCTGCTCCGGGCAGACGCTATGGCTTGTATTTTCCGTCCGCCACTGG
TACTGCCGTAACCCTGTTTGTTCACGAAAAATTTTTGCCGAGTCGCTTGCTCCCTTCGCCGGTTCACACCAGCAGTCTTC
ACAGGCGTTACAAAATTTACAACGTCAACTGGGATTAATAGCCGGAGGTGAGGCTGGAAAACGGGCTGCAACGGCAGTGG
GTCTCCGTTGCAGTGCAGATACTCTTCTTCGCAGGGTTATCAATACCCCGGGGACGAAACAGTCAGGCGCGCCTCATGTC
GGTATTGATGAGTGGGCGTGGCATCGGGGCCACCGTTACGGTAAGTTAATCGTCAATCTTGATACTCACCGTCCCCTCGT
CCTGCTTCCCGGTCGTGATCAGCGTACGCTGGCGACCTGGTTCAGAAAATATCCGGAAATACAGGTTGTCTCGCGTGATC
GCAGTGGAGTCTATGCAACAGCAGCACGTGAAGGTGCACCTCAGGCCAGACAGGTGGCCGATCGATGGCACCTGCTAAAA
AATATTGGCGATGCGCTTGAACGAATGATGTACAGACATATACCTCTGATACGTCTTGTTGCCAGTGAGTTGTCACTAAA
GAAATCACCTGAGCCAGAACTGTCTGTGCCTGCAGTATCGCTCCGTCGTCCGGAACGCCTTAAACAGCAAACCCGCAAAA
AACGGCATCAGCGTTGGACAGAGGTTATGGCCCTGCATAACAAGGGATGTAGTTTCAGGGAAATATCCCGTATTACAGGC
CTGTCGCGTGTGACAGTCAGTCGCTGGGTGCGTTCAGGAACATTCCCTGAAATGTCAACCCGACCTCCAAAGCGAGGGCT
TCTGGACCCATGGAGGGAGTGGTTAAAAGAGCAACGAGAAAGCGGTAATTATAACGCCAGCCGGATATGGCGGGAAATGG
TGGCCCGGGGGTTTACAGGCAGTGAAACCATCGTCAGGGATGCTGTTGCCAAATGGCGTAAAGGCTGGATCCCACCGGTT
ACTACTGCCGCCAGACTTCCTTCAGTGTCCCGGGTAAGCCGGTGGTTGATGCCCTGGAGAATAATCAGGGGGGAAGAAAA
TTATGCTTCCCGATTTATTAGTCTGATGTGTGAAAAAGAACCGGAGCTGAAAATAGCGCAGCAACTGGTACTCGAGTTCT
ACCGTATTCTGAAAACCCAAAATAAATCACAGCTTAGCAGCTGGTTCACTCGAGTCCACGAAAGCGGCTCAGCAGAACTT
CGGCGCGTGGCTGCGGGGATGGAAGCTGATGCTGCGGCTATATGTGAGGCAATCAGCAGTCGCTGGAGTAATGGTGTTGT
CGAAGGTCATGTAAATCGCCTGAAGATGTTGAAACGCCAGATGTATGGTCGAGCCGGATTTGAACTGCTCAGGCAGAGGG
TCATGAGTCCACTGGCATGA

Protein sequence :
MGNAMHSLKTLLQLPCGWRCSRQIISSDGITLHLHGKRKTAQCPECSKRSDSVHSSRRRRIQHLPCSGQTLWLVFSVRHW
YCRNPVCSRKIFAESLAPFAGSHQQSSQALQNLQRQLGLIAGGEAGKRAATAVGLRCSADTLLRRVINTPGTKQSGAPHV
GIDEWAWHRGHRYGKLIVNLDTHRPLVLLPGRDQRTLATWFRKYPEIQVVSRDRSGVYATAAREGAPQARQVADRWHLLK
NIGDALERMMYRHIPLIRLVASELSLKKSPEPELSVPAVSLRRPERLKQQTRKKRHQRWTEVMALHNKGCSFREISRITG
LSRVTVSRWVRSGTFPEMSTRPPKRGLLDPWREWLKEQRESGNYNASRIWREMVARGFTGSETIVRDAVAKWRKGWIPPV
TTAARLPSVSRVSRWLMPWRIIRGEENYASRFISLMCEKEPELKIAQQLVLEFYRILKTQNKSQLSSWFTRVHESGSAEL
RRVAAGMEADAAAICEAISSRWSNGVVEGHVNRLKMLKRQMYGRAGFELLRQRVMSPLA
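
The protein sequence can be checked against the DNA sequence by translating the reading frame. The following is a minimal sketch using the standard genetic code, written out by hand rather than taken from a library; the helper name `translate` is hypothetical, and only the first 80 bases of the DNA sequence are used as input here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored, trailing bases that do not form a full codon are
    dropped, and codons with unexpected characters are reported as 'X'.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the DNA sequence in this record (80 bases, 26 full codons).
first_line = (
    "ATGGGCAACGCAATGCACTCTCTTAAGACACTTCTACAGTTACCTTGCGGATGGCGATGCAGTCGACAAATTATTAGCTC"
)
print(translate(first_line))  # MGNAMHSLKTLLQLPCGWRCSRQIIS
```

The output matches the first 26 residues of the protein sequence above. Passing the full DNA sequence, with its lines joined, to the same function would translate the whole gene in the same way.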

• Homologs from PAI DB

Gene        GenBank Accn    Product               Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity (%)
st55        CAC81893.1      ST55 protein          Not tested               LEE II      Protein         0.0    96
unnamed     CAI43808.1      putative transposase  Not tested               LEE         Protein         0.0    96
ECO26_5298  YP_003232177.1  transposase           Not tested               LEE         Protein         0.0    96
unnamed     AAL57569.1      putative transposase  Not tested               LEE         Protein         0.0    96