Gene Information

Name : EcSMS35_2272 (EcSMS35_2272)
Accession : YP_001744320.1
Strain : Escherichia coli SMS-3-5
Genome accession: NC_010498
Putative virulence/resistance : Unknown
Product : ISL3 family transposase
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG3464
EC number : -
Position : 2292586 - 2294205 bp
Length : 1620 bp
Strand : +
Note : identified by match to protein family HMM PF01610
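
The Position and Length fields above agree if the coordinates are read as 1-based and inclusive. That convention is an assumption here; the record does not state it. A minimal sketch of the check, using a hypothetical helper name:

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates copied from the Position field of this record.
length_bp = feature_length(2292586, 2294205)

# Number of complete codons in the open reading frame (stop codon included).
codons = length_bp // 3

print(length_bp, codons)  # 1620 540
```

With 540 codons, one of which is the stop codon, the expected protein length is 539 residues.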

DNA sequence :
ATGGGCAACGCAATGCACTCTCTTAAGACACTTCTACAGTTACCTTGCGGATGGCGATGCAGTCGACAAATTATTAGCTC
TGACGGTATCACCCTCCATCTCCACGGAAAACGCAAAACAGCACAATGTCCTGAATGCTCTAAGCGTAGCGACTCTGTTC
ATAGTTCTCGTCGGCGCCGGATACAGCATCTACCCTGCTCCGGGCAGACGCTATGGCTTGTATTTTCCGTCCGCCACTGG
TACTGCCGTAACCCTGTTTGTTCACGAAAAATTTTTGCCGAGTCGCTTGCTCCCTTCGCCGGTTCACACCAGCAGTCTTC
ACAGGCGTTACAAAATTTACAACGTCAACTGGGATTAATAGCCGGAGGTGAGGCTGGAAAACGGGCTGCAACGGCAGTGG
GTCTCCGTTGCAGTGCAGATACTCTTCTTCGCAGGGTTATCAATACCCCGGGGACGAAACAGTCAGGCGCGCCTCATGTC
GGTATTGATGAGTGGGCGTGGCATCGGGGCCACCGTTACGGTAAGTTAATCGTCAATCTTGATACTCACCGTCCCCTCGT
CCTGCTTCCCGGTCGTGATCAGCGTACGCTGGCGACCTGGTTCAGAAAATATCCGGAAATACAGGTTGTCTCGCGTGATC
GCAGTGGAGTCTATGCAACAGCAGCACGTGAAGGTGCACCTCAGGCCAGACAGGTGGCCGATCGATGGCACCTGCTAAAA
AATATTGGCGATGCGCTTGAACGAATGATGTACAGACATATACCTCTGATACGTCTTGTTGCCAGTGAGTTGTCACTAAA
GAAATCACCTGAGCCAGAACTGTCTGTGCCTGCAGTATCGCTCCGTCGTCCGGAACGCCTTAAACAGCAAACCCGCAAAA
AACGGCATCAGCGTTGGACAGAGGTTATGGCCCTGCATAACAAGGGATGTAGTTTCAGGGAAATATCCCGTATTACAGGC
CTGTCGCGTGTGACAGTCAGTCGCTGGGTGCGTTCAGGAACATTCCCTGAAATGTCAACCCGACCTCCAAAGCGAGGGCT
TCTGGACCCATGGAGGGAGTGGTTAAAAGAGCAACGAGAAAGCGGTAATTATAACGCCAGCCGGATATGGCGGGAAATGG
TGGCCCGGGGGTTTACAGGCAGTGAAACCATCGTCAGGGATGCTGTTGCCAAATGGCGTAAAGGCTGGATCCCACCGGTT
ACTACTGCCGCCAGACTTCCTTCAGTGTCCCGGGTAAGCCGGTGGTTGATGCCCTGGAGAATAATCAGGGGGGAAGAAAA
TTATGCTTCCCGATTTATTAGTCTGATGTGTGAAAAAGAACCGGAGCTGAAAATAGCGCAGCAACTGGTACTCGAGTTCT
ACCGTATTCTGAAAACCCAAAATAAATCACAGCTTAGCAGCTGGTTCACTCGAGTCCACGAAAGCGGCTCAGCAGAACTT
CGGCGCGTGGCTGCGGGGATGGAAGCTGATGCTGCGGCTATATGTGAGGCAATCAGCAGTCGCTGGAGTAATGGTGTTGT
CGAAGGTCATGTAAATCGCCTGAAGATGTTGAAACGCCAGATGTATGGTCGAGCCGGATTTGAACTGCTCAGGCAGAGGG
TCATGAGTCCACTGGCATGA

Protein sequence :
MGNAMHSLKTLLQLPCGWRCSRQIISSDGITLHLHGKRKTAQCPECSKRSDSVHSSRRRRIQHLPCSGQTLWLVFSVRHW
YCRNPVCSRKIFAESLAPFAGSHQQSSQALQNLQRQLGLIAGGEAGKRAATAVGLRCSADTLLRRVINTPGTKQSGAPHV
GIDEWAWHRGHRYGKLIVNLDTHRPLVLLPGRDQRTLATWFRKYPEIQVVSRDRSGVYATAAREGAPQARQVADRWHLLK
NIGDALERMMYRHIPLIRLVASELSLKKSPEPELSVPAVSLRRPERLKQQTRKKRHQRWTEVMALHNKGCSFREISRITG
LSRVTVSRWVRSGTFPEMSTRPPKRGLLDPWREWLKEQRESGNYNASRIWREMVARGFTGSETIVRDAVAKWRKGWIPPV
TTAARLPSVSRVSRWLMPWRIIRGEENYASRFISLMCEKEPELKIAQQLVLEFYRILKTQNKSQLSSWFTRVHESGSAEL
RRVAAGMEADAAAICEAISSRWSNGVVEGHVNRLKMLKRQMYGRAGFELLRQRVMSPLA
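
The protein sequence above corresponds to a forward-frame translation of the DNA sequence. The sketch below illustrates this on the first and last codons only. It assumes the standard codon-to-amino-acid assignments (which the bacterial genetic code shares); `translate` and `CODON_TABLE` are illustrative names, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon.

    Trailing bases that do not fill a codon are ignored. Ambiguous bases
    (for example N) are not handled and raise KeyError.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 12 bases of the DNA sequence in this record.
print(translate("ATGGGCAACGCAATGCACTCTCTTAAGACA"))  # MGNAMHSLKT
print(translate("CCACTGGCATGA"))                    # PLA*
```

Both results match the start (MGNAMHSLKT) and end (PLA) of the listed protein sequence, with the final TGA codon acting as the stop.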

• Homologs from PAI DB

Gene        GenBank Accn    Product               Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity
ECO26_5298  YP_003232177.1  transposase           Not tested               LEE         Protein         0.0    96
unnamed     AAL57569.1      putative transposase  Not tested               LEE         Protein         0.0    96
st55        CAC81893.1      ST55 protein          Not tested               LEE II      Protein         0.0    96
unnamed     CAI43808.1      putative transposase  Not tested               LEE         Protein         0.0    96