Gene Information

Name : EC042_4089 (EC042_4089)
Accession : YP_006098372.1
Strain : Escherichia coli 042
Genome accession: NC_017626
Putative virulence/resistance : Unknown
Product : transposase
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4354499 - 4357465 bp
Length : 2967 bp
Strand : -
Note : -
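
The Position and Length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive, and that the feature is a complete coding sequence ending in a stop codon. The function names `cds_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
def cds_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


length = cds_length(4354499, 4357465)
print(length)                           # 2967
print(expected_protein_length(length))  # 988
```

Under these assumptions the coordinates reproduce the stated length of 2967 bp, which corresponds to 989 codons, or 988 amino acids plus a stop codon.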

DNA sequence :
ATGCCACGTCGTTCCATCCTGTCCGCCGCCGAGCGGGAAAGCCTGCTGGCGTTGCCGGACTCCAAGGACGACCTGATCCG
ACATTACACATTCAACGATACCGACCTCTCGATCATCCGACAGCGGCGCGGGCCAGCCAATCGGCTGGGCTTCGCGGTGC
AGCTCTGTTACCTGCGCTTTCCCGGCGTCATCCTGGGCGTCGATGAACTACCGTTCCCGCCCTTGTTGAAGCTGGTCGCC
GACCAGCTCAAGGTCGGCGTCGAAAGCTGGAACGAGTACGGCCAGCGGGAGCAGACCCGGCGCGAGCACCTGAGCGAGCT
GCAAACCGTGTTCGGTTTCCGGCCCTTCACCATGAGCCATTACCGGCAGGCCGTCCAGATGCTGACCGAGCTGGCGATGC
AAACCGACAAAGGCATCGTGCTGGCCAGCGCCTTGATCGGGCACCTGCGGCGGCAGTCGGTCATTCTGCCCGCCCTCAAC
GCCGTCGAGCGGGCGAGTGCCGAGGCGATCACCCGTGCTAACCGGCGCATCTACGACGCCTTGGCCGAACCACTGGCGGA
CGCGCATCGCCGCCGCCTCGACGATCTGCTCAAGCGCCGGGACAACGGCAAGACGACCTGGTTGGCTTGGTTGCGCCAGT
CTCCGGCCAAGCCAAATTCGCGGCATATGCTGGAACACATCGAACGCCTCAAGGCATGGCAGGCACTCGATCTGCCTACC
GGCATCGAGCGGCTGGTTCACCAGAACCGCCTGCTCAAGATTGCCCGCGAGGGCGGCCAGATGACACCCGCCGACCTGGC
CAAATTCGAGCCGCAACGGCGCTACGCCACTCTCGTGGCGCTGGCCACCGAGGGCATGGCCACCGTCACCGACGAAATCA
TCGACCTGCACGACCGCATCCTGGGTAAGCTGTTTAACGCTGCCAAGAATAAGCATCAGCAGCAGTTCCAGGCGTCAGGC
AAGGCCATCAACGCCAAGGTACGTCTGTACGGGCGCATCGGTCAGGCGCTGATCGACGCCAAGCAATCAGGCCGCGATGC
GTTTGCCGCCATCGAGGCCGTCATGTCCTGGGATTCCTTTGCCGAGAGCGTCACCGAGGCGCAGAAGCTCGCGCAACCCG
ATGACTTCGATTTCCTGCATCGCATCGGCGAGAGCTACGCCACCCTGCGCCGCTATGCACCGGAATTCCTTGCCGTGCTC
AAGCTGCGGGCCGCGCCCGCCGCCAAAAACGTGCTTGATGCCATTGAGGTGCTGCGCGGCATGAACACCGACAACGCCCG
CAAGCTGCCAGCCGATGCACCGACCGGCTTCATCAAGCCGCGCTGGCAGAAACTGGTGATGACCGACGCCGGCATCGACC
GGCGCTACTACGAACTGTGCGCGCTGTCCGAGTTGAAGAACTCCCTGCGCTCGGGCGACATCTGGGTGCAGGGTTCACGC
CAGTTCAAGGACTTCGAGGACTACCTGGTACCGCCCGAGAAGTTCACCAGCCTCAAGCAGTCCAGCGAATTGCCGCTGGC
CGTGGCCACCGACTGCGAACAATATCTGCATGAGCGGCTGACGCTGCTGGAAGCACAACTTGCCACCGTCAACCGCATGG
CGGCAGCCAACGACCTGCCGGATGCCATCATCACCGAGTCGGGCTTGAAGATCACGCCGCTGGATGCGGCGGTGCCCGAC
ACCGCGCAGGCGCTGATAGACCAGACAGCCATGGTCCTGCCGCACGTCAAGATCACCGAACTGCTGCTCGAAGTCGATGA
GTGGACGGGCTTCACCCGGCACTTCACGCACTTGAAATCGGGCGATCTGGCCAAGGACAAGAACCTGTTGTTGACCACGA
TCCTGGCCGACGCGATCAACCTGGGCCTGACCAAGATGGCCGAGTCCTGCCCCGGCACGACCTACGCGAAGCTCGCTTGG
CTGCAAGCCTGGCATACCCGCGACGAAACGTACTCGACAGCGTTGGCTGAACTGGTCAACGCTCAGTTTCGGCATCCCTT
TGCCGGGCACTGGGGCGATGGCACCACATCATCATCGGACGGACAGAATTTCCGAACCGCTAGCAAGGCAAAGAGCACGG
GGCACATCAACCCAAAATATGGCAGCAGCCCAGGACGGACTTTCTACACCCACATCTCCGACCAATACGCGCCATTCCAC
ACCAAGGTGGTCAATGTCGGCCTGCGCGACTCAACCTACGTGCTCGACGGCCTGCTGTACCACGAATCCGACCTGCGGAT
CGAGGAGCACTACACCGACACGGCGGGCTTCACCGATCACGTCTTCGCCCTGATGCACCTCTTGGGCTTCCGCTTCGCGC
CGCGCATCCGCGACCTGGGCGACACCAAGCTCTACATCCCGAAGGGCGATGCCGCCTATGACGCGCTCAAGCCGATGATC
GGCGGCACGCTCAACATCAAGCACGTCCGCGCCCATTGGGACGAAATCCTGCGGCTGGCCACCTCGATCAAGCAGGGCAC
GGTGACGGCCTCGCTGATGCTCAGGAAACTCGGCAGCTACCCGCGCCAGAACGGCTTGGCCGTCGCGCTGCGCGAGTTGG
GCCGCATCGAGCGCACGCTGTTCATCCTCGACTGGCTGCAAAGCGTCGAGCTACGCCGCCGCGTGCATGCCGGGCTGAAC
AAGGGCGAGGCGCGCAATGCGCTGGCCCGTGCCGTGTTCTTCAACCGCCTTGGTGAAATCCGTGACCGCAGTTTCGAGCA
GCAGCGCTACCGGGCCAGCGGCCTCAACCTGGTGACGGCGGCCATCGTGCTGTGGAACACGGTCTACCTGGAGCGTGCGG
CGCATGCGTTGCGCGGCAATGGTCATGCCGTCGATGACTCGCTATTGCAGTACCTGTCGCCACTCGGCTGGGAGCACATC
AACCTGACCGGTGATTACCTATGGCGCAGCAGCGCCAAGATCGGCGCGGGGAAGTTCAGGCCGCTACGGCCTCTGCAACC
GGCTTAG

Protein sequence :
MPRRSILSAAERESLLALPDSKDDLIRHYTFNDTDLSIIRQRRGPANRLGFAVQLCYLRFPGVILGVDELPFPPLLKLVA
DQLKVGVESWNEYGQREQTRREHLSELQTVFGFRPFTMSHYRQAVQMLTELAMQTDKGIVLASALIGHLRRQSVILPALN
AVERASAEAITRANRRIYDALAEPLADAHRRRLDDLLKRRDNGKTTWLAWLRQSPAKPNSRHMLEHIERLKAWQALDLPT
GIERLVHQNRLLKIAREGGQMTPADLAKFEPQRRYATLVALATEGMATVTDEIIDLHDRILGKLFNAAKNKHQQQFQASG
KAINAKVRLYGRIGQALIDAKQSGRDAFAAIEAVMSWDSFAESVTEAQKLAQPDDFDFLHRIGESYATLRRYAPEFLAVL
KLRAAPAAKNVLDAIEVLRGMNTDNARKLPADAPTGFIKPRWQKLVMTDAGIDRRYYELCALSELKNSLRSGDIWVQGSR
QFKDFEDYLVPPEKFTSLKQSSELPLAVATDCEQYLHERLTLLEAQLATVNRMAAANDLPDAIITESGLKITPLDAAVPD
TAQALIDQTAMVLPHVKITELLLEVDEWTGFTRHFTHLKSGDLAKDKNLLLTTILADAINLGLTKMAESCPGTTYAKLAW
LQAWHTRDETYSTALAELVNAQFRHPFAGHWGDGTTSSSDGQNFRTASKAKSTGHINPKYGSSPGRTFYTHISDQYAPFH
TKVVNVGLRDSTYVLDGLLYHESDLRIEEHYTDTAGFTDHVFALMHLLGFRFAPRIRDLGDTKLYIPKGDAAYDALKPMI
GGTLNIKHVRAHWDEILRLATSIKQGTVTASLMLRKLGSYPRQNGLAVALRELGRIERTLFILDWLQSVELRRRVHAGLN
KGEARNALARAVFFNRLGEIRDRSFEQQRYRASGLNLVTAAIVLWNTVYLERAAHALRGNGHAVDDSLLQYLSPLGWEHI
NLTGDYLWRSSAKIGAGKFRPLRPLQPA
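
The correspondence between the DNA and protein sequences above can be spot-checked with a short translation sketch. It assumes the standard genetic code and that the DNA sequence is listed in coding orientation (it begins with ATG and ends with TAG, even though the gene lies on the minus strand of the genome). The `translate` function and `CODON_TABLE` name are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nucleotides of the DNA sequence listed above.
print(translate("ATGCCACGTCGTTCCATCCTGTCCGCCGCC"))  # MPRRSILSAA
# Last 6 nucleotides of the DNA sequence listed above.
print(translate("GCTTAG"))                          # A*
```

The first ten translated residues match the start of the listed protein sequence (MPRRSILSAA), and the final two codons give the terminal alanine followed by the stop codon.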

• Homologs from PAI DB

Gene        GenBank Accn    Product                      Virulence or Resistance  PAI or REI     Alignment Type  E-val  Identity
EC042_4089  YP_006098372.1  transposase                  Not tested               Tn2411         Protein         0.0    100
tnpA        AAL08440.1      transposase TnpA             Not tested               SRL            Protein         0.0    99
tnpA        ACF06153.1      transposase                  Not tested               Tn5036-like    Protein         0.0    89
tnpA        ACY75538.1      TnpA                         Not tested               Tn6060         Protein         0.0    73
EXA24       ABD94633.1      truncated Tn1721 tranposase  Not tested               ExoU island A  Protein         0.0    73

• Homologs from VFDB (virulence genes)

Gene        GenBank Accn    Product      ID of source DB  Alignment Type  E-val  Identity
EC042_4089  YP_006098372.1  transposase  VFG1031          Protein         0.0    99