Gene Information

Name : ECUMN_4811 (ECUMN_4811)
Accession : YP_002415404.1
Strain : Escherichia coli UMN026
Genome accession: NC_011751
Putative virulence/resistance : Unknown
Product : transposase TnpA, Tn21
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG4644
EC number : -
Position : 4988794 - 4991865 bp
Length : 3072 bp
Strand : -
Note : Evidence 2b : Function of strongly homologous gene; Product type pe : putative enzyme
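
The Position and Length fields above agree if the coordinates are read as 1-based and inclusive, which is an assumption about this record's convention rather than something it states. A minimal sketch of that check follows; the helper name feature_length is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(4988794, 4991865)

# A coding sequence of this length holds length_bp // 3 codons,
# the last of which is the stop codon.
codons = length_bp // 3
residues = codons - 1

print(length_bp, codons, residues)  # 3072 1024 1023
```

The 3072 bp result matches the Length field, and 1023 residues is the number of amino acids expected for the listed protein once the stop codon is excluded.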

DNA sequence :
ATGCCACGTCGTTCCATCCTGTCCGCCGCCGAGCGGGAAAGCCTGCTGGCGTTGCCGGACTCCAAGGACGACCTGATCCG
ACATTACACATTCAACGATACCGACCTCTCGATCATCCGACAGCGGCGCGGGCCAGCCAATCGGCTGGGCTTCGCGGTGC
AGCTCTGTTACCTGCGCTTTCCCGGCGTCATCCTGGGCGTCGATGAACTACCGTTCCCGCCCTTGTTGAAGCTGGTCGCC
GACCAGCTCAAGGTCGGCGTCGAAAGCTGGAACGAGTACGGCCAGCGGGAGCAGACCCGGCGCGAGCACCTGAGCGAGCT
GCAAACCGTGTTCGGTTTCCGGCCCTTCACCATGAGCCATTACCGGCAGGCCGTCCAGATGCTGACCGAGCTGGCGATGC
AAACCGACAAAGGCATCGTGCTGGCCAGCGCCTTGATCGGGCACCTGCGGCGGCAGTCGGTCATTCTGCCCGCCCTCAAC
GCCGTCGAGCGGGCGAGTGCCGAGGCGATCACCCGTGCTAACCGGCGCATCTACGACGCCTTGGCCGAACCACTGGCGGA
CGCGCATCGCCGCCGCCTCGACGATCTGCTCAAGCGCCGGGACAACGGCAAGACGACCTGGTTGGCTTGGTTGCGCCAGT
CTCCGGCCAAGCCAAATTCGCGGCATATGCTGGAACACATCGAACGCCTCAAGGCATGGCAGGCACTCGATCTGCCTACC
GGCATCGAGCGGCTGGTTCACCAGAACCGCCTGCTCAAGATTGCCCGCGAGGGCGGCCAGATGACACCCGCCGACCTGGC
CAAATTCGAGCCGCAACGGCGCTACGCCACTCTCGTGGCGCTGGCCACCGAGGGCATGGCCACCGTCACCGACGAAATCA
TCGACCTGCACGACCGCATCCTGGGTAAGCTGTTTAACGCTGCCAAGAATAAGCATCAGCAGCAGTTCCAGGCGTCAGGC
AAGGCCATCAACGCCAAGGTACGTCTGTACGGGCGCATCGGTCAGGCGCTGATCGACGCCAAGCAATCAGGCCGCGATGC
GTTTGCCGCCATCGAGGCCGTCATGTCCTGGGATTCCTTTGCCGAGAGCGTCACCGAGGCGCAGAAGCTCGCGCAACCCG
ATGACTTCGATTTCCTGCATCGCATCGGCGAGAGCTACGCCACCCTGCGCCGCTATGCACCGGAATTCCTTGCCGTGCTC
AAGCTGCGGGCCGCGCCCGCCGCCAAAAACGTGCTTGATGCCATTGAGGTGCTGCGCGGCATGAACACCGACAACGCCCG
CAAGCTGCCAGCCGATGCACCGACCGGCTTCATCAAGCCGCGCTGGCAGAAACTGGTGATGACCGACGCCGGCATCGACC
GGCGCTACTACGAACTGTGCGCGCTGTCCGAGTTGAAGAACTCCCTGCGCTCGGGCGACATCTGGGTGCAGGGTTCACGC
CAGTTCAAGGACTTCGAGGACTACCTGGTACCGCCCGAGAAGTTCACCAGCCTCAAGCAGTCCAGCGAATTGCCGCTGGC
CGTGGCCACCGACTGCGAACAATATCTGCATGAGCGGCTGACGCTGCTGGAAGCACAACTTGCCACCGTCAACCGCATGG
CGGCAGCCAACGACCTGCCGGATGCCATCATCACCGAGTCGGGCTTGAAGATCACGCCGCTGGATGCGGCGGTGCCCGAC
ACCGCGCAGGCGCTGATAGACCAGACAGCCATGGTCCTGCCGCACGTCAAGATCACCGAACTGCTGCTCGAAGTCGATGA
GTGGACGGGCTTCACCCGGCACTTCACGCACTTGAAATCGGGCGATCTGGCCAAGGACAAGAACCTGTTGTTGACCACGA
TCCTGGCCGACGCGATCAACCTGGGCCTGACCAAGATGGCCGAGTCCTGCCCCGGCACGACCTACGCGAAGCTCGCTTGG
CTGCAAGCCTGGCATACCCGCGACGAAACGTACTCGACAGCGTTGGCTGAACTGGTCAACGCTCAGTTTCGGCATCCCTT
TGCCGGGCACTGGGGCGATGGCACCACATCATCATCGGACGGACAGAATTTCCGAACCGCTAGCAAGGCAAAGAGCACGG
GGCACATCAACCCAAAATATGGCAGCAGCCCAGGACGGACTTTCTACACCCACATCTCCGACCAATACGCGCCATTCCAC
ACCAAGGTGGTCAATGTCGGCCTGCGCGACTCAACCTACGTGCTCGACGGCCTGCTGTACCACGAATCCGACCTGCGGAT
CGAGGAGCACTACACCGACACGGCGGGCTTCACCGATCACGTCTTCGCCCTGATGCACCTCTTGGGCTTCCGCTTCGCGC
CGCGCATCCGCGACCTGGGCGACACCAAGCTCTACATCCCGAAGGGCGATGCCGCCTATGACGCGCTCAAGCCGATGATC
GGCGGCACGCTCAACATCAAGCACGTCCGCGCCCATTGGGACGAAATCCTGCGGCTGGCCACCTCGATCAAGCAGGGCAC
GGTGACGGCCTCGCTGATGCTCAGGAAACTCGGCAGCTACCCGCGCCAGAACGGCTTGGCCGTCGCGCTGCGCGAGTTGG
GCCGCATCGAGCGCACGCTGTTCATCCTCGACTGGCTGCAAAGCGTCGAGCTACGCCGCCGCGTGCATGCCGGGCTGAAC
AAGGGCGAGGCGCGCAATGCGCTGGCCCGTGCCGTGTTCTTCAACCGCCTTGGTGAAATCCGTGACCGCAGTTTCGAGCA
GCAGCGCTACCGGGCCAGCGGCCTCAACCTGGTGACGGCGGCCATCGTGCTGTGGAACACGGTCTACCTGGAGCGTGCGG
CGCATGCGTTGCGCGGCAATGGTCATGCCGTCGATGACTCGCTATTGCAGTACCTGTCGCCACTCGGCTGGGAGCACATC
AACCTGACCGGTGATTACCTATGGCGCAGCAGCGCCAAGATCGGCGCGGGGAAGTTCAGGCCGCTACGGCCTCTGCAACC
GGCTTGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTA
CCAGGCGCATTTCGCCCAGGGGATCACCATAA

Protein sequence :
MPRRSILSAAERESLLALPDSKDDLIRHYTFNDTDLSIIRQRRGPANRLGFAVQLCYLRFPGVILGVDELPFPPLLKLVA
DQLKVGVESWNEYGQREQTRREHLSELQTVFGFRPFTMSHYRQAVQMLTELAMQTDKGIVLASALIGHLRRQSVILPALN
AVERASAEAITRANRRIYDALAEPLADAHRRRLDDLLKRRDNGKTTWLAWLRQSPAKPNSRHMLEHIERLKAWQALDLPT
GIERLVHQNRLLKIAREGGQMTPADLAKFEPQRRYATLVALATEGMATVTDEIIDLHDRILGKLFNAAKNKHQQQFQASG
KAINAKVRLYGRIGQALIDAKQSGRDAFAAIEAVMSWDSFAESVTEAQKLAQPDDFDFLHRIGESYATLRRYAPEFLAVL
KLRAAPAAKNVLDAIEVLRGMNTDNARKLPADAPTGFIKPRWQKLVMTDAGIDRRYYELCALSELKNSLRSGDIWVQGSR
QFKDFEDYLVPPEKFTSLKQSSELPLAVATDCEQYLHERLTLLEAQLATVNRMAAANDLPDAIITESGLKITPLDAAVPD
TAQALIDQTAMVLPHVKITELLLEVDEWTGFTRHFTHLKSGDLAKDKNLLLTTILADAINLGLTKMAESCPGTTYAKLAW
LQAWHTRDETYSTALAELVNAQFRHPFAGHWGDGTTSSSDGQNFRTASKAKSTGHINPKYGSSPGRTFYTHISDQYAPFH
TKVVNVGLRDSTYVLDGLLYHESDLRIEEHYTDTAGFTDHVFALMHLLGFRFAPRIRDLGDTKLYIPKGDAAYDALKPMI
GGTLNIKHVRAHWDEILRLATSIKQGTVTASLMLRKLGSYPRQNGLAVALRELGRIERTLFILDWLQSVELRRRVHAGLN
KGEARNALARAVFFNRLGEIRDRSFEQQRYRASGLNLVTAAIVLWNTVYLERAAHALRGNGHAVDDSLLQYLSPLGWEHI
NLTGDYLWRSSAKIGAGKFRPLRPLQPAWHCCKVSDEAAFCLIQRPYISKTLLTRRISPRGSP
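
The DNA sequence above starts with ATG and appears to be given in coding orientation, so the protein can be reproduced by translating it codon by codon. The sketch below is one way to do that with a hand-built standard codon table (the codon assignments are the same in the bacterial translation table); the names CODON_TABLE and translate are hypothetical, and only the first and last 30 nucleotides of the record are used as examples.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding-strand DNA string, optionally stopping at the first stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the DNA sequence listed in this record.
print(translate("ATGCCACGTCGTTCCATCCTGTCCGCCGCC"))  # MPRRSILSAA

# Last 30 nt of the record, ending in the TAA stop codon.
print(translate("AGGCGCATTTCGCCCAGGGGATCACCATAA"))  # RRISPRGSP
```

Both outputs match the start and end of the protein sequence listed above. For routine work, an established library such as Biopython would normally be preferred over a hand-written table.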

• Homologs from PAI DB

Gene        GenBank Accn    Product                      Virulence or Resistance  PAI or REI     Alignment Type  E-val  Identity (%)
EC042_4089  YP_006098372.1  transposase                  Not tested               Tn2411         Protein         0.0    100
tnpA        AAL08440.1      transposase TnpA             Not tested               SRL            Protein         0.0    99
tnpA        ACF06153.1      transposase                  Not tested               Tn5036-like    Protein         0.0    89
tnpA        ACY75538.1      TnpA                         Not tested               Tn6060         Protein         0.0    73
EXA24       ABD94633.1      truncated Tn1721 tranposase  Not tested               ExoU island A  Protein         0.0    73

• Homologs from VFDB (virulence genes)

Gene        GenBank Accn    Product                 ID of source DB  Alignment Type  E-val  Identity (%)
ECUMN_4811  YP_002415404.1  transposase TnpA, Tn21  VFG1031          Protein         0.0    99