Gene Information

Name : Arth_4458 (Arth_4458)
Accession : YP_829209.1
Strain :
Genome accession: NC_008537
Putative virulence/resistance : Unknown
Product : transposase Tn3 family protein
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG4644
EC number : -
Position : 78949 - 81915 bp
Length : 2967 bp
Strand : -
Note : PFAM: transposase Tn3 family protein; KEGG: mfa:Mfla_1495 transposase Tn3

DNA sequence :
TTGGCTCTGCGTTCTTTGCTCACGGCTGCCGAGCGCGGCCAAATCCTGGCAATGCCCGCTGAAACAGAGGACCTTGCAGC
CCACTACACGCTCAGTGATGCGGATATGTCGTTGATCCGGCAACGCCGCGGGGACGCGAACAGGCTGGGCTTTGCGGTCC
AGCTGTGTCTGCTGCGCCACCCGGGCATCGGGCTGGCCGACGACACCGACGTGCCGCCGGAACTCATTGCCTGGCTGGCC
TCCAGCCTGGGCGTTTCCATTGATGCCTGGGACGAGTATGGAACACGTGAGGAAACGCGGCAGGAGCATGGACGGGAGAT
CCGCGGGTATCTGGGCATGTCAGCGTTCGGCATCGCGGACTACCGCTGGCTCGTGGAGCATGTGGGTGTTGTGGCTGCGC
ACACGGACAAGGGCCTGGTCCTGGTCGAAAGCGCGAGGGATTTCCTGCAGGCAAGGAAGGTCGCGTTGCCCGGGATCGGG
GTCATTGAAAAGGCCTGTGCGCAGGCGCTGACCAGGGCCAATAGGCGGATTTACCTCACTTTGGGTGAGCAGCTGACCGT
GGGTCACCGGCAACGCCTCGACGGGCTGCTGCGCCGCCGCCGCGATAGTTCTCTGACGGAAATCGGTTGGCTGCGGCAGG
CGCCGCTGCGACCCAACGCGCGGGCGATGAATGAGCACATCGACCGGCTCACCACCTGGCGTGCGCTGGAGCTGCCCTGG
GCTGCGGGACGGCTGGTTCATCGGAACAGGTTGCTGAAGCTTGCCCGGGAGGGTGCCTCGATGACCGCGGCGGATCTGGC
CAGATTTGAACCGGCACGCCGGTACGCGACCCTCTTCGCCATGGCCACCGAAAGCATGGCCACCGTCACCGACGAAATCA
TCGATCTGCACGACCGGATCATCGGCCGGCTCATCCGGACCGCACAGAACAAGCAAAACCAGGCCACCCTGGCATCCCGC
TCCACCGTCGCCGCCATGATGCGCATCCATTCCAGGCTCGGTGATGCCCTCTTTGAGGCCAAGGAAAACGGCGAAGATCC
CTTCGCCGCCATCGAAACGGCCATCGGCTGGGAATCCCTGGCCGAAAGCATCGCCCACGCCAAGGAACTGACCCGCCCGG
CCCTCGAGGACCCCCTCGCCCTCGTCAGCGCCCACTTCACCACCCTGCGCCGCTACACCCCGGCATTTCTTGCCGTCCTT
GACCTCAACGCGGCCCCGGCGGCACAGGACCTGCTGGCAGCAATCAACCTCGTCCGCACCCTGAACACCGCCGGAGCCCG
AAAAATCCCCGACGATGCGCCCACCTCGTTCGTCCGGCCCCGGTGGAAGCCGCTGGTCTTCACGGAAAACGGCATAGACC
GGGGCTTCTACGAGTTCTGTGCCCTCGCGGAACTAAAGAATGCGCTTCGTTCCGGAGACTTGTGGGTCACCGGATCCCGT
CAATTCCGTGACTTCGATGACTACCTTCTTGCCGGTCCTGACTACACGGTTATGAAAACCACCGGGAAGCTGCCTCTGGT
CACGACCGACGGCGGCGAAAGCTATCTCCAGAACCGGCTGGCCCTGCTCAACGAACGGCTGCACCATGTCAACGACCTCG
CCTCCCGCGATGAACTGCCCGGGGTGATGGTCACGGACAAGGGCGTGAAAATCACCCCGCTGGAGACAATCGTGCCAAAA
CACGCGCAGCCACTGATCGATCAGGCAAGCGCAATGTTTCCGCGGATCCGGATCACCGATCTGTTGATGGAAGTTGATGG
CTGGACCGGGTTCACCCGCCATTTCACCAGCCTGAAATCCGGCCAGCCCTCCAAAGACAAGCAACTTCTTCTCACCGCCA
TCCTCGCGGACGGAATCAACCTGGGCCTGACGAAGATGGCCGAGTCATGCACCGGCGTCAGCTACGCCCAGCTGGACCGC
CACCAGGCCTCCTACATCCGGGACGAAACCTACAGCGCCGCTCTGGCGGAACTGGTGAACACCCAGCACGGACACCCCTT
CGCCGCACAGTGGGGCGACGGGACCACCTCCTCATCGGACGGGCAGCGGTTCCGTGCCAGCAGCAAAGCCGAATCCACCG
GGCATGTGAACCCCAAGTACGGTGCCGAGCCCGGCCGGCTGATCTACACACACATCTCGGACCAGTACTCGCCCTTCCAC
AGCAAGCTCGTCAACGTCGGCGACCGCGACGCGACCTACGTCCTGGACGGGCTGCTCTACCACGAGTCCGACCTGGCGAT
CCAGGAGCACTACACGGATACGGCCGGATTCACCGATCACCTCTTCGCTCTTATGCACCTGCTCGGGTACCGGTTCGCCC
CACGGATCCGCAACATCGGCGACACCCGTCTCTACACACCTACCACCGATCCGGGACTTGCCACGTTGGCGCCGCTGATC
GGCGGGACCATCAACACGAAAATGATTGCCCTGCATTGGGATGAAATCCTCCGCCTCGCCGCGTCCATCAAGACCGGCAC
CGTGACCGCGTCCCTGATGATGCGAAAACTCGGCGCCTACCCGCGCCAGAACGGGCTCGCACTCGCGCTGCGGGAGCTGG
GCAGACTGGAGCGGACCCTCTTCCTGCTGGACTGGCTCCAGAACCCCGGCCTGCGCCGCAAAGTCACGGCCGGCCTGAAC
AAGGGCGAGGCCCGGAACACCCTCGCCCGGGCCGTCTTCTTCAACCGCCTCGGCGAAATCCGCGACCGCTCCTTCGAACA
GCAACGCTACCGCGCCAGCGGACTGAACCTTCTCACCGCGGCCATTATTCTCTGGAACACCGTCTACCTCGACCGCACCA
TCACCACCCTCAATAAGGACGGGAACGCCACGGACCCTGACCTGCTGCGGTTCCTCTCACCCCTGGGCTGGGAACACATC
AACCTCACCGGCGACTACACCTGGCCCCGCGCCAACCAGATCAAACCCGGCAAATACAGGCCACTACGCCGCCCGGCAAA
ACCTTAA

Protein sequence :
MALRSLLTAAERGQILAMPAETEDLAAHYTLSDADMSLIRQRRGDANRLGFAVQLCLLRHPGIGLADDTDVPPELIAWLA
SSLGVSIDAWDEYGTREETRQEHGREIRGYLGMSAFGIADYRWLVEHVGVVAAHTDKGLVLVESARDFLQARKVALPGIG
VIEKACAQALTRANRRIYLTLGEQLTVGHRQRLDGLLRRRRDSSLTEIGWLRQAPLRPNARAMNEHIDRLTTWRALELPW
AAGRLVHRNRLLKLAREGASMTAADLARFEPARRYATLFAMATESMATVTDEIIDLHDRIIGRLIRTAQNKQNQATLASR
STVAAMMRIHSRLGDALFEAKENGEDPFAAIETAIGWESLAESIAHAKELTRPALEDPLALVSAHFTTLRRYTPAFLAVL
DLNAAPAAQDLLAAINLVRTLNTAGARKIPDDAPTSFVRPRWKPLVFTENGIDRGFYEFCALAELKNALRSGDLWVTGSR
QFRDFDDYLLAGPDYTVMKTTGKLPLVTTDGGESYLQNRLALLNERLHHVNDLASRDELPGVMVTDKGVKITPLETIVPK
HAQPLIDQASAMFPRIRITDLLMEVDGWTGFTRHFTSLKSGQPSKDKQLLLTAILADGINLGLTKMAESCTGVSYAQLDR
HQASYIRDETYSAALAELVNTQHGHPFAAQWGDGTTSSSDGQRFRASSKAESTGHVNPKYGAEPGRLIYTHISDQYSPFH
SKLVNVGDRDATYVLDGLLYHESDLAIQEHYTDTAGFTDHLFALMHLLGYRFAPRIRNIGDTRLYTPTTDPGLATLAPLI
GGTINTKMIALHWDEILRLAASIKTGTVTASLMMRKLGAYPRQNGLALALRELGRLERTLFLLDWLQNPGLRRKVTAGLN
KGEARNTLARAVFFNRLGEIRDRSFEQQRYRASGLNLLTAAIILWNTVYLDRTITTLNKDGNATDPDLLRFLSPLGWEHI
NLTGDYTWPRANQIKPGKYRPLRRPAKP

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
EC042_4089 YP_006098372.1 transposase Not tested Tn2411 Protein 0.0 62
EXA24 ABD94633.1 truncated Tn1721 tranposase Not tested ExoU island A Protein 0.0 61
tnpA ACY75538.1 TnpA Not tested Tn6060 Protein 0.0 61
tnpA AAL08440.1 transposase TnpA Not tested SRL Protein 0.0 61
tnpA ACF06153.1 transposase Not tested Tn5036-like Protein 0.0 60

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Arth_4458 YP_829209.1 transposase Tn3 family protein VFG1031 Protein 0.0 61