Gene Information

Name : CAP2UW1_4525 (CAP2UW1_4525)
Accession : YP_003162850.1
Strain :
Genome accession: NC_013190
Putative virulence/resistance : Unknown
Product : transposase Tn3 family protein
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG4644
EC number : -
Position : 571 - 3537 bp
Length : 2967 bp
Strand : +
Note : PFAM: transposase Tn3 family protein; KEGG: asa:ASA_P4G106 TnpA transposase

DNA sequence :
ATGCCTCGTCGTTCCATCCTGTCCGCCGCCGAGCGGGAAAGCCTGCTGGCGTTGCCGGACACCAAGGACGATTTGATCCG
GCATTACTCGCTCAGTGATAGTGACCTTTCGATCATCCGGCAGCGGCGCGGGCCTGCGAACCGGTTGGGCTTCGCGGTGC
AGCTTTGCTACCTGCGCTTTCCCGGCATTCTCCTTGGCGTCGATCAGCCGCTGTTCCTGCCCTTGCTGAAACTGGTCGCC
GACCAGCTCAAGGTCGGTGTTGAAAGCTGGAACGACTACGGGCAGCGGGAACAGACCCGGCGCGAGCACCTGGTCGAACT
GCAAACGGTGTTCGGCTTCCAGCCGTTCACCATGAGCCATTACCGACAGGCCGTCCACACGCTGACCGAGCTGGCCATGC
AGACCGACAAAGGCATCGTGCTGGCCAGCGCCTTGATTGAGCAACTGCGGCGGCAGTCGATCATCCTGCCCGCGCTCAAC
GCCATCGAGCGCGCCAGTGCTGAGGCCATCACTCGCGCCAACCGACGCATCTATGAAGCCCTGTCCGAGCCGCTGTCGAA
CGGACACCGGCATCGCCTCGACGATCTGCTGAAACGCCGCGACAACGGCAAGATGACCTGGCTGGCCTGGCTGCGTCAGT
CGCCTGTCAAACCGAACTCCCGCCACATGCTCGAACACATTGAGCGCCTCAAAGCCTGGCAGGCCCTCGACCTGCCTTCT
GGCATCGAGCGGCTGGTTCACCAGAACCGGCTGCTCAAAATCGCCCGCGAGGGCGGCCAGATGACGCCCGCCGACCTGGC
CAAGTTCGAGCCGCGGCGGCGCTACGCCACCCTCGTGGCGCTGGCCATCGAAGGCATGGCCACCGTCACCGACGAAATCA
TCGACTTGCACGACCGCATCCTGGGGAAGCTGTTCAACGCCGCCAAGAACAAGCATCAGCAGCAGTTCCAGGCTTCTGGT
AAAGCCATCAACGCCAAGGTGCGACTGTACGGGCGCATCGGTCAGGCGCTGATCGACGCCAAACAGTCAGGCCGCGACCC
GTTCGCGGCCATTGAGGCCGTCATGTCCTGGGACGCTTTCGCCGAGAGCGTCACCGAGGCGCAAAAGCTCGCGCAGCCCG
ATGACTTCGACTTCCTGCATCGCATCGGCGAGAGTTACGCCACCTTGCGCCGCTACGCGCCGGAATTCCTTGCCGTGCTC
AAGCTGCGGGCCGCGCCCGCCGCCAAGGGTGTGCTTGATGCCATCGAGGTGCTGCGCGGTATGAACACCGACAACGCCCG
CAAGGTGCCAACCGATGCCCCGACCGACTTCATCAAGCCGCGCTGGCAGAAACTGGTGATGACCGACGCTGGTATCGACA
GACGCTACTACGAACTGTGCGCCCTGTCGGAACTCAAGAACTCCCTTCGCTCGGGCGACATTTGGGTGCAGGGTTCGCGC
CAGTTCAAGGACTTCGAGGACTACCTGGTGCCGCCCGATAAGTTCACCAGCCTCAAGCAGTCCAGCACATTGCCGCTGGC
CGTTGTCACCGACTGCGACCAATATCTGCACGACCGGCTGACCCTACTGGAAACGCAGCTTGCCACCGTCAACCGCATGG
CGGCGGCCAACGATCTGCCCGATGCCATCATCACCGAGTCCGGCTTGAAAATCACGCCGCTGGATGCGGCGGTGCCCGAC
ACCGCGCAGGCGCTGATCGACCAGACGGCCATGATCCTGCCACACGTCAAGATCACCGAATTGCTGCTCGAAGTCGATGA
GTGGACGGGCTTCACCCGCCACTTCACGCACCTGAAATCGGGCGATCTGGCCAAGGACAAAAACTTGCTGCTAACGACGA
TCCTGGCCGACGCAATCAACCTGGGCCTGACGAAGATGGCTGAGTCCTGTCCCGGCACGACCTACGCCAAGCTCGCCTGG
TTGCAAGCCTGGCATACCCGCGACGAAACCTATTCGACGGCGCTGGCTGAATTGGTGAACGCCCAATTCCGGCATCCCTT
CGCCGAGCATTGGGGCGACGGCACCACGTCATCATCGGACGGCCAGAACTTCCGCACCGGCAGCAAGGCCGAGAGTACCG
GCCACATCAACCCGAAATATGGCAGCAGCCCAGGACGGACGTTCTACACCCACATCTCCGACCAGTACGCGCCGTTCCAC
ACCAAGGTGGTAAATGTCGGCGTGCGCGACTCCACCTATGTGCTCGATGGCCTGCTGTACCACGAGTCCGACCTGCGCAT
CGAAGAACACTACACCGACACGGCGGGCTTCACCGATCATGTCTTTGCTCTGATGCACCTCTTGGGCTTCCGCTTCGCGC
CGCGCATCCGCGATTTGGGAGACACCAAGCTCTACATTTCCAAGGGCGAAACCGCCTATGACGCGCTCAAGCCGATGATC
GGCGGCACGCTCAACATCAAGCATGTCCGCGCCCATTGGGACGAAATTTTGCGGCTGGCCACCTCGATCAAGCAGGGCAC
GGTGACGGCCTCGCTGATGCTCAGGAAGCTCGGCAGTTACCCGCGCCAGAACGGCCTGGCCGTCGCCCTGCGCGAGTTGG
GCCGCATCGAGCGCACGCTGTTCATCCTCGACTGGCTGCAAAGCGTCGAGCTGCGCCGCCGTGTGCATGCCGGGTTGAAC
AAGGGCGAAGCCCGCAACGCACTGGCCCGGGCCGTGTTCTTCAACCGCTTGGGCGAAATCCGCGACCGCAGTTTCGAGCA
GCAGCGCTATCGGGCCAGCGGCCTCAACCTGGTGACGGCGGCCGTTGTGCTGTGGAACACGGTCTATCTGGAGCGCGCAG
CGCAAGCGTTACGTGGCAATGGCCATACTGTCGATAACGCGTTGTTGCAGTACCTGTCGCCGCTCGGCTGGGAACACATC
AACCTGACCGGCGACTACCTCTGGCGCAGCAGCGCCAAGATCGGCGCGGGCAAGTTCAGGCCGCTACGACCATTGCAACC
GGCTTAG

Protein sequence :
MPRRSILSAAERESLLALPDTKDDLIRHYSLSDSDLSIIRQRRGPANRLGFAVQLCYLRFPGILLGVDQPLFLPLLKLVA
DQLKVGVESWNDYGQREQTRREHLVELQTVFGFQPFTMSHYRQAVHTLTELAMQTDKGIVLASALIEQLRRQSIILPALN
AIERASAEAITRANRRIYEALSEPLSNGHRHRLDDLLKRRDNGKMTWLAWLRQSPVKPNSRHMLEHIERLKAWQALDLPS
GIERLVHQNRLLKIAREGGQMTPADLAKFEPRRRYATLVALAIEGMATVTDEIIDLHDRILGKLFNAAKNKHQQQFQASG
KAINAKVRLYGRIGQALIDAKQSGRDPFAAIEAVMSWDAFAESVTEAQKLAQPDDFDFLHRIGESYATLRRYAPEFLAVL
KLRAAPAAKGVLDAIEVLRGMNTDNARKVPTDAPTDFIKPRWQKLVMTDAGIDRRYYELCALSELKNSLRSGDIWVQGSR
QFKDFEDYLVPPDKFTSLKQSSTLPLAVVTDCDQYLHDRLTLLETQLATVNRMAAANDLPDAIITESGLKITPLDAAVPD
TAQALIDQTAMILPHVKITELLLEVDEWTGFTRHFTHLKSGDLAKDKNLLLTTILADAINLGLTKMAESCPGTTYAKLAW
LQAWHTRDETYSTALAELVNAQFRHPFAEHWGDGTTSSSDGQNFRTGSKAESTGHINPKYGSSPGRTFYTHISDQYAPFH
TKVVNVGVRDSTYVLDGLLYHESDLRIEEHYTDTAGFTDHVFALMHLLGFRFAPRIRDLGDTKLYISKGETAYDALKPMI
GGTLNIKHVRAHWDEILRLATSIKQGTVTASLMLRKLGSYPRQNGLAVALRELGRIERTLFILDWLQSVELRRRVHAGLN
KGEARNALARAVFFNRLGEIRDRSFEQQRYRASGLNLVTAAVVLWNTVYLERAAQALRGNGHTVDNALLQYLSPLGWEHI
NLTGDYLWRSSAKIGAGKFRPLRPLQPA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
EC042_4089 YP_006098372.1 transposase Not tested Tn2411 Protein 0.0 95
tnpA AAL08440.1 transposase TnpA Not tested SRL Protein 0.0 93
tnpA ACF06153.1 transposase Not tested Tn5036-like Protein 0.0 88
tnpA ACY75538.1 TnpA Not tested Tn6060 Protein 0.0 73
EXA24 ABD94633.1 truncated Tn1721 tranposase Not tested ExoU island A Protein 0.0 73

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
CAP2UW1_4525 YP_003162850.1 transposase Tn3 family protein VFG1031 Protein 0.0 93