Gene Information

Name : Hneap_1205 (Hneap_1205)
Accession : YP_003263088.1
Strain : Halothiobacillus neapolitanus c2
Genome accession: NC_013422
Putative virulence/resistance : Unknown
Product : transposase Tn3 family protein
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG4644
EC number : -
Position : 1308489 - 1311455 bp
Length : 2967 bp
Strand : -
Note : PFAM: transposase Tn3 family protein; KEGG: tmz:Tmz1t_2145 transposase Tn3 family protein

DNA sequence :
ATGCCGCGTCGTTCAATCCTGTCCGCCGCCGAGCGGGAAAACCTGCTGGCGTTGCCGGACTCCAAGGACGACCTGATCCG
ACATTACACATTCAGCGATACCGACCTCTCGATCATCCGACAGCGGCGCGGGCCAGCCAATCGGCTGGGCTTCGCGGTGC
AGCTCTGCTACCTGCGCTTTCCAGGCGTCATCCTAGGCGTCGATGAGCCGCCGTTCCCGCCCTTGTTGAAGCTGGTCGCC
GAGCAGATCAAGGTCGGCGTCGAAAGCTGGGACGAGTACGGCCAGCGGGAGCAGACCCGGCGCGAGCACCTGGTCGAGCT
GCAAACCGTGTTCGGTTTCCGGCCCTTCACCATGAGCCATTACCGGCAGGCCGTCCAGATGCTGACCGAGCTGGCCATGC
AAACCGACAAGGGCATCGTGCTGGCCAGTGCCTTGATCGAGCACCTGCGGCGGCAGTCGGTCATTCTGCCCGCGCTCAAC
GCCGTCGAGCGGGTGAGTGCCGAGGCGATCACCCGCGCCAACCGGCGCATCTACGACACCTTGGCCGAACCACTGGCGGA
CGCGCATCGCCGTCGCCTTGATGACTTGCTCAAGCGCCGGGACAACGGCAAGACGACCTGGCTGGCCTGGCTGCGCCAGT
CACCGGCCAAGCCCAATTCGCGGCATATGCTCGAACACATCGAACGCCTCAAGGCATGGCAGGCACTCGACCTGCCTTCC
GGCATCGAGCGGCTGGTTCACCAGAACCGGCTGCTCAAGATCGCCCGCGAGGGTGGACAGATGACGCCCGCCGACCTGGC
CAAGTTCGAGGCGCAGCGGCGCTACGCGACCCTGGTGGCGCTGGCCATCGAGGGCATGGCCACCGTCACCGACGAAATCA
TCGACCTGCACGACCGCATCCTGGGCAAGCTGTTCAATGCCGCCAAGAACAAGCATCAGCAGCAATTCCAGGCATCCGGC
AAGGCCATCAACGCCAAGGTGCGGCTGTTCGGGCGCATCGGCCAGGCGCTGATCGAGGCCAAGCAATCGGGTCGCGATCC
GTTCGCCGCCATCGAGGCCGTCATGTCCTGGGACGCCTTCGCCGAGAGCGTCACCGAAGCGCAGAAGCTCGCGCAGCCCG
AGGATTTCGATTTCCTGCACCGCATCGGCGAGAACTACGCCACGCTGCGCCGCTACGCGCCGGAATTCCTTGCCGTGCTC
AAGCTGCGGGCCGCGCCCGCCGCCAAGGACGTGCTCGACGCCATCGAAGTGCTGCGCGGCATGAACAGCGACAACGCCCG
CAAGGTGCCCGCCGACGCGCCGACCGACTTCATCAAGCCACGCTGGCAGAAGCTGGTGATGACCGACACCGGCATCGACC
GGCGTTACTACGAGCTGTGCGCACTATCGGAGCTGAAGAACGCACTGCGCTCGGGCGACATCTGGGTGCAGGGATCGCGC
CAGTTCAAGGACTTCGAGGACTACCTGGTGCCGCCCGCGAAATTCGCCAGCCTCAAGCTGGCCAGCGAATTGCCGCTGGC
CGTGGCCACCGACTGCGATCAGTACCTGCATGAACGGCTGACGCTACTGGAAACGCAGCTTGCCACCGTCAACCGCATGG
CAGCGGCCAATAACCTGCCGGATGCCATCATCACCGAGTCGGGCCTGAAGATCACGCCGCTGGATGCGGCGGTGCCCGAC
ACCGCGCAGGCCCTGATCGACCAGACGGCGATGATCCTGCCGCACGTCAAGATCACCGAACTGCTGCTGGAGGTGGACGA
GTGGACAGGCTTTACCCGTCACTTCGCACACCTGAAGTCAGGCGACCTGGCCAAGGACAGGAACCTGCTGCTGACTACTA
TCCTGGCCGACGCGATCAACCTGGGCCTGACCAAGATGGCCGAGTCCTGCCCCGGCACGACCTACGCCAAGCTCGCCTGG
CTGCAAGCCTGGCACATCCGCGACGAAACTTACTCGACGGCGCTGGCCGAGCTGGTCAACGCGCAGCTCCGCCACCCGTT
CGCCGAGCATTGGGGCGACGGCACCACGTCATCGTCAGACGGCCAGAATTTTCGCACCGGCAGCAAAGCCGAGAGCACCG
GCCACATCAACCCGAAATACGGCAGCAGCCCAGGGCGGACGTTCTACACCCACATCTCCGACCAGTACGCGCCGTTCCAC
ACCAAGGTGGTCAATGTCGGCGTGCGTGACTCGACCTACGTCCTCGACGGGCTGCTGTACCACGAATCCGACCTGCGCAT
CGAGGAGCACTACACCGACACGGCAGGTTTCACCGATCACGTCTTCGCGCTGATGCACCTCTTGGGCTTCCGCTTCGCCC
CGCGCATCCGCGACCTGGGCGACACCAAGCTCTACATCCCGAAGGGTGATGCCACCTACGAGGCATTGAAACCGATGATC
GGCGGCACCCTCAACATCAAGCACGTCCGCGCCCATTGGGACGAAATCCTGCGGCTGGCCACGTCGATCAAGCAGGGGAC
GGTGACGGCCTCCCTCATGCTCAGGAAGCTCGGCAGCTACCCGCGCCAGAACGGCCTGGCCGTCGCGCTGCGCGAGTTGG
GCCGCATTGAGCGCACGCTGTTCATCCTGGACTGGCTGCAAAGCGTCGAGCTGCGCCGCCGCGTGCATGCCGGGCTGAAC
AAGGGCGAGGCGCGCAACGCGCTGGCCCGTGCCGTGTTCTTCAACCGCCTTGGTGAAATCCGTGACCGCAGTTTCGAGCA
GCAGCGCTACCGCGCCTCCGGCCTCAATCTGGTAACGGCCGCCATCGTGTTGTGGAATACGGTCTATCTGGAGCGGGCCG
CGAACGCCCTGCGTGTCCACGGCCAGACTGTTGATGACGGCCTATTGCAGTATCTGTCGCCGCTGGGCTGGGAACACGTC
AACCTGACCGGCGATTACCTCTGGCGCAACAGCGCCAAGATCGGCGCAGGCAAGTTCAGGCCGCTACGGCCACTGCATCC
GGCTTAG

Protein sequence :
MPRRSILSAAERENLLALPDSKDDLIRHYTFSDTDLSIIRQRRGPANRLGFAVQLCYLRFPGVILGVDEPPFPPLLKLVA
EQIKVGVESWDEYGQREQTRREHLVELQTVFGFRPFTMSHYRQAVQMLTELAMQTDKGIVLASALIEHLRRQSVILPALN
AVERVSAEAITRANRRIYDTLAEPLADAHRRRLDDLLKRRDNGKTTWLAWLRQSPAKPNSRHMLEHIERLKAWQALDLPS
GIERLVHQNRLLKIAREGGQMTPADLAKFEAQRRYATLVALAIEGMATVTDEIIDLHDRILGKLFNAAKNKHQQQFQASG
KAINAKVRLFGRIGQALIEAKQSGRDPFAAIEAVMSWDAFAESVTEAQKLAQPEDFDFLHRIGENYATLRRYAPEFLAVL
KLRAAPAAKDVLDAIEVLRGMNSDNARKVPADAPTDFIKPRWQKLVMTDTGIDRRYYELCALSELKNALRSGDIWVQGSR
QFKDFEDYLVPPAKFASLKLASELPLAVATDCDQYLHERLTLLETQLATVNRMAAANNLPDAIITESGLKITPLDAAVPD
TAQALIDQTAMILPHVKITELLLEVDEWTGFTRHFAHLKSGDLAKDRNLLLTTILADAINLGLTKMAESCPGTTYAKLAW
LQAWHIRDETYSTALAELVNAQLRHPFAEHWGDGTTSSSDGQNFRTGSKAESTGHINPKYGSSPGRTFYTHISDQYAPFH
TKVVNVGVRDSTYVLDGLLYHESDLRIEEHYTDTAGFTDHVFALMHLLGFRFAPRIRDLGDTKLYIPKGDATYEALKPMI
GGTLNIKHVRAHWDEILRLATSIKQGTVTASLMLRKLGSYPRQNGLAVALRELGRIERTLFILDWLQSVELRRRVHAGLN
KGEARNALARAVFFNRLGEIRDRSFEQQRYRASGLNLVTAAIVLWNTVYLERAANALRVHGQTVDDGLLQYLSPLGWEHV
NLTGDYLWRNSAKIGAGKFRPLRPLHPA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
EC042_4089 YP_006098372.1 transposase Not tested Tn2411 Protein 0.0 95
tnpA AAL08440.1 transposase TnpA Not tested SRL Protein 0.0 93
tnpA ACF06153.1 transposase Not tested Tn5036-like Protein 0.0 88
tnpA ACY75538.1 TnpA Not tested Tn6060 Protein 0.0 74
EXA24 ABD94633.1 truncated Tn1721 tranposase Not tested ExoU island A Protein 0.0 73

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Hneap_1205 YP_003263088.1 transposase Tn3 family protein VFG1031 Protein 0.0 93