Gene Information

Name : Anacy_1207 (Anacy_1207)
Accession : YP_007155660.1
Strain : Anabaena cylindrica PCC 7122
Genome accession: NC_019771
Putative virulence/resistance : Unknown
Product : transposase Tn3 family protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1444613 - 1447591 bp
Length : 2979 bp
Strand : -
Note : PFAM: Transposase; COGs: COG4644 Transposase and inactivated derivatives TnpA family; InterPro IPR002513; KEGG: npu:Npun_CR033 transposase Tn3 family protein; PFAM: Transposase, Tn3; SPTR: Transposase Tn3 family protein

DNA sequence :
TTGGCAACTCGTGAACTTTTATCTCCGGCACAGCGATTACAATTTACGGAAATTCCCAATTCCATCACCATAAGAGACAT
AGCTCGCTATTATACTTTTAGCAACGATGAACTCAAAGTTATCAAAGAACGTCGTAGACCACACAATCGTCTGGGTTTTG
CAGTGCAGTTGTGTTACCTCCGCTTTCCTGGTCGGGTTTGGAGTTTGGGAGAAATAGTTCCAGAATCTGTACTTTCTTAT
ATTGCTTCCCAATTAAAAATTGACCCAACAATCATTACAGAATATTCCCAAAGAGATACAATACGGCGTGAACATTTAGT
AGAAATTCAAAATATTTTTGGGTTTCATTCTTTTAATATATCTACATATAAACTCCTCTCAAAATGGTTATTACCATTTG
CCATCTCCTCAGAGCAAGGGATGGCACTTGTGGGAGCATTAATTGATGAAATGCGCTTTCGTAAAATTATTATTCCAGCT
ATCTCTACTGTAGAACGTTTGGCTTGGGAAGTGAGACACCGCGCTCAAAAATTAGTGTGTCTAGAATTAACTCAAAATCT
GACAATATTACAAAAAACAGCCCTGGATAAACTATTAATTTTAGAGCCTGATAAAAAGCTGACTGATTTAATTTGGCTGC
GTCAACCACCAGGAATACCTAATCCTAGAAACTTTTTAAAACTAGTAGAACGCCTAGAATTTATTCGCAATCTTCACCTC
GACTCTGGATGCTTGAAACGAGTACATCAAAATCGTCTGTTACAATTTACTAAAATTGGTGCGAAGTCTACACCTGCTCA
CCTATCTAGGTTAGATGAATTAAGGCGCTATGCTATTTTAGTTGCTTTTCTAATTGAATGGAGTGCATCATTAGTCGATT
ATGCCATAGGGATGCACGATAAAATGATGGGTAAATTGTTTAATAAAAGTGAGCATCAGCATGGCGAAAAGTTTCAACAT
GATGGGAAAGCGATTAATGACAAGGTGAGATTATATGCCCAATTCGGAAAGGCGTTAATTGCTGCTAGAGAGGAAGAAAA
TGATGCTTATCAAGCGATTGAGTCGGTCTTGGATTGGGAGAAGTTTATCAATAGTGTTGTTGAAGCTGAAAAGTTAGCTA
GACCCGCAGATTTTGATTATCTTGAACTACTTGATAACCGTTATTCACAGTTGCGAAGATATACACCTAAGTTGTTGGAG
ACGTTTGAATTTAAAGCAACAACTGCTAGTTTACCAGTTATTGAAGCTTTAGCGGTGATTAAAGAATTAAATATATCTGG
ACGCAGAAATATACCGGAATCTACTCCTACTAGTTTTGTGAAACCTCGTTGGTTAAAACACGTAATGAAGGGCGATACTA
TTGACCGTCACTATTATGAGATGTGTGCCTTGGCTGAGTTACGTAGTGGTTTACGTTCTGGGGATATTTGGGTAGTAGGT
TCTCGTCAGTTCCAAGATTTTGAGGATTATCTGTTAACTGATAGTTCATGGCAGTTAATGCGTTCTGCTCAAACAATACC
TGTGGCAGTGACAACTGATTTCACTACCTATATTGAACAACGTTCACTGGAATTGAAGTCACAATTAGGGATTGTTTCTG
ATTTAATGGCAGAGGATAAATTGGTTGATGTCAGAATAGAAGATGAGCGGTTAATTATTACTCCTTTGAGCAATGCTGTT
CCGACTGAGGTTGATGAATTAAGTCGAAAAGTCTCTAGTTTATTGCCGAGAATTAAGCTGACTGATTTGTTGGTGGAAGT
TGATTCTTGGACGCATTTCACTAAACATTTTACACATTTATATTCAGGAACAGAAGTAGAAGATAAGGTGGTTTTGTTGA
GTGCGTTGCTTGCTGATGGGATTAATCTGGGTTTAACGCGCATGGCAGATGCTACTCAAGGAATGTCTTTTGAGCGTTTA
GCTTGGGTGGCTGATTGGTATATTCGAGATGAGACTTATTCTCAGGCTTTGGCAGAGGTGGTAAATTTCCAGGCACAAGT
TCCTTTTGCTGCTTATTGGGGTGATGGGACTACTTCTTCGTCTGATGGTCAGCGTTTTAAAGCTGGTGGACACCGCAGTT
TTAATGAAGAGATTAATGCTAAATATGGTAAGGATAGAAGCGTGATTTTTTACACGCATATTTCTGACCAATATGTGCCT
TTTCATGTGAAGGTGATTAATGCAACGGTCAGGGATGCTAGTTATGTTTTAGATGGTTTGTTGTATCACGAGAGTGATTT
GCAGATTCAGGAGCATTACACTGATACAAGCGGCTATACTGAGCAGGTGTTTGCGATGTGTCATCTGCTGGGGTTTAGAT
TTGCTCCACGAATGCGCGATTTACCTGATAAGAAGTTGTATACTTTTGAGTCTACTTCTGCTGATGAGGTTTTGTCACCT
TTGTTAGGTGGCAAGATTAATGTGAAGTTGATTGAGGATTCTTGGGATGAGATTCTCCGGCTTGCTAGTTCAATTCGCAC
GGGGACGGTCACGGCTTCTTTAATGTTGCGGAAATTGGCTTCTTATCCTCGTCAGAATCGTTTGGCTTTAGCTTTGCGGG
AGTTGGGGAGAATTGAGAGGACTTTGTTTACTTTGGAATGGTTGCAGAGTCCTGAGTTACGACGACGAGCGACTGCGGGG
CTAAATAAGGGTGAGGCGAAACATACTCTGAAAAGGGCGGTGTTTTTTAATCGTCTGGGTGAGGTGCGCGATCGCTCTTA
TGAAGACCAATTTTATCGGGCTAGTGGGTTGAATTTGGTGGTGGCGGCGATTGTTGTATGGAATACGGTGTACATAGAAA
AGGCTGTTGAGCATTTGAAACAACAGGGTATGGATATTCCTGAAGAGCATTTGCAACATTTATCACCTTTGGGTTGGGAA
CATATCAATCTTACAGGTGATTATGTCTGGAATTTGAAGCAGGCAATCAGCTTTGACAAGTTGCGCCCTTTGCGAGTTAA
GGAAAATAGGTATCGCTGA

Protein sequence :
MATRELLSPAQRLQFTEIPNSITIRDIARYYTFSNDELKVIKERRRPHNRLGFAVQLCYLRFPGRVWSLGEIVPESVLSY
IASQLKIDPTIITEYSQRDTIRREHLVEIQNIFGFHSFNISTYKLLSKWLLPFAISSEQGMALVGALIDEMRFRKIIIPA
ISTVERLAWEVRHRAQKLVCLELTQNLTILQKTALDKLLILEPDKKLTDLIWLRQPPGIPNPRNFLKLVERLEFIRNLHL
DSGCLKRVHQNRLLQFTKIGAKSTPAHLSRLDELRRYAILVAFLIEWSASLVDYAIGMHDKMMGKLFNKSEHQHGEKFQH
DGKAINDKVRLYAQFGKALIAAREEENDAYQAIESVLDWEKFINSVVEAEKLARPADFDYLELLDNRYSQLRRYTPKLLE
TFEFKATTASLPVIEALAVIKELNISGRRNIPESTPTSFVKPRWLKHVMKGDTIDRHYYEMCALAELRSGLRSGDIWVVG
SRQFQDFEDYLLTDSSWQLMRSAQTIPVAVTTDFTTYIEQRSLELKSQLGIVSDLMAEDKLVDVRIEDERLIITPLSNAV
PTEVDELSRKVSSLLPRIKLTDLLVEVDSWTHFTKHFTHLYSGTEVEDKVVLLSALLADGINLGLTRMADATQGMSFERL
AWVADWYIRDETYSQALAEVVNFQAQVPFAAYWGDGTTSSSDGQRFKAGGHRSFNEEINAKYGKDRSVIFYTHISDQYVP
FHVKVINATVRDASYVLDGLLYHESDLQIQEHYTDTSGYTEQVFAMCHLLGFRFAPRMRDLPDKKLYTFESTSADEVLSP
LLGGKINVKLIEDSWDEILRLASSIRTGTVTASLMLRKLASYPRQNRLALALRELGRIERTLFTLEWLQSPELRRRATAG
LNKGEAKHTLKRAVFFNRLGEVRDRSYEDQFYRASGLNLVVAAIVVWNTVYIEKAVEHLKQQGMDIPEEHLQHLSPLGWE
HINLTGDYVWNLKQAISFDKLRPLRVKENRYR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
EC042_4089 YP_006098372.1 transposase Not tested Tn2411 Protein 0.0 52
tnpA ACY75538.1 TnpA Not tested Tn6060 Protein 0.0 51
tnpA ACF06153.1 transposase Not tested Tn5036-like Protein 0.0 51
tnpA AAL08440.1 transposase TnpA Not tested SRL Protein 0.0 51
EXA24 ABD94633.1 truncated Tn1721 tranposase Not tested ExoU island A Protein 0.0 51

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Anacy_1207 YP_007155660.1 transposase Tn3 family protein VFG1031 Protein 0.0 51