Gene Information

Name : Anacy_1207 (Anacy_1207)
Accession : YP_007155660.1
Strain : Anabaena cylindrica PCC 7122
Genome accession: NC_019771
Putative virulence/resistance : Unknown
Product : transposase Tn3 family protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1444613 - 1447591 bp
Length : 2979 bp
Strand : -
Note : PFAM: Transposase; COGs: COG4644 Transposase and inactivated derivatives TnpA family; InterPro IPR002513; KEGG: npu:Npun_CR033 transposase Tn3 family protein; PFAM: Transposase, Tn3; SPTR: Transposase Tn3 family protein

DNA sequence :
TTGGCAACTCGTGAACTTTTATCTCCGGCACAGCGATTACAATTTACGGAAATTCCCAATTCCATCACCATAAGAGACAT
AGCTCGCTATTATACTTTTAGCAACGATGAACTCAAAGTTATCAAAGAACGTCGTAGACCACACAATCGTCTGGGTTTTG
CAGTGCAGTTGTGTTACCTCCGCTTTCCTGGTCGGGTTTGGAGTTTGGGAGAAATAGTTCCAGAATCTGTACTTTCTTAT
ATTGCTTCCCAATTAAAAATTGACCCAACAATCATTACAGAATATTCCCAAAGAGATACAATACGGCGTGAACATTTAGT
AGAAATTCAAAATATTTTTGGGTTTCATTCTTTTAATATATCTACATATAAACTCCTCTCAAAATGGTTATTACCATTTG
CCATCTCCTCAGAGCAAGGGATGGCACTTGTGGGAGCATTAATTGATGAAATGCGCTTTCGTAAAATTATTATTCCAGCT
ATCTCTACTGTAGAACGTTTGGCTTGGGAAGTGAGACACCGCGCTCAAAAATTAGTGTGTCTAGAATTAACTCAAAATCT
GACAATATTACAAAAAACAGCCCTGGATAAACTATTAATTTTAGAGCCTGATAAAAAGCTGACTGATTTAATTTGGCTGC
GTCAACCACCAGGAATACCTAATCCTAGAAACTTTTTAAAACTAGTAGAACGCCTAGAATTTATTCGCAATCTTCACCTC
GACTCTGGATGCTTGAAACGAGTACATCAAAATCGTCTGTTACAATTTACTAAAATTGGTGCGAAGTCTACACCTGCTCA
CCTATCTAGGTTAGATGAATTAAGGCGCTATGCTATTTTAGTTGCTTTTCTAATTGAATGGAGTGCATCATTAGTCGATT
ATGCCATAGGGATGCACGATAAAATGATGGGTAAATTGTTTAATAAAAGTGAGCATCAGCATGGCGAAAAGTTTCAACAT
GATGGGAAAGCGATTAATGACAAGGTGAGATTATATGCCCAATTCGGAAAGGCGTTAATTGCTGCTAGAGAGGAAGAAAA
TGATGCTTATCAAGCGATTGAGTCGGTCTTGGATTGGGAGAAGTTTATCAATAGTGTTGTTGAAGCTGAAAAGTTAGCTA
GACCCGCAGATTTTGATTATCTTGAACTACTTGATAACCGTTATTCACAGTTGCGAAGATATACACCTAAGTTGTTGGAG
ACGTTTGAATTTAAAGCAACAACTGCTAGTTTACCAGTTATTGAAGCTTTAGCGGTGATTAAAGAATTAAATATATCTGG
ACGCAGAAATATACCGGAATCTACTCCTACTAGTTTTGTGAAACCTCGTTGGTTAAAACACGTAATGAAGGGCGATACTA
TTGACCGTCACTATTATGAGATGTGTGCCTTGGCTGAGTTACGTAGTGGTTTACGTTCTGGGGATATTTGGGTAGTAGGT
TCTCGTCAGTTCCAAGATTTTGAGGATTATCTGTTAACTGATAGTTCATGGCAGTTAATGCGTTCTGCTCAAACAATACC
TGTGGCAGTGACAACTGATTTCACTACCTATATTGAACAACGTTCACTGGAATTGAAGTCACAATTAGGGATTGTTTCTG
ATTTAATGGCAGAGGATAAATTGGTTGATGTCAGAATAGAAGATGAGCGGTTAATTATTACTCCTTTGAGCAATGCTGTT
CCGACTGAGGTTGATGAATTAAGTCGAAAAGTCTCTAGTTTATTGCCGAGAATTAAGCTGACTGATTTGTTGGTGGAAGT
TGATTCTTGGACGCATTTCACTAAACATTTTACACATTTATATTCAGGAACAGAAGTAGAAGATAAGGTGGTTTTGTTGA
GTGCGTTGCTTGCTGATGGGATTAATCTGGGTTTAACGCGCATGGCAGATGCTACTCAAGGAATGTCTTTTGAGCGTTTA
GCTTGGGTGGCTGATTGGTATATTCGAGATGAGACTTATTCTCAGGCTTTGGCAGAGGTGGTAAATTTCCAGGCACAAGT
TCCTTTTGCTGCTTATTGGGGTGATGGGACTACTTCTTCGTCTGATGGTCAGCGTTTTAAAGCTGGTGGACACCGCAGTT
TTAATGAAGAGATTAATGCTAAATATGGTAAGGATAGAAGCGTGATTTTTTACACGCATATTTCTGACCAATATGTGCCT
TTTCATGTGAAGGTGATTAATGCAACGGTCAGGGATGCTAGTTATGTTTTAGATGGTTTGTTGTATCACGAGAGTGATTT
GCAGATTCAGGAGCATTACACTGATACAAGCGGCTATACTGAGCAGGTGTTTGCGATGTGTCATCTGCTGGGGTTTAGAT
TTGCTCCACGAATGCGCGATTTACCTGATAAGAAGTTGTATACTTTTGAGTCTACTTCTGCTGATGAGGTTTTGTCACCT
TTGTTAGGTGGCAAGATTAATGTGAAGTTGATTGAGGATTCTTGGGATGAGATTCTCCGGCTTGCTAGTTCAATTCGCAC
GGGGACGGTCACGGCTTCTTTAATGTTGCGGAAATTGGCTTCTTATCCTCGTCAGAATCGTTTGGCTTTAGCTTTGCGGG
AGTTGGGGAGAATTGAGAGGACTTTGTTTACTTTGGAATGGTTGCAGAGTCCTGAGTTACGACGACGAGCGACTGCGGGG
CTAAATAAGGGTGAGGCGAAACATACTCTGAAAAGGGCGGTGTTTTTTAATCGTCTGGGTGAGGTGCGCGATCGCTCTTA
TGAAGACCAATTTTATCGGGCTAGTGGGTTGAATTTGGTGGTGGCGGCGATTGTTGTATGGAATACGGTGTACATAGAAA
AGGCTGTTGAGCATTTGAAACAACAGGGTATGGATATTCCTGAAGAGCATTTGCAACATTTATCACCTTTGGGTTGGGAA
CATATCAATCTTACAGGTGATTATGTCTGGAATTTGAAGCAGGCAATCAGCTTTGACAAGTTGCGCCCTTTGCGAGTTAA
GGAAAATAGGTATCGCTGA

Protein sequence :
MATRELLSPAQRLQFTEIPNSITIRDIARYYTFSNDELKVIKERRRPHNRLGFAVQLCYLRFPGRVWSLGEIVPESVLSY
IASQLKIDPTIITEYSQRDTIRREHLVEIQNIFGFHSFNISTYKLLSKWLLPFAISSEQGMALVGALIDEMRFRKIIIPA
ISTVERLAWEVRHRAQKLVCLELTQNLTILQKTALDKLLILEPDKKLTDLIWLRQPPGIPNPRNFLKLVERLEFIRNLHL
DSGCLKRVHQNRLLQFTKIGAKSTPAHLSRLDELRRYAILVAFLIEWSASLVDYAIGMHDKMMGKLFNKSEHQHGEKFQH
DGKAINDKVRLYAQFGKALIAAREEENDAYQAIESVLDWEKFINSVVEAEKLARPADFDYLELLDNRYSQLRRYTPKLLE
TFEFKATTASLPVIEALAVIKELNISGRRNIPESTPTSFVKPRWLKHVMKGDTIDRHYYEMCALAELRSGLRSGDIWVVG
SRQFQDFEDYLLTDSSWQLMRSAQTIPVAVTTDFTTYIEQRSLELKSQLGIVSDLMAEDKLVDVRIEDERLIITPLSNAV
PTEVDELSRKVSSLLPRIKLTDLLVEVDSWTHFTKHFTHLYSGTEVEDKVVLLSALLADGINLGLTRMADATQGMSFERL
AWVADWYIRDETYSQALAEVVNFQAQVPFAAYWGDGTTSSSDGQRFKAGGHRSFNEEINAKYGKDRSVIFYTHISDQYVP
FHVKVINATVRDASYVLDGLLYHESDLQIQEHYTDTSGYTEQVFAMCHLLGFRFAPRMRDLPDKKLYTFESTSADEVLSP
LLGGKINVKLIEDSWDEILRLASSIRTGTVTASLMLRKLASYPRQNRLALALRELGRIERTLFTLEWLQSPELRRRATAG
LNKGEAKHTLKRAVFFNRLGEVRDRSYEDQFYRASGLNLVVAAIVVWNTVYIEKAVEHLKQQGMDIPEEHLQHLSPLGWE
HINLTGDYVWNLKQAISFDKLRPLRVKENRYR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
EC042_4089 YP_006098372.1 transposase Not tested Tn2411 Protein 0.0 52
tnpA ACY75538.1 TnpA Not tested Tn6060 Protein 0.0 51
tnpA AAL08440.1 transposase TnpA Not tested SRL Protein 0.0 51
tnpA ACF06153.1 transposase Not tested Tn5036-like Protein 0.0 51
EXA24 ABD94633.1 truncated Tn1721 tranposase Not tested ExoU island A Protein 0.0 51

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Anacy_1207 YP_007155660.1 transposase Tn3 family protein VFG1031 Protein 0.0 51