Gene Information

Name : Tery_4471 (Tery_4471)
Accession : YP_723931.1
Strain : Trichodesmium erythraeum IMS101
Genome accession : NC_008312
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 6897392 - 6900367 bp
Length : 2976 bp
Strand : -
Note : PFAM: type III restriction enzyme, res subunit; protein of unknown function DUF450; DEAD/DEAH box helicase-like. KEGG: dde:Dde_1859 DEAD/DEAH box helicase-like
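
The Position and Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon (the DNA sequence in this record ends in TAG). The function name `cds_summary` is hypothetical and not part of any database tooling.

```python
def cds_summary(start: int, end: int) -> dict:
    """Summarise a coding region from 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    length_bp = end - start + 1          # inclusive span in base pairs
    if length_bp % 3 != 0:
        raise ValueError("length is not a whole number of codons")
    codons = length_bp // 3              # includes the terminal stop codon
    residues = codons - 1                # assumption: the last codon is a stop
    return {"length_bp": length_bp, "codons": codons, "residues": residues}


# Coordinates copied from the Position field of this record.
summary = cds_summary(6897392, 6900367)
print(summary)  # {'length_bp': 2976, 'codons': 992, 'residues': 991}
```

The strand ("-") does not affect this check, since the span length is the same on either strand. The result agrees with the Length field (2976 bp) and implies a 991-residue protein.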

DNA sequence :
ATGGCTTCCGATATTACTGAAAAAGGTCTCGAAAATATTATCTATCAAAGTCTCATCGACGACTGCCAATATTTAGAAGG
TAACCCCAAAGACTACGACCAAACCTACTGTATCGACACTGAGAAACTTTTCCAGTTTCTCCAAAACACCCAACCTGAAA
AATTAACAGAAATTTCTAACTACCACGGCGCCAACTGGGAGAAAAAACTTTATGAACGCCTCCACCACCAAATAGAAGAG
AAAAGTATAGTCAATATATTACGTCAAGGTATCAAAACTGGAGAAACCCACCTCGAACTTTACTATAAACTTCCCACTTC
CCAACTCAACCCCGACACTATCGAAAATTTCCAAGAAAACGTTTTTTCAGTCACTCGCCAACTAAAATACAAAGAAAACC
GTAACTTCTCCCTCGACTTAGTAATTTTTATAAACGGTTTACCAGTTATTACCTTCGAGCTAAAAAACCAACTAACCAAA
CAAAACTTTCGAGACGCCATAAACCAATATAAAAATGACCGACGCCCCAGAGAATTATTATTTCAATTCAAACGTTGCCT
AGTACATTTTGCCCTCGACGCCGATGAAGTTTGGATGACAACAAAACTCAACGGCAAAAATACAGAATTCATACCATTCA
ACAAAGGCAAAAAATCAAATCCTGATCTACCTTTTCCGGATACAGCAGGGAACCCTCCCAACCCCAACCACATCAAAACA
GATTATTTGTGGAAAGAAATTTTAACCATAGAAAGTCTCGGAAACATCATCGAACATTACGCCCAACTGATAGAAAAAGA
AGAAGATAAAGACAAAGACAAAAAAACAGTCAAAAAGCTAAAACTAATCTTCCCCCGCTACCATCAACTCGACCTAGTCA
AGCAACTTTTAACAAGTGCAAAAAAACATGGAGTCGGCAACCGCTACTTAATCCAACATTCTGCAGGTTCCGGCAAAAGT
AATTCTATAACCTGGCTGAGTCATCAACTCGTAGAACTGAAAAATATTACCGAGAAAGAAAATATTTTTGATTCAGTTTT
AGTCGTGACAGACCGCAAAATTTTAGATAAACAAATTAGGGAAAATATTCAACAATTCGCTCAAGAAGACAAAGTCGTAG
AAGCAACCAAAAACAGCAAAAAATTAAAATCAGCCTTGGAAAACAAACGGAAAATTATTATTACAACAGTGCAAAAATTT
CCATATGTTGTCAAAGAAATTCAATCTTTATCCGATCACAAGTTTGCCATTATTATCGACGAAGCACATTCGAGTCAAAC
TGGCAAAAGTGCAGCCAGCATGAGTGAATCTTTGAGCAAAAAAGATTCGGAAGTAGAAGAAACCACAGAGGATAAAATAA
TCCGAATTATTGAGTCACAAAAACTTTGCCCAAATGCTAATTATTATGCATTTACTGCCACGCCAAAGAATAAAACTTTA
GAGTTATTTGGTGTCAAAAATCCAGAAGATGGAAAATTTTATCCGTTCCATAGTTATTCCATGAAGCAGGCAATTGAAGA
AGGATTTATTCTGAATGTTTTGCAGCATTATACGACCTACAAAACCTATTGTCGATTAGAGAAGAAAGTTATAGACGACC
CTGAATTTGATAGTAAACAAGCAAAAAAGAAGTTAAAACAATATGTAGAAGAGGATCAAGAGAGTATCCGCAAAAAGTCA
GAAGTGATGATTGAGCATTTTTTATCAAAGGTAATTGCTCAGGGAAAAATTAATGGAAAAGCTAAGGCTATGGTCGTTAG
TAATAGTATTAAAAGTGCGATTTATTATAAAAAAGCTTTTGATAAATATTTGAGAGAAAAAAAATCTGATTATCAGACTA
TTGTTGCTTTTTCTGGAAGTAAAGAAATAGACGGCAAAAAGGAAAATGAGTCTTCTATGAATGGATTTTCTAGTAGTAAG
ATTACAGAAAAATTTAATGATAGTAAATATAGGTTTTTAATTGTGGCTAATAAGTATCAAACTGGTTTTGATGAACCGTT
GTTACATACTATGTATGTGGATAAAGTTTTATCTGATGTGAAAGCAGTACAAACTTTGTCTAGGTTAAACCGTTCTTGTG
AGGGAAAAACAGATACTTTTGTTTTAGATTTTGTTAATTCTGCTGATGAAATTCAGAGAGCTTTTGAACCTTATTATAAA
ACAACTATTTTGAGTGAAGAAACAGATAGCGATCGCCTCTATGATTTAGAGGATAGTTTAGCAAGTTTTCAGATTTATTC
TCAAGAAAATGTAGAGAAATTTATGAAGCTTTTTTTGAATTGTGAGTCACGGGAAAATTGGGAGTCAATTTTAGATATTT
GTGTGGAAAAATATAATTGTGATTTGCTAGAGGAGGAAAAAATAGAGTTTAAAAGTAAAGCCAGGAGTTTTGTGAAAAAT
TATCAATTTTTGGTGCAAGTAAAAAGTTTTAAAAATTCCAATTGGGAGAGTTTAAATAGTTTTCTGAAATTGTTAGTTAA
TAAACTGCCACAATTAGATAATTCTGATTTATCGGCAGGAATTATTAATAGTGTGGATATTGAGAGTTATCGAGTAGAGC
TTCTAGCTAGTCAAAGTATTAATTTAAGTGGAGAAAATACCCTATCTCCCATTGCCAAGAATATTGTTAGTGGAAATTCT
CAAAGTAGGTCAGATAAAGTTAGTCAAATAATCGAAGAATTTAATAACCGCTTCGGTGGTAATATTGTTTGGCAAAATGA
GGGTAGGGCATGGAAATTTTTATTAGAGGAGTTGCCAGAAAAAGTCAGAGGAAATGGGGAGTATAAAAATGCTATAAATT
ATAGCGATCCGCAAAATGCCAAACTTACCTTTGAAAATAAATTCAATCAAGAATTACGGCGTTCTACCCGTGAACATATA
GAAGAATATCGTCAATTTACAGGTAATAAAAGTTTTCGAGAATGGTTAATTAATACTTTATTTAATCTTGACTACGAGCA
AGATAAAAATGCTTAG

Protein sequence :
MASDITEKGLENIIYQSLIDDCQYLEGNPKDYDQTYCIDTEKLFQFLQNTQPEKLTEISNYHGANWEKKLYERLHHQIEE
KSIVNILRQGIKTGETHLELYYKLPTSQLNPDTIENFQENVFSVTRQLKYKENRNFSLDLVIFINGLPVITFELKNQLTK
QNFRDAINQYKNDRRPRELLFQFKRCLVHFALDADEVWMTTKLNGKNTEFIPFNKGKKSNPDLPFPDTAGNPPNPNHIKT
DYLWKEILTIESLGNIIEHYAQLIEKEEDKDKDKKTVKKLKLIFPRYHQLDLVKQLLTSAKKHGVGNRYLIQHSAGSGKS
NSITWLSHQLVELKNITEKENIFDSVLVVTDRKILDKQIRENIQQFAQEDKVVEATKNSKKLKSALENKRKIIITTVQKF
PYVVKEIQSLSDHKFAIIIDEAHSSQTGKSAASMSESLSKKDSEVEETTEDKIIRIIESQKLCPNANYYAFTATPKNKTL
ELFGVKNPEDGKFYPFHSYSMKQAIEEGFILNVLQHYTTYKTYCRLEKKVIDDPEFDSKQAKKKLKQYVEEDQESIRKKS
EVMIEHFLSKVIAQGKINGKAKAMVVSNSIKSAIYYKKAFDKYLREKKSDYQTIVAFSGSKEIDGKKENESSMNGFSSSK
ITEKFNDSKYRFLIVANKYQTGFDEPLLHTMYVDKVLSDVKAVQTLSRLNRSCEGKTDTFVLDFVNSADEIQRAFEPYYK
TTILSEETDSDRLYDLEDSLASFQIYSQENVEKFMKLFLNCESRENWESILDICVEKYNCDLLEEEKIEFKSKARSFVKN
YQFLVQVKSFKNSNWESLNSFLKLLVNKLPQLDNSDLSAGIINSVDIESYRVELLASQSINLSGENTLSPIAKNIVSGNS
QSRSDKVSQIIEEFNNRFGGNIVWQNEGRAWKFLLEELPEKVRGNGEYKNAINYSDPQNAKLTFENKFNQELRRSTREHI
EEYRQFTGNKSFREWLINTLFNLDYEQDKNA

• Homologs from PAI DB

Gene          GenBank Accn    Product                         Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity
VC1765        NP_231400.1     type I restriction enzyme HsdR  Not tested               VPI-2       Protein         2e-169  41
VC0395_A1363  YP_001217306.1  type I restriction enzyme HsdR  Not tested               VPI-2       Protein         2e-169  41
VPI2_0013c    ACA01830.1      type I restriction enzyme HsdR  Not tested               VPI-2       Protein         1e-168  41

• Homologs from VFDB (virulence genes)

Gene       GenBank Accn  Product               ID of source DB  Alignment Type  E-val   Identity
Tery_4471  YP_723931.1   hypothetical protein  VFG1098          Protein         8e-170  41