Gene Information

Name : WQG_21420 (WQG_21420)
Accession : YP_007549210.1
Strain : Bibersteinia trehalosi USDA-ARS-USMARC-192
Genome accession: NC_020515
Putative virulence/resistance : Virulence
Product : Type I restriction-modification system restriction subunit
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2289179 - 2292127 bp
Length : 2949 bp
Strand : +
Note : Type I site-specific restriction-modification system, R (restriction) subunit and helicases COG0610; Type I restriction-modification system restriction subunit of Bacteria UniRef RepID=Q48QA1_PSE14
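
The Position, Length and Strand fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention, but the reported length only matches under it) and that the final codon is a stop codon. The function and variable names are illustrative and not part of the record.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates copied from the Position field of this record.
START, END = 2289179, 2292127

length_bp = feature_length(START, END)  # should equal the Length field
codons = length_bp // 3                 # complete codons in the reading frame
residues = codons - 1                   # assumption: the last codon is a stop

print(length_bp, codons, residues)
```

Under these assumptions the gene spans 2949 bp, that is 983 codons, or 982 residues once the stop codon is excluded, which is what one would expect for the protein sequence listed in this record.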

DNA sequence :
ATGCACCACACCAAAGAAATCCATTTTGAAACGGCGATTGAGCAGGATTTGCTCAATCAAGGTTTTATCCAAGCAAGTCC
TAGCGGCTTTAATGCCCGTTTTGCCCTTGATGAGGCGAACTTTTGGGCGTTTTTAAGTGAAAGCCAAGCGGATAAACTGG
CGGATTTCAAACGGCTAAATCCGAATGATTGGCAAGCGAAAATTTTGGCAAGGTTGGATAATGTGTGGAAGCGGGAAGGG
ATTTTGCACCTGTTTAAAAAAGGGTTGGATGTGGACAATGTGCATTTAGATCTGTTTTTTGTGCCGCCGCTTGCCAACAG
TCCGCAGAGAGTTGCCGAGCTGTTTGTTCAAAATCGGTTTAGCGTGATGCGTCAAGTGCCCTATTCTGCCCAAAGCTCGG
AAACCGTGGATATGGCGGTGTTTATTAACGGCTTGCCTTTTGCCACGATGGAGCTTAAAAACGAATGGACGGGGCAAAGC
ACTTATCACGCCAAACAACAATATCGCCACAGAGATAACACCCAAGCCCTTTTTCAGCCTGCCCGCACGCTGGTGCATTT
CGCCATAGACAGCCAAGAAGCCTATATGACCACCAAAATTTGTGGCAATAACACTTTCTTTTTGCCGTTTAATCAGGGTA
ACCATCACGGCAAGGGCAATCCGCCGAACCCCAACGGGTTTAACACCGCTTATCTGTGGCAAGAAGTTTTCCAAAAACAG
AGCATTGCAGGGATTATTTTGCATTTTGCCCGCCTAGAATTTGACGATGAACGCAAAAAGGATTTGAGCAAAGCAACGCT
CTATTTTCCTCGTTATCATCAGCTTGATGTGGTGCGGAAGTTGGTGGCTGATGTGGCACAAAACGGCGTGGGTAAACGCT
ATCTCATTCAGCATTCGGCAGGCTCGGGCAAATCCAATTCCATCACGTGGCTAGCGTTTCATTTGATTGAAATTTACGGC
AAGGCAAGGGAAAAGCCGATTTTTGATTCGGTGATTGTAGTTACCGATAGAAAAGTGCTGGATAAGCAGATCAGCGATAA
CATTCGGGCGTTTTCTTCGGTGAAAAACATCATTGCCCACGCAGATCGTGCCACCGATTTGAAAAATGCGATGGAAAACG
GCAAGCGGATTATTATCACCACCATTCAAAAATTCCCGTTCATTGTGGACGGCATTGCCGATATGGCGGATAAAAAATTT
GCGGTGATTATTGATGAGGCGCACAGCTCGCAATCAGGAACGGCACACGATAATATGAACCGAGCGATGGGAGCAGTAGA
AGAAAGCGATGCCCAAGATTTGATTTTATCGGCAATGCAGGCTCGCAAAATGCGTCACAACGCTTCCTATTTTGCCTTTA
CCGCCACGCCCAAAAACAGCACGCTGGAAAAATTCGGCGAAAAACAGACCGCTTGCAATGCAGATGGCAAGCCCATTTTT
AAACCGTTTCATCTCTATTCTATGAAACAGGCGATTGAAGAAGGCTTTATTTTAGATGTGCTGAAAAATTACACCACTTA
CCAAAGTTATTATGAAGTGCAAAAAGCGGTAGAAGATAACCCTGAATTTGATGTGAAAAAAGCCCAACAGCGTTTAAAAG
CTTTTGTGGAAAGAACGCCCGAAAGCATTGCGGTGAAAGCGGATATTATGTTGAGCCATTTTTTAGATAGAGTGGTGAAA
ACCAAACGGCTGAAAGGCAAAGCCAAAGCGATGGTGATTACGCAAAACATTGAAACGGCGATTCGCTACTATCTGGCAAT
TTGCCAATGGCTTGATGAAAAAGGCAATCCGTTTAAAGCGGTAGTGGCGTTTTCAGGCGAAAAAAACGTAGATGGCGTGG
TTTATACCGAAAGCCAACTAAACGGTTTTGATGAGAATAAAACCAAAGCGATGTTTGATACGGACGACTACCGTATTTTA
GTCGTGGCGAATAAATATCTCACCGGCTTTGACCAACCGAAACTGTGTGCGATGTATGTGGATAAACCGCTTGCCGGCGT
GCTTTGCGTGCAGGCGTTATCTCGGCTTAATCGCAGCAGCCCAAAATATGGCAAAAGTGCCGACGATCTGTTTATTTTGG
ACTTCTTCAATAAAACCGAAGATATTCAGGCAGCTTTTGAGCCATTTTATACCGAAACGGAATTAGAGGGCGAAACCGAT
ATTAACATTCTCTACGATCTACAAAATGAACTGGACGAGGCTGGCATCTATGAATTAGACGAAGTCAGCCGTTTTGTAGA
ACGTCTGTTTGCAGGGGCGGATATGGCGGAGCTACAAACGCTCAATCAGGTGTGTGCTAACCGTTTTAATCACGGTTTGG
GCTGGGAACGAGAGCAAAAGGTGGATTTTAAAATCAAAGCCAAACAGTTTGTGAAAGTCTATAACCAAATGGCAAGCATT
ATCGCTTTTGAAAACCGAGAATGGGAAAAGCGTTACTGGTTCTTAAAATTGCTCATTCCGAAATTAAACGTTGCCGATGA
TGAAAAGGTGATTGATGATTTGCTGGAAAAAGTCAATTTAAGCTCTTATGCGTTAGCGATTAGCGAAAAAGAACACGCTA
TTACCCTTTCTGATGAAACGGGCACATTCACGCCAAGCAACAGTACCGTTCACGGCATACACGAAGATGATGAAGAGCGT
GATGAATTGGACAATATCATCAAAACCTTTAACGAACGCTGGTTTGACGGCTGGGGCGATACCCCCGAAGAACGCAGAAT
CCGCTTTGTGGCGGTGGTGGAGAAAATTCAGCAACATAAAGATTTTGAGAGTAAATATCTTGCCATTCAAGACCCTGCGC
TTCGCCAAGTGATTTTCAGCGACATAGTAAAAGAGGTGATGCGAAACCGCCGAGAACAGGAAATGGAAATTTACCGACAG
TTTATGCGAGACGATTCATTTCAAAAATCTTTGATTGCCGATTTGCAAAGAGCTGTCAATATACGCTAA

Protein sequence :
MHHTKEIHFETAIEQDLLNQGFIQASPSGFNARFALDEANFWAFLSESQADKLADFKRLNPNDWQAKILARLDNVWKREG
ILHLFKKGLDVDNVHLDLFFVPPLANSPQRVAELFVQNRFSVMRQVPYSAQSSETVDMAVFINGLPFATMELKNEWTGQS
TYHAKQQYRHRDNTQALFQPARTLVHFAIDSQEAYMTTKICGNNTFFLPFNQGNHHGKGNPPNPNGFNTAYLWQEVFQKQ
SIAGIILHFARLEFDDERKKDLSKATLYFPRYHQLDVVRKLVADVAQNGVGKRYLIQHSAGSGKSNSITWLAFHLIEIYG
KAREKPIFDSVIVVTDRKVLDKQISDNIRAFSSVKNIIAHADRATDLKNAMENGKRIIITTIQKFPFIVDGIADMADKKF
AVIIDEAHSSQSGTAHDNMNRAMGAVEESDAQDLILSAMQARKMRHNASYFAFTATPKNSTLEKFGEKQTACNADGKPIF
KPFHLYSMKQAIEEGFILDVLKNYTTYQSYYEVQKAVEDNPEFDVKKAQQRLKAFVERTPESIAVKADIMLSHFLDRVVK
TKRLKGKAKAMVITQNIETAIRYYLAICQWLDEKGNPFKAVVAFSGEKNVDGVVYTESQLNGFDENKTKAMFDTDDYRIL
VVANKYLTGFDQPKLCAMYVDKPLAGVLCVQALSRLNRSSPKYGKSADDLFILDFFNKTEDIQAAFEPFYTETELEGETD
INILYDLQNELDEAGIYELDEVSRFVERLFAGADMAELQTLNQVCANRFNHGLGWEREQKVDFKIKAKQFVKVYNQMASI
IAFENREWEKRYWFLKLLIPKLNVADDEKVIDDLLEKVNLSSYALAISEKEHAITLSDETGTFTPSNSTVHGIHEDDEER
DELDNIIKTFNERWFDGWGDTPEERRIRFVAVVEKIQQHKDFESKYLAIQDPALRQVIFSDIVKEVMRNRREQEMEIYRQ
FMRDDSFQKSLIADLQRAVNIR
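
The DNA and protein sequences above can be related by a direct translation. The following is a minimal Python sketch, assuming the standard codon assignments (which the bacterial translation table shares) and that, because the gene is on the + strand, the DNA is already in coding orientation. Only the first 24 and the last 15 nucleotides of the DNA sequence are copied here. The `translate` helper is hypothetical, written for illustration rather than taken from any library.

```python
from itertools import product

# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' marks a stop)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 24 nt and last 15 nt of the DNA sequence in this record.
head = translate("ATGCACCACACCAAAGAAATCCAT")
tail = translate("GTCAATATACGCTAA")

print(head)
print(tail)
```

The two fragments translate to the start (`MHHTKEIH`) and the end (`VNIR` followed by a stop) of the listed protein sequence. Passing the full 2949 bp sequence to the same function should reproduce the whole protein under the same assumptions.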

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
VC1765 | NP_231400.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 0.0 | 57
VC0395_A1363 | YP_001217306.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 0.0 | 57
VPI2_0013c | ACA01830.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 0.0 | 57

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
WQG_21420 | YP_007549210.1 | Type I restriction-modification system restriction subunit | VFG1098 | Protein | 0.0 | 57