Gene Information

Name : join (sll8049)
Accession : NP_942394.1
Strain :
Genome accession: NC_005231
Putative virulence/resistance : Virulence
Product : type I site-specific deoxyribonuclease chain R
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 43288 - 1914 bp
Length : 2970 bp
Strand : -
Note : ORF_ID:sll8049

DNA sequence :
ATGAAAACCACTGACACCAGCGAGAAGGGCCTAGAAACCATAATTGTTAACTCCTTGGTCAATGAGGCAGGCTATGTTCT
GGGCGACCCTAAAGACTACGACAGGGAACATGCCGTTGACCTGGTGAAGCTGTTGGAATTTCTAGAGACAACCCAGCCCG
ATACCTATGGGGTACTTGGCATCAACAGAGAAGGCCCCAAGCGTACCCAGTTCTTGCACCGGCTACAGGGGGAGATTGCC
AAACGGGGTGTAGTGGACGCATTGCAGACTGGTATCAGGCATGGCCCTGCCCACGTAGAGCTTTTCTATGGCACCCCTAC
CCCTGGTAACGTTAAAGCAGTGGAACGGTTCAGTAACAACATTTTCAGTGTCACCCGCCAACTCCGCTACAGCTCAGTGG
AGACGGCTCTGTCATTGGACATGGCCGTGTTCATCAACGGATTACCCATTGCCACCTTCGAGCTAAAGAATCGGCTCACC
AAGCAGACTGTACTGGACGCAGTAGAGCAATATCAAAGGGATCGAGACCCCAAGGAACTGCTATTGCAGTTTGGTCGCTG
TGCAGTGCATTTTGCAGTGGATGACCATGAGGTACGCTTCTGTACCCACCTGACTGGTAAGGGTTCTTGGTTTCTACCCT
TCAACAAAGGTCACAACGACGGGGCAGGTAATCCTCCCAATCCTAACGGCATTGCCACTGACTACCTGTGGAAAGAAGTC
CTTACCAAAGAGGGGCTGACCGACATCCTAGAAAACTACGCTCAGATAGTGGAGGAGAAGGACGAAAAGACCTGCAGGAA
GAAGCTTAAGCAGGTCTTCCCTCGCTATCACCAATTGACAGTGGTGCGTCAGCTTCTGGCTACGGCTAAGCAAGATGGAG
TGGGGAAACGTTACCTGATCCAACATTCAGCCGGTAGCGGTAAGAGTAACTCTATTGCCTGGCTTGCTCATCAACTAGTG
GGACTAGAGCAGGATGGCAAAGCCCTATTCGACTCCATCATCGTTGTGACCGATCGCCAGGTACTGGACCGGCAGATTCG
CAACACCATCAAACAGTTTGCCCAAGTCTCCGCCACGGTGGGCCATGCAGAACGCTCCGGCGACCTACGCCAATTTATCA
AAGACGGCAAGAAAGTCATCATCACCACGGTGCAGAAGTTTCCCTTCATCCTCAACGAAATCGGGGACGAACATCGCCAG
AGTAAATTCGCCATTGTCATTGACGAAGCCCACTCCAGCCAAGGAGGCAAAACCACTGCCGCCATGAACCGGGTGCTGGA
AGGAACAGCCTCCTACAACGTTGCCAACGAAGAGGAAGAAGAAACTACCGAGGACAAGATCAACCGGATCTTGGAGGGAC
GGAAAATGGTTACCAACGCCAGCTACTTCGCCTTCACTGCCACTCCTAAAAATAAGACCCTGGAGATATTCGGTCAGCCC
GCCCCCCAGCCCGATGGCACTGTCAAACATTACCCCTTCCATAGCTACACCATGAAGCAGGCTATCCAGGAGGGCTTTAT
CCTCGATGTGCTGAAAAGTTACACCCCCGTGGAGAGCTATTACCGCCTAGCCAAGACGGTGGAGGATGACCCCCTCTTCG
ATGCTAAGAAAGCCCAAAAGAAACTGCGCCGCTATGTGGAATCCCACCAACATGCCATCCGGGAGAAGACCGAGATCATG
GTGGACCACTTCCACACCCAAGTAATCAACCACGGCAAGATTGGTGGGCAGGCTCGGGCGATGGTCATCACCAACGGTAT
TGGGCAAGCTATCCAGTACTTCTACGCCTTCAAAGACTACCTGCGGGAACGGAAAAGCCCCTACCAGGCGATCGTCGCCT
TTTCTGGAGAGTATGACTACGGAGGGCAGAAGGTCACAGAAGCCACCCTCAACGGCTTCCCTAGTAGTCAGATCACCGAC
AAGATAGAGGAAGACCCTTACCGCTTTCTCATCGTCGCCGACAAGTACCAGACCGGTTACGACCAGCCCCTACTACACAC
CATGTATGTGGATAAGGCCCTCTCCGGTATCAAAGCGGTGCAGACACTCTCTCGACTTAACCGTGCCCATCCCCAGAAGT
ACGACACCTTCGTGCTGGATTTCTACAACGATTCAGGCACTATCCAATCGTCCTTTGACCCCTATTACCGCACTACCATC
CTCAGTAATGAGACCGACCCCAACAAGCTCCATGACTTGAAGGCAGATCTGGACAGCCACCAGGTCTATACACAGGAGCA
AATCCAGTATCTAGTGGAGTTGTACCTGAACGGTGCAGATCGGGATAGGCTTGACCCAGTTTTGGATGCCTGCGTAGCCG
CCTACAACGCAGATCTCGATGAAGATGGACAAGTGGACTTCAAGGGTAAGGCGAAGGCCTTTGTCCGCACCTACGGCTTC
CTTGCCTCAATCCTGCCCTACTCCAATGCCGATTGGGAAAAACTATCCATCTTCCTCAACTTCCTCATTCCCAAGCTCCC
TGCCCCCAAAGAAGAGGATCTATCCCGGGGCATCCTAGAGGCGATCGACATGGATAGCTACCGTGTTGAACTAAAAACTA
GTCTGAAGATCGACCTAGCAGATGAGGATGCCGAGATTAAACCTGTGCCAACCGCCGGCGGGGGCAGTAAACCGGAGGCA
GACCTAGACCAGTTAAGTAACATCATCAAGGCTTTCAATGATCAGTTCGGAAACATTAATTGGAAAGACAATGACAAGAT
TCGTCGGGTCATCGCTGAGGAGATTCCAGCCAAGGTAGCGGCGGACGTAGCCTATCAGAATGCCATGAGGAACAACGACA
AGAGGACTGCCAGAATCGAACATGATGCTGCCCTACAGCGGGTGATGATTGAATTGTTGGCTGACCACACAGAGTTGTTC
AAGCAGTTTAGTGATAATCAGTCCTTCAAGAAATGGCTGGGCGACACCATCTTTGGTGCAACCTACCGGGAAAACTGGGA
AGCAGGTTGA

Protein sequence :
MKTTDTSEKGLETIIVNSLVNEAGYVLGDPKDYDREHAVDLVKLLEFLETTQPDTYGVLGINREGPKRTQFLHRLQGEIA
KRGVVDALQTGIRHGPAHVELFYGTPTPGNVKAVERFSNNIFSVTRQLRYSSVETALSLDMAVFINGLPIATFELKNRLT
KQTVLDAVEQYQRDRDPKELLLQFGRCAVHFAVDDHEVRFCTHLTGKGSWFLPFNKGHNDGAGNPPNPNGIATDYLWKEV
LTKEGLTDILENYAQIVEEKDEKTCRKKLKQVFPRYHQLTVVRQLLATAKQDGVGKRYLIQHSAGSGKSNSIAWLAHQLV
GLEQDGKALFDSIIVVTDRQVLDRQIRNTIKQFAQVSATVGHAERSGDLRQFIKDGKKVIITTVQKFPFILNEIGDEHRQ
SKFAIVIDEAHSSQGGKTTAAMNRVLEGTASYNVANEEEEETTEDKINRILEGRKMVTNASYFAFTATPKNKTLEIFGQP
APQPDGTVKHYPFHSYTMKQAIQEGFILDVLKSYTPVESYYRLAKTVEDDPLFDAKKAQKKLRRYVESHQHAIREKTEIM
VDHFHTQVINHGKIGGQARAMVITNGIGQAIQYFYAFKDYLRERKSPYQAIVAFSGEYDYGGQKVTEATLNGFPSSQITD
KIEEDPYRFLIVADKYQTGYDQPLLHTMYVDKALSGIKAVQTLSRLNRAHPQKYDTFVLDFYNDSGTIQSSFDPYYRTTI
LSNETDPNKLHDLKADLDSHQVYTQEQIQYLVELYLNGADRDRLDPVLDACVAAYNADLDEDGQVDFKGKAKAFVRTYGF
LASILPYSNADWEKLSIFLNFLIPKLPAPKEEDLSRGILEAIDMDSYRVELKTSLKIDLADEDAEIKPVPTAGGGSKPEA
DLDQLSNIIKAFNDQFGNINWKDNDKIRRVIAEEIPAKVAADVAYQNAMRNNDKRTARIEHDAALQRVMIELLADHTELF
KQFSDNQSFKKWLGDTIFGATYRENWEAG

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
VC1765 NP_231400.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 1e-169 42
VC0395_A1363 YP_001217306.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 1e-169 42
VPI2_0013c ACA01830.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 5e-169 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
join NP_942394.1 type I site-specific deoxyribonuclease chain R VFG1098 Protein 4e-170 42