Gene Information

Name : Rahaq2_2753 (Rahaq2_2753)
Accession : YP_005200728.1
Strain : Rahnella aquatilis CIP 78.65
Genome accession: NC_016818
Putative virulence/resistance : Virulence
Product : virulence plasmid B protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2954497 - 2958891 bp
Length : 4395 bp
Strand : -
Note : PFAM: Insecticide toxin TcdB middle/C-terminal region; Insecticide toxin TcdB middle/N-terminal region; Salmonella virulence plasmid 65kDa B protein

DNA sequence :
ATGCAACAAACAGAGAAGTCTTCAACGTTACCCGTAAATACACTCTCCCTGCCGAAAGGCGGGGGAGCCATTCAGGGCAT
GGGCGAAGCACTCGGGAATATCGGCCCCGGCGGTATGGCCGGTATGACTGTGCCGTTGCCCATTTCTGCCGGGCGGGGCT
ATGCGCCTGCGCTGGCGCTGAGTTACAGCAGCGGGGCAGGAAACGGCGAATTCGGTTTGGGGTGGGGATGCGCGGCGGCA
AGCATTCAACGACGGACCAGCCGGGGTGTGCCTCATTATACTGATGAGGACGCGTTTATCGGGGCGAGCGGGGAAGTGCT
TATCCCAAAATACGGCGATGACGGGCAAATAGTCAGCCGTAACGTCTCACGTTACGGACAGGTGACGCTGGATCAAACCT
GGCGCGTCACTCCATACCTTTCGCAAGTCACCGCCGGGAGCGATCTTATTGAGCGCTGGCAGGGTACGCAGGGCGGCGAT
TTCTGGCTGATTCATACCGCCGACGGCCAGCTTCATTGTCTGGGCAAATCGCCAGCGGCACAGCTTACCGACCCCGAAGA
CGCCAGCCGCATTGCGGGCTGGATGTTGCAGGAGTCGGTCAGTCCCAACGGTGAACATATTTGCTATCGTTATAAAAAAG
AAGATGATGCCGGTGTCAGCCTGACCGGGAATGAACAGCAACGCGATCACCGCACGTTTGTGTATCTGACGCAGGTTGAT
TATGGCAACCAAACGGCGCAGGAAGCGCTGTATGCCTGGACGGAGACGATGCCAAATGACAGCCAGTGGCTGTTTAGCCT
GGTGTTCGATTATGGCGAGAGGAGTCTGGATCCGAATGCGCCTGTCGCCTGTGCGCCTGCGACTGACTGGCTGGCGAGAC
CGGATCCTTTTTCACGCTATGACGTGGGCTTTGAGGTTCGCTGTCATCGTCTTTGTCGTCAGGTTCTGATGTTTCATCGT
TTTCCGTCTGAACTGCACGCGGCGGAAACATTAGTGCTCCGGCTTCTGCTGGAGTACCAAAGTTCTGCCACCCTAAGCCA
GATGATCAGCGCGCAAATTCTGGCCTATGAAACCGATGGCACGGTGCAAAGCCGTCCGCCGCTTGATCTGAGTTACAGCC
CGTTCTCTGTCAGTCCCTCATCTGAACAGTGGCAGCCTTTTCCTGCGCTGGCGGGCCTTGACGACGGGCAGCAATATCAG
CTGGTGGATTTATACGGCGAAGGCATTCCGGGCGTCCTGTTTCGCCAGTCCGGCGGCTGGTATTACCGTGCACCGGTACG
AGGCAGCGAAGGTGAGAACAGCGTCACTTACGGTGACTGGCAGCTTTTACCTCGGGTGCCTGCCATGCAGTCTTCCTCTA
TGGCGTTGATGGATGTTAACGGAGACGGACGGCTGGACTGGCTGGTGACTGCGCCTGGCATCAACGGTTTCTTTACCCTC
AACCCGTCGGGTGAATGGTCGTCCTTTACGCCGTTTTCCGCCTTTCCGACAGAATTCACCCATCCTTCAGCACAGTGGGC
CGATCTGTCCGGCTCGGGGCTGACGGACATCGCGTTAATCGGGCCGAAAAGCGTGCGTCTGTATGCCAATAACCGCGAGG
GTTTTACCCGTCCGGTGCAGATCGAGCAGCAATACACTTTGCCGGTTTTTGCCCGGGATGCCACTGAGCTGGTGGGGTTC
AGCGACATGCTGGGATCAGGCCAGCAACATCTGATCCGCATCCGTTATGACGGCATCACCGTCTGGCCGAATCTGGGGCA
TGGCAAATTTGGTGAACCTTTTGTTCTCAGTACGCTGAGCTTTGATCGCCAGAGTTTTGACCCCGGCCGCATTATTCTGG
CCGATCTGGACGGATCGGGTGGCGCTGATTTGTTGTATATGCACAGTGATTACATCGAGGTTTTTGCTAACCTGTCGGGT
AACGGCCTGGCCGCCCCGATGCAGCTTGCGCTGCCAGCCGGTATCCGTTACGACAATCTGTGCCAGTTAAATGTCGCCGA
TTCGCAGGGCCTGGGCGTAGCCAGTCTGATCCTGTCGGTGCCCTACATGACGCCCCGGCACTGGCATTGTGATTTAACCA
GCATAAAACCTTATCTTCTGGAGGCGGTGAATAACAATCAGGGCAATCACAACCAAATTATTTACCGCAGTTCGGCGCAA
GAGTGGCTGGATGAAAAACAGCAGTTTCCCGGCGCTGTTTCAGCACTGCCTTTCCCGATCCATATCGTTTCGCAAACGGT
CAACACCGACGAGATCAGCGGCAGTGTACTGACGCAACGCAGCCGATACCGTAAAGGGGTGTATGACGGCGTTATGCGCG
AATTTTATGGCTTTGGTTATCTTGAACAAACCGATACTACCGGTGGTGTTCTAACGAGTGAAGATACAGAGCCGCTGATC
AGTAAAAGCTGGTTTCATACCGGACGCGAAGAAGATGAAAGCGACCTCTACGGCACACCCTATGTGGGGCGTTTTACTGT
CACGGTGAACCCGACCTGGTTGAGTCAGTTCGATGCGTCCGGTGGCAAAGACATTGCACTGACGGATGTGAGTCACGAAA
ACCGCGCCATGCTCTATCGTGCGCTGAAAGGATCATTACTTGGCAGTGAGGTGTATGGGGCAGATAACCCGATCCCTTAT
ACCCAGACCCGACAACGTTATCAGGTGCGGCTGGTGCAACCCGCCAAAGGTAAAAACGCCAGCGTAGCGATGCCGGGCTT
GCTTGAGTCGGTCAGCTATAACTGGGAGCAGATCGCCACAGATCCGCTGATTGCCCAGCAAGTGGTGATGCAAAAAGACT
GCTTTGGTGCCACGGTGTGGCAGGTGAATATAAACTATCCCCGTCTGACCAAACCCGGGGTTGATCCGTATCCGGCCATA
TTACCGGATACGGCATGGGCGAGTTCGTATGACAGTCAGCAGCAGGTGCTGAGAATAACCGAGGGGCGGGCATCTTATAT
TGATTTAACTGACGCTCAGGCATGGCGTCCGGGGATCAGTGACTGTCAGCGTACCAATATCCTGACCTATGAAAATTACG
CGTTACCGGCCAGCGGCATTACGTTTGAAATGTTGTCTGAACCAGACGGGTTACTCGGCAATAGTCGCCCGCGTATTTAT
GCAGGTCAAAGTCAGATTTTTTATCAACCTGCCGTGCCGGATTTAGGCGCAAGGGTTGATCATGTCGAAACGGCTGAACT
GGATGAGGCCAGTCTGGCGGCCTATGACGGCATACTGAGCCGTGACGAACTTATTCCTCTGCTGGCGCAGGCAGGATATG
TGCACGTACCGGCGTTGTTGCCAGTGCCGGGTGCCAGTGATGAACCGGTTTATTGTATTGCCAGTGCTTATACGACTTAT
CTGCCCGCCAGCCGTTTTTATTTGCCTGAGACACAGCGGCAAACCATGCTAACCGGCGCGACCACACTCACTTATGACGA
TTACGCCTGTTGTGTCATCAGCTCGCGTGATGCTGCCGGGAATACCACCTCCGCCGCCTATGACTATCGCTTCCTCAAGC
CCTGTAAGATCATTGATATCAACGATAATACCGCTGAAATTCAACTGGACGCACTCGGCGGGCAGATGGGAAGCAGTTTC
TACGGAACGGAGCAGGGCGTGATGACCGGATTTGATTCGGTGACGGAATACCCTGTTCCGGTAACACTGACGGTTGAACA
GGCCATTAATCAGGCCGAAGGCAGCCCGCAGCATCTGGCAACCGTTATTGTTGATGACCCCTTCAGCTGGATGGGGCAGG
TTACACAACAACAGTGTGGTAGTGGCTGGGATACTCTTCTGGAACAGCGGTTTATTACCTTTGAGGGCTATATTCGTGCT
GCCGGGCGGGACTGGGCAAACTCGACGCGGGAGATTAGCGGCCTTGATGCTTCTGTTCGTGGGCTTCTTGCTGATGCGGT
GACGACACCTGTTCATGTTGCCACCCTGACGGCAGACCGCTATCCCGATGATGATCAACAACAGGTCCGCATCAGCGTGG
CTTTTTTTGATGGCTTTGGCCGGACATTGCAACAAAGCGGCAAGGTTGCCCCGGGCGATGCGTGGGTGCGCGATACCGCC
GGAGAACTGGTGATCGCCACGGGAGGCGAGCCGGTGACAACACCGGCTGACCCGCGCTGGGCAGTCAGTGGCCGGGTGGA
ATACAACAATAAAGGGCTGCCGGTGCGGGCATACCAGCCTTATTTCATTAACGACTGGCAATACGTTGTTGATTCCGCGC
TGCGCACCGCAGGTTATGCGGATACCCATTTTTATGATGCGCCGGGACGGGAAATTAAAGTGGTCAGAGCGGGGTCAGGT
TACCAGACCCGTCAGCAATATTTCCCGTGGTTTACAGTGAGTGAGGATGAGAACGATACGCAGGAGGCATTATGA

Protein sequence :
MQQTEKSSTLPVNTLSLPKGGGAIQGMGEALGNIGPGGMAGMTVPLPISAGRGYAPALALSYSSGAGNGEFGLGWGCAAA
SIQRRTSRGVPHYTDEDAFIGASGEVLIPKYGDDGQIVSRNVSRYGQVTLDQTWRVTPYLSQVTAGSDLIERWQGTQGGD
FWLIHTADGQLHCLGKSPAAQLTDPEDASRIAGWMLQESVSPNGEHICYRYKKEDDAGVSLTGNEQQRDHRTFVYLTQVD
YGNQTAQEALYAWTETMPNDSQWLFSLVFDYGERSLDPNAPVACAPATDWLARPDPFSRYDVGFEVRCHRLCRQVLMFHR
FPSELHAAETLVLRLLLEYQSSATLSQMISAQILAYETDGTVQSRPPLDLSYSPFSVSPSSEQWQPFPALAGLDDGQQYQ
LVDLYGEGIPGVLFRQSGGWYYRAPVRGSEGENSVTYGDWQLLPRVPAMQSSSMALMDVNGDGRLDWLVTAPGINGFFTL
NPSGEWSSFTPFSAFPTEFTHPSAQWADLSGSGLTDIALIGPKSVRLYANNREGFTRPVQIEQQYTLPVFARDATELVGF
SDMLGSGQQHLIRIRYDGITVWPNLGHGKFGEPFVLSTLSFDRQSFDPGRIILADLDGSGGADLLYMHSDYIEVFANLSG
NGLAAPMQLALPAGIRYDNLCQLNVADSQGLGVASLILSVPYMTPRHWHCDLTSIKPYLLEAVNNNQGNHNQIIYRSSAQ
EWLDEKQQFPGAVSALPFPIHIVSQTVNTDEISGSVLTQRSRYRKGVYDGVMREFYGFGYLEQTDTTGGVLTSEDTEPLI
SKSWFHTGREEDESDLYGTPYVGRFTVTVNPTWLSQFDASGGKDIALTDVSHENRAMLYRALKGSLLGSEVYGADNPIPY
TQTRQRYQVRLVQPAKGKNASVAMPGLLESVSYNWEQIATDPLIAQQVVMQKDCFGATVWQVNINYPRLTKPGVDPYPAI
LPDTAWASSYDSQQQVLRITEGRASYIDLTDAQAWRPGISDCQRTNILTYENYALPASGITFEMLSEPDGLLGNSRPRIY
AGQSQIFYQPAVPDLGARVDHVETAELDEASLAAYDGILSRDELIPLLAQAGYVHVPALLPVPGASDEPVYCIASAYTTY
LPASRFYLPETQRQTMLTGATTLTYDDYACCVISSRDAAGNTTSAAYDYRFLKPCKIIDINDNTAEIQLDALGGQMGSSF
YGTEQGVMTGFDSVTEYPVPVTLTVEQAINQAEGSPQHLATVIVDDPFSWMGQVTQQQCGSGWDTLLEQRFITFEGYIRA
AGRDWANSTREISGLDASVRGLLADAVTTPVHVATLTADRYPDDDQQQVRISVAFFDGFGRTLQQSGKVAPGDAWVRDTA
GELVIATGGEPVTTPADPRWAVSGRVEYNNKGLPVRAYQPYFINDWQYVVDSALRTAGYADTHFYDAPGREIKVVRAGSG
YQTRQQYFPWFTVSEDENDTQEAL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
tcdB1 AAL18487.1 TcdB1 Virulence tcd island Protein 0.0 45
tcdB2 AAO17202.1 TcdB2 Virulence tcd island Protein 0.0 44
tcaC'(2) CAI77376.1 putative insecticidal toxin complex protein Not tested tc-PAIYe Protein 0.0 42