Gene Information

Name : YPTS_3739 (YPTS_3739)
Accession : YP_001874149.1
Strain : Yersinia pseudotuberculosis PB1/+
Genome accession : NC_010634
Putative virulence/resistance : Virulence
Product : virulence plasmid 65kDa B protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4164347 - 4168798 bp
Length : 4452 bp
Strand : +
Note : PFAM: virulence plasmid 65kDa B protein; FG-GAP repeat protein; KEGG: yps:YPTB3553 insecticidal toxin complex
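
The Position, Length and Protein sequence fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon; the function name feature_length is a hypothetical helper, not part of the database.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


# Coordinates taken from the "Position" field of this record.
start, end = 4164347, 4168798

length_bp = feature_length(start, end)

# A complete coding sequence should be a whole number of codons.
codons, remainder = divmod(length_bp, 3)

# Assumption: exactly one terminal stop codon that is not part of the protein.
protein_len = codons - 1

print(length_bp, remainder, protein_len)
```

Under these assumptions the computed length agrees with the Length field (4452 bp), and the expected protein length is 1483 residues.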

DNA sequence :
ATGGAAAACAGTAAACAGCAAGTGGCCGTTGCTCCCTTGTCGCTCCCTAAAGGGGGCGGTGCGATTACCGGTATGGGCGA
TAGTTTAGGTCCTATCGGCCCCAGTGGCATGGCAACACTGACGCTGCCCCTGCCGATCTCCGCAGGCCGCGGTTACGCCC
CCTCGCTCACGCTAAGTTACAGTAGCGGCAGTGGTAACGGCCCGTTTGGCCTTGGCTGGCAACTCGGCACCATGGCGATT
CGCCGCCGAACCAACGCCCAAGTGCCACGTTACGATGAGTATGATGAATTTCTGGCTCCCAATGGGGAAGTCATGGTGGT
TGCCGCGGATCCGCAGGGCAATATCGAACGCACTGAACAGTCACTAAATGGGGAACAATTCAGCGTAATTCGTTATTTAC
CACGTATTGAAGGCAATTTTCATCGCATTGAGTATTGGCGACCCCGGGCAAATAACAGCCAGGCACCGTTTTGGTTAGTC
CACAGTTCAGATGGCCAAAAACACGGTTTGGGGTATTCCGCTGCCGCGCGAATTGCCGATCCACTGCACCCTGAGCATAT
TGCTGAATGGCTACTGGAAGAGTCGGTGTCTCTGAGCGGTGAGCATATCTGCTACCAGTATCAGGCAGAAGATGAACAGG
ATATTGATGAATCTGAGAAACAAAATCATCCGGCAGCCAGTGCGCAACGCTATCTAAGCACCGTGGTTTATGGCAATAGA
GAAGTGGCGCATGAACTCTATTGTCTGACTCAGCGACCTGCGCCGACAAGCTGGTTATTTAGCCTGATCTTCGATCACGG
CGAATACAGTAATATTGCGGAGCAGGTGCCGGTAATCATAAAAGGAAAATCTTGGAATTTCCGTCAGGACGCCTTCTCAC
GCTTTAGTTGCGGCTTTGAAGTGCGTACCCGCCGCCTGTGTCAGCAAGTGTTGATGTACCATAATTTGTCAGCGCTTAAA
GGTGATGAACCCGATGCTCAAGCCACGCTGGTTAGCCGTCTGCGTTTACACTATCAGCACGATGCTTATGCCACCCAACT
GGTGGGTTGTCAGCAGTTAGCCCATGAACCCGATGGCACCAAACGTAGCCTGCCACCGCTAGAATTTGATTATCAGGATT
TTTCAACGCGTGACGCCCTTGGTTGGCAACCACTTACTGATTGGGCTGAATTTAACTATCAGTACCAAATGGTGGATCTG
AACGGTGAAGGGATACCGGGGATGCTGTATCAAGATAGCGGCCATTGGTGTTATCGCCCACCGGTTCGTCAGCCTGATAC
CCTTGATGGCATCACCTTCGGTCCTGCTCAATCGTTGCCAACCCTGCCAGCCATGGGGGAAAACGCCACCTTGATGGATA
TCAATGGTGACGGCAAGCTGGATTGGGTGATTAGCCAACCGGGGTTGGCCGGCTACTTTAGCCGTGACCCCGACCAAAGC
TGGACACAATTTACCCCGTTGTCTGCTCTGCCAGCGGAGTATTTCCACCCACAGGCGCAACTGGCCGACTTGATCGGTAG
CGGATTATCTGATCTCGCCCTCATTGGGCCAAAAAGTGTCCGTTTGTATGCCAACCAGCGTGATGGTTTTGCGGCCGCAA
CCCAAGTGACACAAGATGATGACATCACGCTACCCCTATCGGGAGCGCACTACAGTGAACTGGTGGCATTCAGTGATGTA
ATGGGGTCGGGCCAACAGCATTTGGTGCGAGTGCGCCACAATAGCGTGACTTGCTGGCCCAATCTCGGCCATGGCCGTTT
TGGTCACCCTATCTCTTTACCGGGGTTTAACCAGCACATTGAGCAATTTAATCCGCTAGCGATCTATCTGGCGGATATTG
ATGGCTCCGGTACCATTGACCTTATTTATGCCACTGCCAGCCAACTGCTGATTTACCGCAACCAGAGTGGTAACCGCTTT
GCCGAACCAGTGGCTATCGCATTGCCAACAGGCATTCGCTTTGATAATAGCTGCCAACTGTCACTGGCAGATATTCAGGG
GCTGGGCGTGGCCAGCATCATGTTGAGTGTGCCGCACCCCACCACACAACATTGGCGCTATGATTTTGTCGCCAGTAAAC
CCTATTTACTGTGTACCACCAATAACAATATGGGTGCAGAGAACCAGTTGTTTTACCGCAGTTCCGTCCAATTCTGGTTG
GATGAAAAAGCGCAGGCGGCCAAACAGGGCCAATCACTGGCCTGCCAGTTACCCTTCCCGATCCATCTATTAGCGCAAAC
CACACAACTTGATGAGATCACCGGAAACAGCCTGAGTCAAACCGCCCGCTACTTCCATGGTTTTTATGATGGCGTACAAC
GTGAATTCTGTGGTTTTGGCCGTGTCGATACGCTGGATACCGATACCTCGGCACAAGGCAGCGCCGCTGAACGCACTGCA
CCGACCAAAAGTAGGAGCTGGTTCCATACCGGCCGGGCAGACAATGAGACACTATGGCAAAGTGAGTACTGGCAGGGCGA
TGGCCAAGACTACCCATTACTCCCCACCCGTCTGACAACATTTATTAACGATACCCAAGGTGACGACTTACTCAGTGAAC
TCGATGATAATCAAACGTTTTGGTTACACCGGGCGCTGAAAGGTTTATTGCTACGCAGTGAACTCTATGGTCTGGACGAC
AGCGAACTGGCCACACAACCGTATAGCGTCAACAGCTCTCGTTATCAGGTGCGGCAAATTCAACCCTCTGCTGATGGAAT
CTCATCCCCGGTTGCCTTGCCGATAATGCTGGAACAACTCAGTTATCACTACGAACGCATCGTGCAAGACCCGCAATGCA
GCCAACAGATAGTGCTACGTTGCGATGAATTTGGTCACCCATTGCACAGTGCGATAATCAATTATCCACGCCGCGATAAA
TACAGTATTCCCCCGTATTCCTGGCTGGCTAAAGAACATTGGGACAGCAATTTTGACGAGCAACAGCAACAGTTGCGCAT
CACTGAAAGCCAGCAATCTTATCACCATGAGATCAGTGATAAATTCTATGTGCTGGGCCTGCCTGCCGGGCAGCGCAGTG
ATGTACTGACCTATCCCGACAATTTCGTGCCTACCGCGGGGATTCATTGGGAAGAATTACAGCAGCCAGAGGGCCTGCTC
GGTACTAAAGCAGAGCGAACCTTTGCCGGGCAGCAGCAAGTTTTCTACACCTCGGACACCATTCCGGGCCTGGTTGCCTA
TAGTCAACAGGCAGAATTTGACGATCAAACCTTGGCCGCGTTGGATGAATTATTACCAGCGAATGAGCGTAAACAGCAGC
TTATCAAGGCCGGTTATCAAACAGCACCCCGTCTCTTTGCTCGCCCCGGAGAAACGGATATTTGGGTCGCCCAAAGTGGG
TTTACTGATTATGGCGATGCCTCCCGCTTCTATCGCCCCATCAGTCAACGTAGCTCACTCTTGGTAGGAAAAACGGCCTT
AGGGTGGGATAAAAATAGCTGTGCTATCACCCAAATGATATTGGCTGATGGCAGTACCACCCAGGCTGAATATGATTATC
GTTTTATTACCCCTTATCACCTCACGGATATCAACGATAACTCCCGTCATATTGAACTGGATGCGCTGGGGCGCATCACC
TCCAGCCGTTTTTGGGGCACTGAGCTTGACCCACAAACCAATAAAGTCATTGAGACCGGTTTTCCTTCAATCGCGGAACA
GCCCTTTACCCCCCCCAATTCAGTTGATGACGCGATCAGCCTGGAGAATACCCGCATTCCCGTCGCGCAGTTCTCTGTCT
ATCAGCCTCAGAGTTGGATGATTTCATTACAGCGTGATGAGATTGAAATATGGATTAGAGTCAATAAGATCACTCCAGAA
TATCTATTCCAGAATCATATTCTGATCGATAACCATTATCTTTGCCCCCTTGCGCTACACCGCTGGGTAAGACAAAACAA
CCTGCTCATCACTGAAGATGTTGGCCTGGCATTAGAAATGCCCATGCGCCAACCCCCTCATATATTAACCGTCGTCGTGG
ATAACTACTTTTCTGCTGCTGAGCCACAACAACATCAACAAACTCTCGCTTTCAGTGACGGCTTTGGCCGCGTGTTGCAA
AGCGCGCAACGGGTGGAGGCAGGAACCGCTTATTTCCACACAGGAGAGGGCGGTCTGGAGATGGATCAACAAGGTCATCT
CACGCAAGACGAGAGCGATAAACGCTGGGCAGTTTCTGGCCGTACCGAGTACGACAATAAAGGTCTGCCCATCCGCCGCT
ATCAGCCTTATTTCCTTGATGGCTGGCGTTATATCGCGGATGACAGCGCCCGTAAAGAAGCTTGGGCCAACACCCATATT
TACGACCCACTCGGGCGAGAAATAAAAGTGATCACCGCGAAAGGTTACCTGCGACGAACTCACTATTTCCCGTGGTTTGT
TATCAGCGAGGATGAAAATGACACGGCGTCAGAAATCACGCCGAATCCCTAA

Protein sequence :
MENSKQQVAVAPLSLPKGGGAITGMGDSLGPIGPSGMATLTLPLPISAGRGYAPSLTLSYSSGSGNGPFGLGWQLGTMAI
RRRTNAQVPRYDEYDEFLAPNGEVMVVAADPQGNIERTEQSLNGEQFSVIRYLPRIEGNFHRIEYWRPRANNSQAPFWLV
HSSDGQKHGLGYSAAARIADPLHPEHIAEWLLEESVSLSGEHICYQYQAEDEQDIDESEKQNHPAASAQRYLSTVVYGNR
EVAHELYCLTQRPAPTSWLFSLIFDHGEYSNIAEQVPVIIKGKSWNFRQDAFSRFSCGFEVRTRRLCQQVLMYHNLSALK
GDEPDAQATLVSRLRLHYQHDAYATQLVGCQQLAHEPDGTKRSLPPLEFDYQDFSTRDALGWQPLTDWAEFNYQYQMVDL
NGEGIPGMLYQDSGHWCYRPPVRQPDTLDGITFGPAQSLPTLPAMGENATLMDINGDGKLDWVISQPGLAGYFSRDPDQS
WTQFTPLSALPAEYFHPQAQLADLIGSGLSDLALIGPKSVRLYANQRDGFAAATQVTQDDDITLPLSGAHYSELVAFSDV
MGSGQQHLVRVRHNSVTCWPNLGHGRFGHPISLPGFNQHIEQFNPLAIYLADIDGSGTIDLIYATASQLLIYRNQSGNRF
AEPVAIALPTGIRFDNSCQLSLADIQGLGVASIMLSVPHPTTQHWRYDFVASKPYLLCTTNNNMGAENQLFYRSSVQFWL
DEKAQAAKQGQSLACQLPFPIHLLAQTTQLDEITGNSLSQTARYFHGFYDGVQREFCGFGRVDTLDTDTSAQGSAAERTA
PTKSRSWFHTGRADNETLWQSEYWQGDGQDYPLLPTRLTTFINDTQGDDLLSELDDNQTFWLHRALKGLLLRSELYGLDD
SELATQPYSVNSSRYQVRQIQPSADGISSPVALPIMLEQLSYHYERIVQDPQCSQQIVLRCDEFGHPLHSAIINYPRRDK
YSIPPYSWLAKEHWDSNFDEQQQQLRITESQQSYHHEISDKFYVLGLPAGQRSDVLTYPDNFVPTAGIHWEELQQPEGLL
GTKAERTFAGQQQVFYTSDTIPGLVAYSQQAEFDDQTLAALDELLPANERKQQLIKAGYQTAPRLFARPGETDIWVAQSG
FTDYGDASRFYRPISQRSSLLVGKTALGWDKNSCAITQMILADGSTTQAEYDYRFITPYHLTDINDNSRHIELDALGRIT
SSRFWGTELDPQTNKVIETGFPSIAEQPFTPPNSVDDAISLENTRIPVAQFSVYQPQSWMISLQRDEIEIWIRVNKITPE
YLFQNHILIDNHYLCPLALHRWVRQNNLLITEDVGLALEMPMRQPPHILTVVVDNYFSAAEPQQHQQTLAFSDGFGRVLQ
SAQRVEAGTAYFHTGEGGLEMDQQGHLTQDESDKRWAVSGRTEYDNKGLPIRRYQPYFLDGWRYIADDSARKEAWANTHI
YDPLGREIKVITAKGYLRRTHYFPWFVISEDENDTASEITPNP
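
The protein sequence corresponds to a codon-by-codon translation of the DNA sequence. The following is a minimal sketch, assuming the standard codon assignments and reading frame 1; the names CODON_TABLE and translate are illustrative only, and only the first 30 and last 12 bases of the gene are used.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon. Trailing partial codons are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 12 bases of the DNA sequence in this record.
head = "ATGGAAAACAGTAAACAGCAAGTGGCCGTT"
tail = "CCGAATCCCTAA"

print(translate(head))  # MENSKQQVAV
print(translate(tail))  # PNP*
```

The translated head matches the start of the protein sequence (MENSKQQVAV) and the tail matches its end (PNP) followed by the stop codon TAA.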

• Homologs from PAI DB

Gene      GenBank Accn  Product                                      Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity (%)
tcaC'(2)  CAI77376.1    putative insecticidal toxin complex protein  Not tested               tc-PAIYe    Protein         0.0    60
tcdB1     AAL18487.1    TcdB1                                        Virulence                tcd island  Protein         0.0    51
tcdB2     AAO17202.1    TcdB2                                        Virulence                tcd island  Protein         0.0    51