Gene Information

Name : YpsIP31758_0414 (YpsIP31758_0414)
Accession : YP_001399407.1
Strain : Yersinia pseudotuberculosis IP 31758
Genome accession: NC_009708
Putative virulence/resistance : Virulence
Product : insecticidal toxin complex protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 483253 - 487770 bp
Length : 4518 bp
Strand : -
Note : identified by similarity to GB:AAG09643.1; match to protein family HMM PF01839; match to protein family HMM PF03534

DNA sequence :
ATGGAAAACAGTAAACAGCAAGTGGCCGTTGCCCCCTTGTCGCTCCCTAAAGGGGGCGGTGCGATTACCGGTATGGGCGA
TAGTTTAGGCGCTATCGGCCCCAGTGGCATGGCAACACTGACGCTACCCCTGCCTATCTCCGCAGGCCGCGGTTATGCCC
CGCCACTGGCACTCAATTACAGTAGCGGCAATGGCAACGGCCCGTTTGGCCTTGGCTGGCAACTTAACACCATGGCCATC
TGCCGCCGAACCAGCAAGCGAGTGCCTCACTACGATGAACATGATGAATTTCTGGCACCGAGTGGTGAAGTATTAGTCGT
TGCCATTGATCAACAGGGAAATATCGAACGCACTGAGCAATCATTAAAAGGGGAGCAATTCAGCGTAATTCGTTATCTGC
CGCGTATTGAAGGCAGCTTTAATCGCATTGAGTATTGGCAACCCAGGGTAGATAACAGCCAGGCACCATTTTGGGTAGTC
CACGGTTCAGATGGTCAAAAACACTGTTTGGGGTATTCCGCCTCAGCCCGAATTGCCGATCCACAGCACCCTGAGCATAT
TGCTGAATGGCTGCTGGAAGAGTCGGTGTCTCTGAGCGGCGAGCATATCTGCTACCTGTATCAGGCAGAAGATGAACAAG
GTATTGACGAGGATGAAAAACAAAATCATCCCGCAGCCAGTGCGCAACGCTATCTAAGCACCGTGGTTTATGGCAATAGA
GAAGTGGCTCATGAACTCTATTGTCTGACTCAGCAACCCACGGAGAAAAGCTGGTTATTTAGCCTGATTTTCGATCATGG
CGAATACAGTAATATCGCCGATCAGGTTCCGATAGCCGAAGAGGGCAAATCCTGGACTTACCGTCAGGATGCCTTCTCTC
ACTTTAACTATGGCTTTGAAATCCGTACCCGCCGCCTGTGTCAGCAAGTGTTGATGTACCATAATTTGTCAGCGCTGGCC
GGTAATGAACCAGACCAACAACCGACATTGGTCAGCCGCTTACGGTTGAACTATCAGCACGATGTTTATGCCACGCAACT
GGTGGGTTGCCAACGTTTAGCCCACGAGCCAAAGGGTAAAACATGCAGCTTGCCACCATTAGAATTTGATTATCAGACAT
TCCCGGACAATAACGAGAAACCCTATTGGCAGCAACGATTAGAGGAAATCAATCCGACAGAATCGGATGATCAGGATTTA
TCCGCCAATTGGCAGCCATTAGAGGATTGGGCAGAATTTAACCACCAGTACCAAATGGTGGATCTGAACGGCGAAGGGAT
ACCGGGGATGCTATATCAGGACAGAGGCCATTGGTGTTATCGCCCACCGGTTCGTCAGCCTGGCACCCTTAATGGCATCA
CCTTCGGTCCTGCCCAATCGTTGCCAACCCTGCCAGCCATGGGGGAAAACGCCACCTTGATGGATATCAATGGTGACGGC
AAGCTGGATTGGGTGATTAGCCAACCGGGGTTGGCAGGCTACTTTAGCCGTGACCCCGACCAAAGCTGGACACAATTTAC
CCCGTTGTCTGCTCTGCCAGCGGAGTATTTCCACCCACAGGCGCAACTGGCCGACTTGATCGGTAGCGGATTATCTGATC
TCGCCCTCATTGGGCCAAAAAGTGTCCGTTTGTATGCCAACCAGCGTGATGGTTTTGCTGCCGCAACCCAAGTGACACAA
GATGATGACATCACGCTACCCCTATCGGGAGCGCACTACAGTGAACTGGTGGCATTCAGTGATGTAATGGGTTCGGGCCA
ACAGCATTTGGTGCGAGTGCGCCACAATAGCGTGACTTGCTGGCCCAATCTTGGCCACGGGCGTTTTGGTCACCCCATCT
CCTTACCGGGATTTAGCCAGCCTATTGAGCAGTTTAATCCGCTGGCGATCCATCTGGCGGATATTGATGGCTCCGGCACC
GTTGACCTGATCTATGCCACTGCACATCAATTACTGATTTACCGTAACCAGAGTGGTAACCGTTTTGCCAAACCGCTTGA
AGTCACACTGCCTGCGGGTGTTCGCTTTGATAATAGCTGCCAATTATCACTGGCAGATATTCAAGGGCTGGGTGTTGCCA
GCATCATCCTGAGTGTTCCACACCCGGCCCCACAGCACTGGCGCTATGATTTTGTCGCCAGTAAACCCTATTTACTGTGT
ACCACCAATAACAATATGGGTGCAGAGAGTCAGTTGTTTTACCGCAGTTCCGTCCAATTCTGGTTGGATGAAAAAGCGCA
GGCGGCCAAACAGGGCCGAACACTGGTCTGCCAGTTACCCTTCCCGATCCATCTATTAGCGCAAACTACGCAATTTGATG
AGATCACCGGAAACAGCCTGAGTCAAACCGCCCGCTATTTCCATGGTTTTTATGATGGCATACAACGTGAATTCTGTGGT
TTTGGCCGTGTTGATACGCTGGATACCAATACCTCGGCACAAGGCAGCGCCGCTGAACGCACCTCCCCAACCCAAACCTG
CCAGTGGTTCCATACCGGGCAGCCTGGCAATGAACAACTTTGGCACCATGAATATTGGCAGGGCGACAGCCAGGCCTACG
GGCTGCTATCCACCCGCCTGACGAAATTTACCGGAGAAATGCAAGGTGATGAAACCTTAAATGACATCAATGATAATCAA
GCTTATTGGTTGCATAGGGCGCTGAAAGGTTCATTGTTGCGCAGCGAACTCTATGGTCTGGACGGCAGCGAACTGGCCAC
ACAACCCTATAGCGTCAACAGTGCTCGCTATCAGGTGCGGCAAATTCAATCCTCTACGGATGAAATAACCTCCCCGGTAG
CATTACCGATGATGCTGGAACAACTCAACTATCACTACGAACGCATAGTGCAAGACCCGCAATGCAACCAACAGATTGTG
CTACGTTGCGATGAATTTGGTCACCCATTGCACAGTGCGACAATCTATTATCCACGCCGCGATAAAGCCAGTATTCCTCC
GTATTCCTGGCTGGCTGAAGGACATTGGGACAGCCATTTTGACGAGCAACAGCAGCAGTTGCGCATCACTGAAAGCCAGC
AATCTTATCGCCATGAGATCAGTGATAAATTCTATGTGCTAGGCCTGCCGGCCGGGCAGCGCAGTGATGTACTGACCTAT
CCAGACAATTTCGTGCCTACAGCGGGGATTCATTGGGAAGAATTACAACAACCAGAGGGCCTGCTCGGTACTAAAGCAGA
GCGAACCTTTGCTGGGCAGCAGCAGGTTTTCTACGCCTCGGACACCATTCCGGGCCTGGTTGCCTATAGCCAACAGGCGG
AATTTGACGATCAAACCTTGGCCGCGTTGGATGAATTATTACCAACGAATGAGCGTAAACAACAGCTTATCGAGGCAGGT
TATCAACGAGCACCCCGTCTATTTTCTCGCCCCGGAGAAACAGATATTTGGGCCGCCCAACGTGGGTTTACTGACTATGG
CGATGCATCTCGTTTCTACCGCCCCATCAGTCAACGTAGCTCACCATTGGTAGGAAAAACAGCCTTAGAATGGGATAAAA
ATAGCTGTGCCATTACCCAAATGATATTGGCTGATGGCAGTACCACCCAGGCTGAATATGATTATCGTTTTATCACCCCT
TATCACCTCACGGATATCAACGATAACTCCCGTCATATTGAACTGGATGCGCTGGGGCGCGTCACCTCCAGCCGTTTTTG
GGGCACCGAGCTTGACTCACAAACCGGTGAGGTCAGCACAACCGGTTTCCCTTTAATCGCTGAGCACCCCTTTACCGTAC
CCAATTCAGTTGATGCCGCTATCAGCATGGAGAATACCCAAGTTCCCGTCGCGCAATTCTCTGTCTATCAACCTCAGAGC
TGGATGATTTCGTTACAGCTTGATGACATTGAAACATTGTCTGAAACCAATAACGTCACTCTAGAATATCTATTCCAGAA
TCATATTCTGATCGATAACTATTATCTTTGCCCCCTTGCTCTACGTCGCTGGATAAGACAAAGCAACCCTCTCATCACCG
AAAATGTCGGCCTGGAATTGAAAAATCCCGTGCGCCAACCCCCCCATGTCTTAACCGTCGTCGTGGATAACTACTTTTCT
GCTGCTGAGCCACAACAACATCAACAAACTCTCGCTTTCAGCGACGGCTTTGGCCGCGTGTTGCAAAGCGCGCAACGGGT
AGAGGCAGGAACCGCTTATTTCCACACAGGAGAGGGCGGACTGATAGCTGATCAGCAAGGTCATCTCATGCAAGACGAGA
GCGATCAACGCTGGGCCGTTTCTGGCCGTACCGAGTACGACAATAAAGGTCTGCCCATCCGCCGCTATCAACCCTATTTC
CTCGATGACTGGCGTTATATCGCTGATGACAACGCCCGCAAAGAGGCCTGGGCCGACACCCATATTTACGACCCACTCGG
ACGAGAAATAAAAGTGATCACCGCCAAAGGTTATCTGCGACGAGCACACTATTTCCCGTGGTTTGTTATCAGCGAGGATG
AAAATGACACGGCGTCAGAAATCACGCCGAATCCCTAA

Protein sequence :
MENSKQQVAVAPLSLPKGGGAITGMGDSLGAIGPSGMATLTLPLPISAGRGYAPPLALNYSSGNGNGPFGLGWQLNTMAI
CRRTSKRVPHYDEHDEFLAPSGEVLVVAIDQQGNIERTEQSLKGEQFSVIRYLPRIEGSFNRIEYWQPRVDNSQAPFWVV
HGSDGQKHCLGYSASARIADPQHPEHIAEWLLEESVSLSGEHICYLYQAEDEQGIDEDEKQNHPAASAQRYLSTVVYGNR
EVAHELYCLTQQPTEKSWLFSLIFDHGEYSNIADQVPIAEEGKSWTYRQDAFSHFNYGFEIRTRRLCQQVLMYHNLSALA
GNEPDQQPTLVSRLRLNYQHDVYATQLVGCQRLAHEPKGKTCSLPPLEFDYQTFPDNNEKPYWQQRLEEINPTESDDQDL
SANWQPLEDWAEFNHQYQMVDLNGEGIPGMLYQDRGHWCYRPPVRQPGTLNGITFGPAQSLPTLPAMGENATLMDINGDG
KLDWVISQPGLAGYFSRDPDQSWTQFTPLSALPAEYFHPQAQLADLIGSGLSDLALIGPKSVRLYANQRDGFAAATQVTQ
DDDITLPLSGAHYSELVAFSDVMGSGQQHLVRVRHNSVTCWPNLGHGRFGHPISLPGFSQPIEQFNPLAIHLADIDGSGT
VDLIYATAHQLLIYRNQSGNRFAKPLEVTLPAGVRFDNSCQLSLADIQGLGVASIILSVPHPAPQHWRYDFVASKPYLLC
TTNNNMGAESQLFYRSSVQFWLDEKAQAAKQGRTLVCQLPFPIHLLAQTTQFDEITGNSLSQTARYFHGFYDGIQREFCG
FGRVDTLDTNTSAQGSAAERTSPTQTCQWFHTGQPGNEQLWHHEYWQGDSQAYGLLSTRLTKFTGEMQGDETLNDINDNQ
AYWLHRALKGSLLRSELYGLDGSELATQPYSVNSARYQVRQIQSSTDEITSPVALPMMLEQLNYHYERIVQDPQCNQQIV
LRCDEFGHPLHSATIYYPRRDKASIPPYSWLAEGHWDSHFDEQQQQLRITESQQSYRHEISDKFYVLGLPAGQRSDVLTY
PDNFVPTAGIHWEELQQPEGLLGTKAERTFAGQQQVFYASDTIPGLVAYSQQAEFDDQTLAALDELLPTNERKQQLIEAG
YQRAPRLFSRPGETDIWAAQRGFTDYGDASRFYRPISQRSSPLVGKTALEWDKNSCAITQMILADGSTTQAEYDYRFITP
YHLTDINDNSRHIELDALGRVTSSRFWGTELDSQTGEVSTTGFPLIAEHPFTVPNSVDAAISMENTQVPVAQFSVYQPQS
WMISLQLDDIETLSETNNVTLEYLFQNHILIDNYYLCPLALRRWIRQSNPLITENVGLELKNPVRQPPHVLTVVVDNYFS
AAEPQQHQQTLAFSDGFGRVLQSAQRVEAGTAYFHTGEGGLIADQQGHLMQDESDQRWAVSGRTEYDNKGLPIRRYQPYF
LDDWRYIADDNARKEAWADTHIYDPLGREIKVITAKGYLRRAHYFPWFVISEDENDTASEITPNP

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
tcaC'(2) CAI77376.1 putative insecticidal toxin complex protein Not tested tc-PAIYe Protein 0.0 58
tcdB1 AAL18487.1 TcdB1 Virulence tcd island Protein 0.0 51
tcdB2 AAO17202.1 TcdB2 Virulence tcd island Protein 0.0 50