Gene Information

Name : EJP617_14820 (EJP617_14820)
Accession : YP_005818050.1
Strain : Erwinia sp. Ejp617
Genome accession: NC_017445
Putative virulence/resistance : Virulence
Product : Insecticidal toxin complex protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1651991 - 1656286 bp
Length : 4296 bp
Strand : +
Note : -
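
The Length field can be cross-checked against the Position field. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the usual convention for genome annotations, but an assumption here); the helper name span_length is hypothetical and used only for illustration.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
gene_length = span_length(1651991, 1656286)

# A complete coding sequence is a whole number of codons; the last codon is the stop.
codons = gene_length // 3
protein_length = codons - 1  # the stop codon is not translated into a residue
```

Under these assumptions gene_length agrees with the 4296 bp given in the Length field, and protein_length gives the number of residues expected in the protein sequence.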

DNA sequence :
ATGCAAAATTCAGATAGTTTGATACTGGAAGCGCCCTCTCTGCCCAAGGGCGGCGGGGCGGTATCCGGCCTGAAAGGGGA
TATGGCGGCGGCCGGGCCGGACGGAGCGGCCACGCTCAGCGTCCCGCTGCCGGTCAGCGCCGGGCGCGGCTATGCGCCGG
CGCTGGCGCTCAGCTACCACAGCCGGGGCGGTAACGGGCCATTCGGTATGGGTTGGGACGTTAACCTGTCAGCCATCCGT
CGCCGCACCAACAAAGGTATACCGACGTACGGCGCGGACGATGAGTTTACCGGGCCGGATGGCGAAGTGCTGGTTCCGCT
ACTGACCGCAGACGGGACGCCGGAGACACGCAGTGCCTCCGCGCTACTGGAAGTCGACGCTGGTGGAAATTACCACGTTC
GCGCTTACCGCAGCCGCACGGAATCCGACTTCAGTCGTCTGGAGTACTGGGTGGCTGACAGCGATAGCGCCGAGGCGTTC
TGGCTGCTGTACCGCCCGGACGGGCAGCTGTTTTTACTGGGCCGCAATGCGCAGGCGCGTATCAGCAATCCGCATCATGC
GCGACAAACCGCCGTCTGGCTGATCGAGTCGTCGGTGTCCGTGAGCGGGGAGCAGATTTATTATCAGTATCGGGCAGAAG
ACGATACTGACTGCGCGGACGCGGAAAAAGCCGCACATGGCGCTGCCACGGCACAGCGTTATCTGATGGCGGTCTGGTAC
GGCAACCGCACCGCTTCCCGTACGCTGCCCGCGTTTACCTCAGCCCCTTCCGCCGAAGACTGGCTGTTCACCCTGGTGCT
GGATTACGGTGAGCGCGACACGGATCCGGCGCAGGCTCCCGACTGGTTGCCCCCCGGTGAGGGGAGCTGGCCGTGCCGCC
CCGATCGCTTTTCCAGCTGGCACTACGGCTTTGAGCTGCGCACCCGCCGCCTGTGCCGCCAGGTTTTGATGTATCACGCC
GTGACGGCGCTGGCCGGAGAAGAAGAACCCGGCGATAAGCCGCAGCTGGTGGCGCGTCTCTGCCTGGATTACCAGCAGAC
TCCCTCGGTGACCATGCTCAAGTCGGTGCGCCAGAACGCGTATGAAGCTGATGGTCGTGTGCTCAGTCTGCCGCCGCTGA
CGTTTGGCTGGCATACCTTTACGCCGCCGAAAACGGCTGCATGGCAGCAGCGTGAAGATATGGGCAACCTGAATGCGTTG
CAGCCTTACCAGCTGGTGGATCTGAATGGCGAAGGGCTGGCCGGGATCCTGTATCAGGATGACGGCGGGTGGTGGTATCG
CGCACCGGTGCGTCAAACCGGCGGCAGCGCCGATGAGGTCACCTGGGATAAGGCCGCGCCTCTGCCCATCATGCCGGCCC
TGCACGCGGGCGGTGCGCTGCTCGATCTGAACGGCGATGGCTACCTTGAGTGGCTGGTGACCGCGCCTGGCCTGGCCGGG
CATTATGGCCGCACGGCCGACCGTCAATGGCGAAACTTCACGCCGCTGTCAGCACTGCCGGTGGAGTATGGTCATCCGCG
TGCCCGGCTGGCGGATCTCACCGGGTCCGGGCTGAGCGACCTGGTGCTGATCGGCCCCACAAGCGTGCGCCTGTACAGCG
GAACCGGAGAGGGCTGGCAGAAGGCGCTAACGGTGATGCAGGCAGACTCTGTGACTCTGCCGGTGTCACCGACAAAAGAT
GACAGCATGCTGATTGCCTTCAGTGATATGGACGGTAGCGGCCAGCAGCACCTGGTGCAGGTGTCTGCCAGCGGGGTACG
CTACTGGCCCAACCTCGGGCACGGGCGCTTCGGCCAGCCGCTGGATGTCCCCGGCTTCAGTCAGCCCGCCGCGCGTTTTA
ATCCGCAGCAGCTGTTCCTCGCCGATATCGACGGCTCAGGTACCACCGACCTGATTTATGCTCTTGGCGACAGCCTGCTT
ATCTATCTCAACCAAAGCGGCAACGCGTTTGCCGCGCCATTTAGCCTGCCGCTGCCAGCCGGGGTGCGCTATGCCCGCAC
CTGTAGCCTGCAGCTGGCGGATATTCAGGGGCAGGGCGTTGCCAGTCTGATACTGACCAGCCCTCATCCGACACCGCGTC
ACTGGGTTTGTCATTTGTCAGAGCAGAAACCCTGGCTGCTGAACGCGATGGATAACCATATGGGTGCCACACATCGCCTG
TACTACCGCAGCTCGGCTCAGTTCTGGCTGGATGAGAAAGCCGAAGCCGCGCAGGCGGGAAAACCGCAGCCTGCCTGTTA
TCTGCCGTTTGCCCTGCATACGGTGCAGCGCACCGAGGTAACCGATGACATTACCGGCAACCGGCTGGTCAGCAGCGTGC
GCTACCGGCACGGCGCATGGGACGGGCGTGAACGCGAATTTCGTGGTTTTGGTCTGGTGGAAGTCAGCGATACCGATACT
GGCACCAGTCGTGGAAGTGCGGCAGAAATCAGCATGCCTGCCATCACGCGTAGCTGGTATGCCACTGGCCTGGTGCAGGT
GGATGACCAGCTGCCACGGCAATACTGGCAGGGCGATAGCGCCGCTTTTGCTGGATTTGTTCCCCGTTTCACCATCGGCA
GCGGTGACGACGAGCAGGCGTACGTCCCGGATGACAGCACCCGCTTCTGGCTGAAGCGGGGGGTAAAGGGCATGCTGCTG
CGCAGCGAGCTGTACGGTGCGGACGACAGCGTTCTGGCGACTACCCCCTACAGCGTTAACGAAACCCGCCCCCAGGTGCG
CTTAGTCGAGGTGCGGGGCGTTTATCCGGTGACCTGGTCGGTGTCGGCGGAAACGCGTACTTATATCTATGAGCGGGTCA
GCGGCGATCCGCAGTGCAGCCAGCAGGTGTTGCTGCGCAGCGATCGCTACGGCCAGCCGCTGCGTCAGGTCAGCATCAGC
TATCCGCGCCGCCGCCAGCCGACTGAAAGCCCGTACCCGTCAACGCTGCCGCTGACGTTGTTTGCCAGCAGCTACGACCC
GCAGCAGCAGGAACTGCGCCTGGTGTTGCAGCAGAGCCGCTGGCATACGTTGTCAGACAGCGCGACCGGTGTATGGCTGG
CTGGACTGTCTGATGCCACCCGCAGCGATATTTTCACTTACTCTGCTACCTCCGTCCCGGCGCAGGGTCTGACGCTGGAG
CAGCTTTCCGCCAGCGGCAATCTGACAGGCGAAGATCGGCCTGCCACCTTCGCCGGGCAGCAGCAGGTCTGGTATCTGGA
TACGCAGGGTGAGGCGGTCACGGCAATGCCTGCCATCCCACCGCGCCTTGCTTTTAGTGAAAGTGCGGTACTGGACGAGC
CTCTTATCGCCTCACTTGCGCAGGACATCAGCGCTGAAACCCTGACGCAGGCCGGTTACTCGCCGTCCGGCTATCTTTTT
GCCCGTCGCGGTGAAAGCGGCAAATTGTTGTGGACGGTACGCCAGGGTTACAGCAGCTATGGTTCTGCTGAACATTTCTG
GCTGCCTGAATCCGTGCGTGACACCCTGCTGACCGGTGCTACCACCGTCACCCGCGACCGTTACGACTGCGTGATAACCC
AGCTGCGCGATGCGGCCGGGCTGACTACCAGCGCGCAGTATGACTGGCGTTTTCTGACGCCGGTCAGCGTGACTGATGCC
AACGACAATCGGCATACAGTCACGTTGGATGCGCTGGGGCGGGTCTCTGTGATGCGTTTTAGCGGCACGGAAAACGGCAG
CGCCACGGGCTACAGCGATAAAGCCATCGACCCGCCGCAAACGGCGGAACAGGCGCTGGCGCTCTCAGCACCGCTGCCGG
TACACCAGCTGCTGGTTTATATCACCGATAGCTGGATGACGCAGCGCGCAGAAAAACAGCCGCCGCACGTTGTGATGCTC
ACTACCGACCGCTATGACAGCGACCCGCAGCAGCAGGTCCGCCAGCAGGTGGTGTTCAGCGACGGCTTCGGACGCGTGCT
ACAGACCTCCGTTCGCCAGGCTGACGGTGACGCCTGGCAGCGTAGCGCGGCGGGCGCGCTGGTGAGCGCGTCCAACGGTG
CGCCTCAGCTGGCAACCACCACCTCGCGCTGGGCGGTCAGCGGGCGTACGGAATATGACAATAAAGGCCAGGCGATTCGC
ACTTACCAGCCATTTTTCCTCAATAGCTGGCAATACCTCAGTGATGACAGCGCCCGTCAGGATCTGTATGCCGACACGCA
TTATTACGACCCCACGGGGAGGGAATGGCAGGTGAAAACCGCCAAATGCTGGCTGCGCCGCAGCCTGTTCACGCCGTGGT
TTGTGGTCAGCGAGGACGAAAATAACACCGCCGCTGAAGCGGAGGCTGGTATTTAA

Protein sequence :
MQNSDSLILEAPSLPKGGGAVSGLKGDMAAAGPDGAATLSVPLPVSAGRGYAPALALSYHSRGGNGPFGMGWDVNLSAIR
RRTNKGIPTYGADDEFTGPDGEVLVPLLTADGTPETRSASALLEVDAGGNYHVRAYRSRTESDFSRLEYWVADSDSAEAF
WLLYRPDGQLFLLGRNAQARISNPHHARQTAVWLIESSVSVSGEQIYYQYRAEDDTDCADAEKAAHGAATAQRYLMAVWY
GNRTASRTLPAFTSAPSAEDWLFTLVLDYGERDTDPAQAPDWLPPGEGSWPCRPDRFSSWHYGFELRTRRLCRQVLMYHA
VTALAGEEEPGDKPQLVARLCLDYQQTPSVTMLKSVRQNAYEADGRVLSLPPLTFGWHTFTPPKTAAWQQREDMGNLNAL
QPYQLVDLNGEGLAGILYQDDGGWWYRAPVRQTGGSADEVTWDKAAPLPIMPALHAGGALLDLNGDGYLEWLVTAPGLAG
HYGRTADRQWRNFTPLSALPVEYGHPRARLADLTGSGLSDLVLIGPTSVRLYSGTGEGWQKALTVMQADSVTLPVSPTKD
DSMLIAFSDMDGSGQQHLVQVSASGVRYWPNLGHGRFGQPLDVPGFSQPAARFNPQQLFLADIDGSGTTDLIYALGDSLL
IYLNQSGNAFAAPFSLPLPAGVRYARTCSLQLADIQGQGVASLILTSPHPTPRHWVCHLSEQKPWLLNAMDNHMGATHRL
YYRSSAQFWLDEKAEAAQAGKPQPACYLPFALHTVQRTEVTDDITGNRLVSSVRYRHGAWDGREREFRGFGLVEVSDTDT
GTSRGSAAEISMPAITRSWYATGLVQVDDQLPRQYWQGDSAAFAGFVPRFTIGSGDDEQAYVPDDSTRFWLKRGVKGMLL
RSELYGADDSVLATTPYSVNETRPQVRLVEVRGVYPVTWSVSAETRTYIYERVSGDPQCSQQVLLRSDRYGQPLRQVSIS
YPRRRQPTESPYPSTLPLTLFASSYDPQQQELRLVLQQSRWHTLSDSATGVWLAGLSDATRSDIFTYSATSVPAQGLTLE
QLSASGNLTGEDRPATFAGQQQVWYLDTQGEAVTAMPAIPPRLAFSESAVLDEPLIASLAQDISAETLTQAGYSPSGYLF
ARRGESGKLLWTVRQGYSSYGSAEHFWLPESVRDTLLTGATTVTRDRYDCVITQLRDAAGLTTSAQYDWRFLTPVSVTDA
NDNRHTVTLDALGRVSVMRFSGTENGSATGYSDKAIDPPQTAEQALALSAPLPVHQLLVYITDSWMTQRAEKQPPHVVML
TTDRYDSDPQQQVRQQVVFSDGFGRVLQTSVRQADGDAWQRSAAGALVSASNGAPQLATTTSRWAVSGRTEYDNKGQAIR
TYQPFFLNSWQYLSDDSARQDLYADTHYYDPTGREWQVKTAKCWLRRSLFTPWFVVSEDENNTAAEAEAGI
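
The protein sequence corresponds to a conceptual translation of the DNA sequence. The following is a minimal Python sketch of that step, assuming reading frame 1 on the given strand and the standard codon assignments (which the bacterial translation table 11 shares for non-start codons). The names CODON_TABLE and translate are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard codon assignments, listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 nucleotides of the DNA sequence in this record.
head = translate("ATGCAAAATTCAGATAGTTTGATACTGGAA")
tail = translate("GCTGGTATTTAA")
```

Here head evaluates to MQNSDSLILE and tail to AGI, which match the first ten and last three residues of the protein sequence listed above.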

Homologs from PAI DB

Gene      GenBank Accn  Product                                      Virulence or Resistance  PAI or REI  Alignment Type  E-value  Identity (%)
tcdB2     AAO17202.1    TcdB2                                        Virulence                tcd island  Protein         0.0      54
tcdB1     AAL18487.1    TcdB1                                        Virulence                tcd island  Protein         0.0      53
tcaC'(2)  CAI77376.1    putative insecticidal toxin complex protein  Not tested               tc-PAIYe    Protein         0.0      44