Gene Information

Name : PAU_03849 (PAU_03849)
Accession : YP_003042678.1
Strain : Photorhabdus asymbiotica ATCC 43949
Genome accession: NC_012962
Putative virulence/resistance : Virulence
Product : Insecticial toxin complex
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4475526 - 4479992 bp
Length : 4467 bp
Strand : +
Note : -

DNA sequence :
ATGCAGGATTCACCCAAAGTATCAGTTACAACATTGTCGCTGCCCAAAGGTGGTGGCGCTATTAATGGAATGGGAGAAGC
GCTTAACACCGCCGGTCCAGAAGGGGCCGCAACGCTCTCTCTGCCGTTACCTCTCTCCTCTGGGCGAGGTGTTGCCCCCG
GATTGTCACTGATTTATAGCAGCAGTGCAGGCAATGGGCCTTTTGGCATCGGCTGGCAATGTGGCGTCATGACCATCAGC
CGCCGTACTCAACATGGCATCCCGCAATATGGTACTGACGACACATTTTTATCCCCCCAAGGCGAAGTGATGAATATCGC
TCTGAATGATCAGGGACAACCAGATATCCGCCAGGGAGTTAAGACTCTGCAAGGCGTTACCCTGCCTGTTTCCTATACGG
TGACCCACTACCAACCCCGCCAGATACAGGATTTCAGTAAAATAGAATACTGGCAACCCTCCTCTGGTCAGGAAGGACGT
CCTTTCTGGTTGATATCATCACCAGACGGACAACTGCATATCCTGGGAAAAACAGCACAAGCTTGCCTGACTAACCCACA
AAATGATCAGCAAATCGCTCAGTGGTTACTAGAAGAAACCGTGACGCCAACCGGTGAACATGTCAGCTATCAGTATCGGG
CCGAAGATGAAGCCAATTGCGACGATAACGAAAAAACACGCCATCCGAATACTACAGCACAACGCTATCTAGTACAGGTA
AACTACGGGAACATTAAACCACAGGCCAGCCTGTTCGTGCTGGATAATACACCACCAACCCCGGAAGAATGGTTGTTTCA
TCTGGTCTTTGACCACGGAGAACGAGATACCTCACTTTATACCGTACCAGCATGGGATACCGGTACAGCACAATGGTCTG
CACGCCCGGATACTTTTTCCCGCTATGAATACGGTTTTGAAGTCCGTACCCGCCGTCTGTGTCAACAGGTGCTGATGTTT
CACCGTATCGCGCTGCTGGCTGGAGAAGCCAATACTAATAGCACACCAGAGTTGGTGGGGCGTCTGATACTGGACTATGA
CAAAAATGCCAGCGTCACAACCTTAATCGCCGCCCGTCAATTGAGCCATGAATCTGACGGCAAGCCAACCGCGATGCCAC
CACTGGAGTTTGCTTGGCAACAATTTGATCATGAGAAGATACCAGCCTGGCAACGTTTTGATACTCTGGATAATTTCAAT
CCTCAACAACACTATCAGCTAGTCGACCTACGGGGAGAAGGGTTGCCTGGAATGTTGTATCAAGAACGCGGTGCCTGGTG
GTACAAAGCCCCACAACGTCAGGAAGGCAAAGACAGTAATGCGGTAACCTACAGCAAAATAGCCCCTCTGCCCACCCTGC
CTAGTTTGCAGGAAAATGCCTCACTGATGGATATCAACGGGGACGGTCAACTGGACTGGGTAGTCACCGCTTCCGGCATC
CGGGGATACCACAGCCAACAACCTGATGGAAAATGGACACACTTTACGCCAATCAGTGCCTTGCCTGTGGAATATTTTCA
TCCAAGTGCCCAATTCGCCGATCTTATCGGTGCAGGCTTATCCGATTTAGTGTTGATAGGACCGAAAAGCGTGCGTCTGT
ATGCCAACCAGCGTGACGGCTGGCGCAAAGGAGAAAATGTGCCACAATCCGCCGGTATTACCTTGCCGATCGCCGGCACC
GATGCCCGCAAACTGGTGGCATTCAGTGATATGCTAGGTTCAGGTCAGCAACATCTGGTAGAAATCAGCGCTAACAGTGT
CACTTGCTGGCCCAATCTGGGACATGGCCGTTTCGGTCAGCCACTAACCCTTCCAGGATTCAGCCACCCCGAAAACCGTT
TTAATCCCGAACGGCTGTTTCTGGCGGATATCGACGGCTCCGGCACCGCAGACATTATCTATGCACAATCCGATTCTTTG
TTAATTTATCTCAACCAAAGCGGTAATCAGTTTGACGCACCACTGACATTAGCTTTGCCTGAAGGTGTCCAGTTTGACAA
CACCTGTCAGCTACAAGTTGCTGATATTCAGGGACTGGGGGTAGCCAGTTTGATACTGACAGTACCTCATATGACACCTC
ATCATTGGCGTTGTGACCTGTCATTGACCAAGCCTTGGTTATTAAAGGTGATGAACAATAACCGGGGAGCACATCATACC
CTGCATTACCGCAGTTCTGCTCAGTTCTGGCTAGATGAAAAATTACAACTCACCAAAGCGGGCAAACCACCCATCTGTTA
CCTGCCATTTCCTATGCATCTACTGTGGCACACTGAAATTCAGGATGAAATCAGTGGTAATAAACTCACCAGTGAAGTGA
GTTACAGTCATGGCGTTTGGGATGACAAAGAGCGGGAATTCAGGGGCTTTGGTTGTATCAAACAAACAGATACCACGACT
TTTTCCCACGGCACCGCACCTGAACAAGCTTCTCCATCACTCAGCGTCAGTTGGTTTGCCACCGGGATTGATGCAGTAGA
CAATCAGTTATCGGCAGAATATTGGCAGGGAGATACTCAGGCTTGTACCGAATTTAAAACCCGCTATACCACTTGGGACA
GTACCAGCCAAACGGATAAAGTTTTCTCCCCCAATGATACACAACGTAACTGGCTAACCAGAGCAGTGAAAGGCCAGTTG
CTACGCAATGAAGTATACAGTCTGGACGGAACCGAGAAGGAAACAATCCCTTATATCGTTAGTGAGTCACGCTATCAGGT
ACGGTTTATTCCGGTAGATAAAGAAACAGAGCTATCAGCCTGGGTCTCGGCTATTGAAAACCGCAGTTATCACTACGAGC
GTATTGTCAGTGATCCGCTGCTTACCCAAAGTATTCGGTTGCAACAAGATATTTTTGGGCAACTATTGCAAGGGGTTGAT
ATCTCCTGGCCCCGCCGGGAAAAACCTGCTGAAAACCCCTATCCGTCAACTCTGCCAGATACCTTGTTCGACAGTAGCTA
TGATGACCAACAGCAGGTGCTGCGTCTGGTTCGGCAAAAAAATAACTGGCATCACCTGACTGACGGAGAAAATTGGCGGT
TAGGCTTACCTGATGCTCAACGCAGCGATGTTTATACCCATGATCGGTCTAAAATTCCAGCAGACGGCATTTCTCTTGAA
GTATTGCTAGAAGAAGATGGCTTACTGGCTGATGAAAAAGCCGCAGTTTATCTGGGTCAGCAACAGACATTTTATACCGC
CGGCCAATCAGAAATACCGCTGGAAAAACCTACGCTTCAGGCATTGGTTGCATTTTACGAAACCGCCATGATGGACGACA
CATCATTACAGGCTTATGACGGCGTAATTGAACAACAAGCGTTGAACACGGCACTGACACAAGCGGGTTATCAGCAATCA
GCTCGACTGTTTAATACCGGCTCAGAAAGTCCGGTATGGGTCGCGCGACTGGGATACACCGATTACGGAGATGCTACACA
GTTCTGGCGGCCCCAAGCTCAGCGTAACTCGTTACTGACCGGGAAAACCACATTGAGTTGGGATACCCATCATTGTGTAG
TGACGCAGACTCAGGATGCCGCAGGCTTAACGACACAAGCCCACTACGATTATCGTTTTCTAACACCGATACAACTGACA
GATATCAATGATAATCAACACATTGTGACATTGGATGCTCTGGGACGCGTTATCACCAGCCGGTTCTGGGGTACTGAAGC
CGGACAAGTCACAGGCTATTCCACACCAGCCGATAAACCCTTTACGCCACCAGATTCGGTTGATAAAGCACTCGCATTAA
CAGGCCAACTCCCCGTTGCTCAATGTATCGTCCATGCCATTGACGGTTGGATGCCAGCGTTATCATTAACCCACCTTTCC
GGGACACAGGAAGAAGCCGAAACCCAATGGAAGCAACTTCAAGCTGCCCAGGTAGTGACCGAAGATGGGAAAATACGTAC
GCTTAGCTGGCAACAGGCCATAGATCGTCAAAAATTGACCGTCCAGATGGCCTCCTTATTCGCGGATATTCCCCGTTTAC
CTCCCCATACGCTAACGATTACTACAGATCGTTACGACAACGATCCGCAGCAACTACACCAGCAGACAATCAGCTTCAGT
GACGGTTTTGGCCGGTTACTCCAGAGTTCAGTTCGTTACGAATCCGGTGATGCCTGGCAACGTAAAGAAGATGGCGGATT
AGTCGTCGATACAAATGGAGCGCTGGTCAGTGCACCAACGGATACTCGTTGGGTCGTATCCGGTCGCACAGAATATGACG
ACAAAGGCCAGCCAGTGCGCACTTATCAACCCTATTTTCTCAATGACTGGCGCTACGTCAGTGATGACAGTGCACGGGAT
GATCTGTTTGCGGATACCCATATTTATGATCCATTGGGACGTGAATACAAAGTCATCACGGCTAAGAAATATCTAAGGGA
AAAACAGTACACCCCTTGGTTTGTCGTGAGTGAGGATGAAAACGACACAGCATCACGAAACCTATAA

Protein sequence :
MQDSPKVSVTTLSLPKGGGAINGMGEALNTAGPEGAATLSLPLPLSSGRGVAPGLSLIYSSSAGNGPFGIGWQCGVMTIS
RRTQHGIPQYGTDDTFLSPQGEVMNIALNDQGQPDIRQGVKTLQGVTLPVSYTVTHYQPRQIQDFSKIEYWQPSSGQEGR
PFWLISSPDGQLHILGKTAQACLTNPQNDQQIAQWLLEETVTPTGEHVSYQYRAEDEANCDDNEKTRHPNTTAQRYLVQV
NYGNIKPQASLFVLDNTPPTPEEWLFHLVFDHGERDTSLYTVPAWDTGTAQWSARPDTFSRYEYGFEVRTRRLCQQVLMF
HRIALLAGEANTNSTPELVGRLILDYDKNASVTTLIAARQLSHESDGKPTAMPPLEFAWQQFDHEKIPAWQRFDTLDNFN
PQQHYQLVDLRGEGLPGMLYQERGAWWYKAPQRQEGKDSNAVTYSKIAPLPTLPSLQENASLMDINGDGQLDWVVTASGI
RGYHSQQPDGKWTHFTPISALPVEYFHPSAQFADLIGAGLSDLVLIGPKSVRLYANQRDGWRKGENVPQSAGITLPIAGT
DARKLVAFSDMLGSGQQHLVEISANSVTCWPNLGHGRFGQPLTLPGFSHPENRFNPERLFLADIDGSGTADIIYAQSDSL
LIYLNQSGNQFDAPLTLALPEGVQFDNTCQLQVADIQGLGVASLILTVPHMTPHHWRCDLSLTKPWLLKVMNNNRGAHHT
LHYRSSAQFWLDEKLQLTKAGKPPICYLPFPMHLLWHTEIQDEISGNKLTSEVSYSHGVWDDKEREFRGFGCIKQTDTTT
FSHGTAPEQASPSLSVSWFATGIDAVDNQLSAEYWQGDTQACTEFKTRYTTWDSTSQTDKVFSPNDTQRNWLTRAVKGQL
LRNEVYSLDGTEKETIPYIVSESRYQVRFIPVDKETELSAWVSAIENRSYHYERIVSDPLLTQSIRLQQDIFGQLLQGVD
ISWPRREKPAENPYPSTLPDTLFDSSYDDQQQVLRLVRQKNNWHHLTDGENWRLGLPDAQRSDVYTHDRSKIPADGISLE
VLLEEDGLLADEKAAVYLGQQQTFYTAGQSEIPLEKPTLQALVAFYETAMMDDTSLQAYDGVIEQQALNTALTQAGYQQS
ARLFNTGSESPVWVARLGYTDYGDATQFWRPQAQRNSLLTGKTTLSWDTHHCVVTQTQDAAGLTTQAHYDYRFLTPIQLT
DINDNQHIVTLDALGRVITSRFWGTEAGQVTGYSTPADKPFTPPDSVDKALALTGQLPVAQCIVHAIDGWMPALSLTHLS
GTQEEAETQWKQLQAAQVVTEDGKIRTLSWQQAIDRQKLTVQMASLFADIPRLPPHTLTITTDRYDNDPQQLHQQTISFS
DGFGRLLQSSVRYESGDAWQRKEDGGLVVDTNGALVSAPTDTRWVVSGRTEYDDKGQPVRTYQPYFLNDWRYVSDDSARD
DLFADTHIYDPLGREYKVITAKKYLREKQYTPWFVVSEDENDTASRNL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
tcdB1 AAL18487.1 TcdB1 Virulence tcd island Protein 0.0 59
tcdB2 AAO17202.1 TcdB2 Virulence tcd island Protein 0.0 57
tcaC'(2) CAI77376.1 putative insecticidal toxin complex protein Not tested tc-PAIYe Protein 0.0 49