Gene Information

Name : YPA_3691 (YPA_3691)
Accession : YP_653598.1
Strain : Yersinia pestis Antiqua
Genome accession : NC_008150
Putative virulence/resistance : Virulence
Product : insecticidal toxin complex
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4140804 - 4145294 bp
Length : 4491 bp
Strand : -
Note : -
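
The Length field is consistent with the Position field if the coordinates are read as 1-based and inclusive. A minimal Python sketch of that arithmetic follows. The function names `gene_length` and `protein_length` are hypothetical, and the 1-based inclusive convention is an assumption that the record does not state explicitly.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Number of amino acids encoded by a CDS that includes its stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_bp // 3 - 1


# Coordinates taken from the Position field of this record.
start, end = 4140804, 4145294
length = gene_length(start, end)
print(length)                  # 4491
print(protein_length(length))  # 1496
```

Both values agree with the record: 4491 bp of DNA and a protein sequence of 1496 residues.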

DNA sequence :
ATGGAAAACAGTAAACAGCAAGTGGCCGTTGCTCCCTTGTCGCTCCCTAAAGGGGGCGGTGCGATTACCGGTATGGGCGA
TAGTTTAGGTCCTATCGGCCCCAGTGGCATGGCAACACTGACGCTGCCTCTGCCGATCTCCGCAGGCCGCGGTTACGCCC
CCTCGCTCACGCTAAGTTACAGTAGCGGCAGTGGTAACGGCCCGTTTGGCCTTGGCTGGCAACTCGGCACCATGGCGATT
CGCCGCCGAACCAACGCCCAAGTGCCACGTTACGATGAGTATGATGAATTTCTGGCTCCCAATGGGGAAGTCATGGTGGT
TGCCGCGGATCCGCAGGGCAGTATCGAACGCACTGAACAGTCACTAAATGGGGAACAATTCAGCGTAATTCGTTACTTAC
CACGTATTGAAGGCAATTTTCATCGCATTGAGTATTGGCGACCCCGGACAAATAACAGCCAGGCACCGTTTTGGTTAGTC
CACAGTTCAGATGGCCAAAAACACTGTTTGGGGTATTCCGCTGCCGCGCGAATTGCCGATCCACTGCACCCTGAGCATAT
TGCTGAATGGCTACTGGAAGAGTCGGTGTCTCTGAGCGGTGAGCATATCGGCTACCAGTATCAGGCAGAAGATGAACAAG
GCATTGATGAGCCTAGTATTTATAAGGCTGAAAAACAAAATCATCCAGCAGCCAGTGCGCAACGCTATCTGAAACGCGTG
GTTTATGGTAATCGCCAAGCGGCGTATGAACTCTATTGTCTGACTCAGCAACCTGCGCCGACAAGCTGGTTATTTAGCCT
GATTTTTGATCACGGCGAATACAGTAATATCGCGGAGCAAGTGCCGGTAATCATAAAAGGAAAATCCTGGAATTTCCGTC
AGGACGCCTTCTCACACTTTAACTATGGCTTTGAAGTGCGTACCCGCCGCTTGTGTCAGCAAGTGTTGATGTACCATAAT
TTGTCAGCACTTAAAGGTGATGAACCCGATGCTCAAGCCACGCTGGTTAGCCGTCTGCGTTTACACTATCAGCACGATGC
TTATGCCACCCAACTGGTCGGTTGCCAACAGTTAGCCCATGAACCCGATGGCACCAAACGTAGCCTGCCACCGCTAGAAT
TTGATTATCAGGATTTTTCCACTCGCGACGCCCTTGGTTGGCAACCACTTACTGATTGGGCTGAATTTAACTATCAGTAC
CAAATGGTGGATCTGAACGGCGAAGGGATGCCGGGGATGTTGTATCAGGACAGCGGTCACTGGATTTATCGCCCACCGGT
TCGCCAACCAGGCACCGCCGACGGCATCACTTTTGGCGCGGCCCAACGGTTACCGAGCCTACCCGCGATGCGCGAAAATG
CCATGTTGATGGATATCAATGGTGACGGCAAGCTGGATTGGGTGATCAGTCAACCGGGCTTAGCGGGCTACTTTAGCCGT
GATCCTGACCTTAGCCGTGATCCTGATCTTAGCTGGACCCAATTCATCCCGTTATCTACCTTGCCAGCGGAGTATTTCCA
TCCACAAGCGCAACTGGTCGATTTAGCCGGCAGCGGGTTATCTGATCTGGCGTTAGTCGGGCCAAAAAGTGTCCGCGTGT
ATACCAATCTGTGTGACAGTTTTGCTGCCGCAACCCAAGTAGCACAAGATGATGACATCACGCTACCACTACCCGGAGTT
CACTTCACTGAACTGGTGGCGTTCAGCGATGTGATGGGTTCGGGCCAACAACATCTGGTGCGAATACGCCACAATAGCGT
CACCTGCTGGCCCAATCTCGGCCATGGCCGCTTTGGTCACCCCCTCTCTTTACCGGGATTTAATCAGCCAGTTGAGCAAT
TTAATCCACTAGCGATCTATCTGGCAGATATTGATGGCTCCGGTACCATTGACCTTATTTATGCCACCACCAGCCAACTG
CTCATTTACCGCAACCAGAGTGGTAACCGCTTTGCCGAACCACTGGCTATCGCGTTGCCAACAGGCATTCGCTTTGATAA
TAGCTGCCAACTGTCACTGGCAGATATTCAGGGGCTGGGCGTGGCCAGCATTATGTTGAGTGTGCCGCACCCCACCACAC
AACATTGGCGCTATGATTTTGTCGCCAGTAAACCCTATTTACTGTGCACCACCAACAATAATATGGGCGCAGAGAGTCAG
TTGCTTTACCGCAGTTCTGTCCAATTCTGGTTGGATGAAAAAGCACAGGCGGCCAAACAGGGCCGATCACTGGCCTGCCA
ATTGCCCTTCCCGATCCATCTATTAGCGCAAACCACGCAATTTGATGAGATAACCGGTAACAGCCTGAGCCAAACCGCCC
GCTACTTCCATGGTTTTTATGATGGTGTGCAACGTGAATTCAGTGGTTTTGGCCGTGTCGATACGCTGGATACCGATACC
TCGGCACAAGGCAGTGCCGCTGAACGCACTGCGCCGACCAAAAGCAGTCGCTGGTTCCACACTGGCCGGGCAGGCAATGA
AACACTATGGCAAAGTGAGTATTGGCAGGGCGATGACCAAGCCTACTCATTGCTGCCTACCCGCCTGACAAAATTTATTA
ACAATACCCAAGGTGACGAATTACTCAGTGAACTCGATGATAATCAAACGTTTTGGTTGCACCGGGCGCTGAAAGGTTCA
TTGCTACGCAGCGAACTCTATGGTCTGGACGACAGCGAACTGGCGACACAACCGTATAGCGTTAACAGCTCTCGCTATCA
GGTGCGGCAAATTCAATCCTCTGCTGATGGAATCTCATCCCCGGTAGCCTTACCGATGGTGCTGGAACAACTCAGTTATC
ACTACGAACGTATCGCGCAAGACCCGCAATGCAGCCAGCAGATTGTGCTGCGTTGCAATGAATATGGTCACCCACTACAC
AGTGTGACAATCAATTATCCACGCCGCGATAAAGCCCGTATTTCCCCGTACTCCTGGCTGGCAAAGGAACATTGGGATAG
CCATTTTGACGAGCAACAGCAACAGTTGCGCATCACTGAAAGCCAGCAATCTTATCACCATGAGATCAGTGATAAATTCT
ATGTGCTGGGCCTGCCTGCCGGGCAGCGCAGTGATGTACTGACCTATCCCGACAATTTCGTGCCTACCGCGGGGATTCAT
TGGGAAGAATTACAGCAGCCAGAGGGCCTGCTTGGTACTAAAGCAGAGCGAACCTTTACCGCGCAGCAGCAGGTTTTCTA
CACCTCGGACACCATTCCGGGCCTGGTTGCCTACAGTCAACAGGCAGAATTTGACGATCAAACCTTGGTCGCATTGGATG
AATTATTACCAGCGAATGAGCGTAAACAGCAGCTTATCAAGGCCGGTTATCAAATAGCACCCCGTCTATTTGCTCGCACC
GGAGAAACGGATATTTGGGTCGCCCAAAGTGGGTTTACTGACTATGGTGATGCCTCCCGCTTCTATCGCCCCATCAGTCA
ACGTAGTACGCAATTAGTCGGGAAAACCATTTTGGAATGGGATGCCACCTGCTGTGCTGTCAGTGACATTATATTGGCCG
ATTACAGCATCACCCATGCTGAATACGATTATCGCTTTATCACCCCTTATCTGCTGATAGATATCAATGATAATCAACAT
TATATCGAGCTAGATGCGCTGGGGCGGGTTACCTCCAGCCGTTTTGCGGGCACCGAGATTGATCCACAAACCAATAAAGT
CATTGAAACCGGTTTTCCTTCCATCGCGGAACAGCCCTTTAGCGCCCCCAATTCAGTTGATAAAGCTCTCAGCCTGGAGA
ATACCCGCATTCCCGTCGCGCAGTTCTCTGTCTATCAACCTCAGAGTTGGATGATTTCATTACAGCTTGATGACATTGAA
ATATGGATTAGAGCCAATAACATTACTCCAGAATATCTATTCCAAAATCATATTCTGATCGATAACTATTATCTTTGCCC
CCTTGCGCTACGCCGCTGGGGAAGACAAAACAACCTTCTCATCACTGAAGGTGTTGGCCTGACATTGAAAAATCCCATGC
GCCAACCGCCTCATATATTAACCGTCGTCGTGGATAACTACTTTTCTGCTTCTGAACCACAACAGCATCAACAAACTCTC
GCTTTCAGTGACGGCTTTGGCCGCGTATTACTCAGCGCACGACGGGTGGAGACAGGACCCTCTTACTCATTCGATCCGGA
AAACGGCCTCTTAGTTGATGACAAAGGCAATCTGGTGCAACTCGAAGTCGATCAACGCTGGGCGGTCTCTGGCCGTACCG
AGTACGACAATAAAGGTCTGCCACGCCGCCGTTATCAACCTTATTTTTTCGACAACTGGATCTGGCTTTATATCGCCAAT
AACAGAACCCTCAAAGAGGCTTACGCCGATACCCATATTTACGACCCACTCGGGCGAGAAATAAAAGTGATCACCGCGAA
AGGTTATCTGCGACGAACTCACTATTTCCCGTGGTTTGTTATCAGTGAGGATGAAAATGACACGGCGTCAGAAATCACGC
CGAATCCCTAA

Protein sequence :
MENSKQQVAVAPLSLPKGGGAITGMGDSLGPIGPSGMATLTLPLPISAGRGYAPSLTLSYSSGSGNGPFGLGWQLGTMAI
RRRTNAQVPRYDEYDEFLAPNGEVMVVAADPQGSIERTEQSLNGEQFSVIRYLPRIEGNFHRIEYWRPRTNNSQAPFWLV
HSSDGQKHCLGYSAAARIADPLHPEHIAEWLLEESVSLSGEHIGYQYQAEDEQGIDEPSIYKAEKQNHPAASAQRYLKRV
VYGNRQAAYELYCLTQQPAPTSWLFSLIFDHGEYSNIAEQVPVIIKGKSWNFRQDAFSHFNYGFEVRTRRLCQQVLMYHN
LSALKGDEPDAQATLVSRLRLHYQHDAYATQLVGCQQLAHEPDGTKRSLPPLEFDYQDFSTRDALGWQPLTDWAEFNYQY
QMVDLNGEGMPGMLYQDSGHWIYRPPVRQPGTADGITFGAAQRLPSLPAMRENAMLMDINGDGKLDWVISQPGLAGYFSR
DPDLSRDPDLSWTQFIPLSTLPAEYFHPQAQLVDLAGSGLSDLALVGPKSVRVYTNLCDSFAAATQVAQDDDITLPLPGV
HFTELVAFSDVMGSGQQHLVRIRHNSVTCWPNLGHGRFGHPLSLPGFNQPVEQFNPLAIYLADIDGSGTIDLIYATTSQL
LIYRNQSGNRFAEPLAIALPTGIRFDNSCQLSLADIQGLGVASIMLSVPHPTTQHWRYDFVASKPYLLCTTNNNMGAESQ
LLYRSSVQFWLDEKAQAAKQGRSLACQLPFPIHLLAQTTQFDEITGNSLSQTARYFHGFYDGVQREFSGFGRVDTLDTDT
SAQGSAAERTAPTKSSRWFHTGRAGNETLWQSEYWQGDDQAYSLLPTRLTKFINNTQGDELLSELDDNQTFWLHRALKGS
LLRSELYGLDDSELATQPYSVNSSRYQVRQIQSSADGISSPVALPMVLEQLSYHYERIAQDPQCSQQIVLRCNEYGHPLH
SVTINYPRRDKARISPYSWLAKEHWDSHFDEQQQQLRITESQQSYHHEISDKFYVLGLPAGQRSDVLTYPDNFVPTAGIH
WEELQQPEGLLGTKAERTFTAQQQVFYTSDTIPGLVAYSQQAEFDDQTLVALDELLPANERKQQLIKAGYQIAPRLFART
GETDIWVAQSGFTDYGDASRFYRPISQRSTQLVGKTILEWDATCCAVSDIILADYSITHAEYDYRFITPYLLIDINDNQH
YIELDALGRVTSSRFAGTEIDPQTNKVIETGFPSIAEQPFSAPNSVDKALSLENTRIPVAQFSVYQPQSWMISLQLDDIE
IWIRANNITPEYLFQNHILIDNYYLCPLALRRWGRQNNLLITEGVGLTLKNPMRQPPHILTVVVDNYFSASEPQQHQQTL
AFSDGFGRVLLSARRVETGPSYSFDPENGLLVDDKGNLVQLEVDQRWAVSGRTEYDNKGLPRRRYQPYFFDNWIWLYIAN
NRTLKEAYADTHIYDPLGREIKVITAKGYLRRTHYFPWFVISEDENDTASEITPNP
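
The DNA sequence above begins with ATG and ends with TAA, so it appears to be listed 5' to 3' on the coding strand even though the gene lies on the minus strand of the genome. The sketch below translates the first 30 bases and the last 9 bases of that sequence and reproduces the start (MENSKQQVAV) and the end (NP) of the protein sequence. It assumes the standard codon assignments; the record does not state which translation table was used, and `CODON_TABLE` and `translate` are hypothetical names introduced only for this illustration.

```python
from itertools import product

# Standard codon assignments, listed with bases in the order T, C, A, G
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the DNA sequence in this record.
cds_start = "ATGGAAAACAGTAAACAGCAAGTGGCCGTT"
cds_end = "AATCCCTAA"
print(translate(cds_start))  # MENSKQQVAV
print(translate(cds_end))    # NP
```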

• Homologs from PAI DB

Gene     | GenBank Accn | Product                                     | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
tcaC'(2) | CAI77376.1   | putative insecticidal toxin complex protein | Not tested              | tc-PAIYe   | Protein        | 0.0   | 58
tcdB1    | AAL18487.1   | TcdB1                                       | Virulence               | tcd island | Protein        | 0.0   | 49
tcdB2    | AAO17202.1   | TcdB2                                       | Virulence               | tcd island | Protein        | 0.0   | 49