Gene Information

Name : YpAngola_A1182 (YpAngola_A1182)
Accession : YP_001605725.1
Strain : Yersinia pestis Angola
Genome accession: NC_010159
Putative virulence/resistance : Virulence
Product : insecticidal toxin complex
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1222312 - 1226802 bp
Length : 4491 bp
Strand : +
Note : identified by match to protein family HMM PF03534

DNA sequence :
ATGGAAAACAGTAAACAGCAAGTGGCCGTTGCTCCCTTGTCGCTCCCTAAAGGGGGCGGTGCGATTACCGGTATGGGCGA
TAGTTTAGGTCCTATCGGCCCCAGTGGCATGGCAACACTGACGCTGCCTCTGCCGATCTCCGCAGGCCGCGGTTACGCCC
CCTCGCTCACGCTAAGTTACAGTAGCGGCAGTGGTAACGGCCCGTTTGGCCTTGGCTGGCAACTCGGCACCATGGCGATT
CGCCGCCGAACCAACGCCCAAGTGCCACGTTACGATGAGTATGATGAATTTCTGGCTCCCAATGGGGAAGTCATGGTGGT
TGCCGCGGATCCGCAGGGCAGTATCGAACGCACTGAACAGTCACTAAATGGGGAACAATTCAGCGTAATTCGTTACTTAC
CACGTATTGAAGGCAATTTTCATCGCATTGAGTATTGGCGACCCCGGACAAATAACAGCCAGGCACCGTTTTGGTTAGTC
CACAGTTCAGATGGCCAAAAACACTGTTTGGGGTATTCCGCTGCCGCGCGAATTGCCGATCCACTGCACCCTGAGCATAT
TGCTGAATGGCTACTGGAAGAGTCGGTGTCTCTGAGCGGTGAGCATATCGGCTACCAGTATCAGGCAGAAGATGAACAAG
GCATTGATGAGCCTAGTATTTATAAGGCTGAAAAACAAAATCATCCAGCAGCCAGTGCGCAACGCTATCTGAAACGCGTG
GTTTATGGTAATCGCCAAGCGGCGTATGAACTCTATTGTCTGACTCAGCAACCTGCGCCGACAAGCTGGTTATTTAGCCT
GATTTTTGATCACGGCGAATACAGTAATATCGCGGAGCAAGTGCCGGTAATCATAAAAGGAAAATCCTGGAATTTCCGTC
AGGACGCCTTCTCACACTTTAACTATGGCTTTGAAGTGCGTACCCGCCGCTTGTGTCAGCAAGTGTTGATGTACCATAAT
TTGTCAGCACTTAAAGGTGATGAACCCGATGCTCAAGCCACGCTGGTTAGCCGTCTGCGTTTACACTATCAGCACGATGC
TTATGCCACCCAACTGGTCGGTTGCCAACAGTTAGCCCATGAACCCGATGGCACCAAACGTAGCCTGCCACCGCTAGAAT
TTGATTATCAGGATTTTTCCACTCGCGACGCCCTTGGTTGGCAACCACTTACTGATTGGGCTGAATTTAACTATCAGTAC
CAAATGGTGGATCTGAACGGCGAAGGGATGCCGGGGATGTTGTATCAGGACAGCGGTCACTGGATTTATCGCCCACCGGT
TCGCCAACCAGGCACCGCCGACGGCATCACTTTTGGCGCGGCCCAACGGTTACCGAGCCTACCCGCGATGCGCGAAAATG
CCATGTTGATGGATATCAATGGTGACGGCAAGCTGGATTGGGTGATCAGTCAACCGGGCTTAGCGGGCTACTTTAGCCGT
GATCCTGACCTTAGCCGTGATCCTGATCTTAGCTGGACCCAATTCATCCCGTTATCTACCTTGCCAGCGGAGTATTTCCA
TCCACAAGCGCAACTGGTCGATTTAGCCGGCAGCGGGTTATCTGATCTGGCGTTAGTCGGGCCAAAAAGTGTCCGCGTGT
ATACCAATCTGTGTGACAGTTTTGCTGCCGCAACCCAAGTAGCACAAGATGATGACATCACGCTACCACTACCCGGAGTT
CACTTCACTGAACTGGTGGCGTTCAGCGATGTGATGGGTTCGGGCCAACAACATCTGGTGCGAATACGCCACAATAGCGT
CACCTGCTGGCCCAATCTCGGCCATGGCCGCTTTGGTCACCCCCTCTCTTTACCGGGATTTAATCAGCCAGTTGAGCAAT
TTAATCCACTAGCGATCTATCTGGCAGATATTGATGGCTCCGGTACCATTGACCTTATTTATGCCACCACCAGCCAACTG
CTCATTTACCGCAACCAGAGTGGTAACCGCTTTGCCGAACCACTGGCTATCGCGTTGCCAACAGGCATTCGCTTTGATAA
TAGCTGCCAACTGTCACTGGCAGATATTCAGGGGCTGGGCGTGGCCAGCATTATGTTGAGTGTGCCGCACCCCACCACAC
AACATTGGCGCTATGATTTTGTCGCCAGTAAACCCTATTTACTGTGCACCACCAACAATAATATGGGCGCAGAGAGTCAG
TTGCTTTACCGCAGTTCTGTCCAATTCTGGTTGGATGAAAAAGCACAGGCGGCCAAACAGGGCCGATCACTGGCCTGCCA
ATTGCCCTTCCCGATCCATCTATTAGCGCAAACCACGCAATTTGATGAGATAACCGGTAACAGCCTGAGCCAAACCGCCC
GCTACTTCCATGGTTTTTATGATGGTGTGCAACGTGAATTCAGTGGTTTTGGCCGTGTCGATACGCTGGATACCGATACC
TCGGCACAAGGCAGTGCCGCTGAACGCACTGCGCCGACCAAAAGCAGTCGCTGGTTCCACACTGGCCGGGCAGGCAATGA
AACACTATGGCAAAGTGAGTATTGGCAGGGCGATGACCAAGCCTACTCATTGCTGCCTACCCGCCTGACAAAATTTATTA
ACAATACCCAAGGTGACGAATTACTCAGTGAACTCGATGATAATCAAACGTTTTGGTTGCACCGGGCGCTGAAAGGTTCA
TTGCTACGCAGCGAACTCTATGGTCTGGACGACAGCGAACTGGCGACACAACCGTATAGCGTTAACAGCTCTCGCTATCA
GGTGCGGCAAATTCAATCCTCTGCTGATGGAATCTCATCCCCGGTAGCCTTACCGATGGTGCTGGAACAACTCAGTTATC
ACTACGAACGTATCGCGCAAGACCCGCAATGCAGCCAGCAGATTGTGCTGCGTTGCAATGAATATGGTCACCCACTACAC
AGTGTGACAATCAATTATCCACGCCGCGATAAAGCCCGTATTTCCCCGTACTCCTGGCTGGCAAAGGAACATTGGGATAG
CCATTTTGACGAGCAACAGCAACAGTTGCGCATCACTGAAAGCCAGCAATCTTATCACCATGAGATCAGTGATAAATTCT
ATGTGCTGGGCCTGCCTGCCGGGCAGCGCAGTGATGTACTGACCTATCCCGACAATTTCGTGCCTACCGCGGGGATTCAT
TGGGAAGAATTACAGCAGCCAGAGGGCCTGCTTGGTACTAAAGCAGAGCGAACCTTTACCGCGCAGCAGCAGGTTTTCTA
CACCTCGGACACCATTCCGGGCCTGGTTGCCTACAGTCAACAGGCAGAATTTGACGATCAAACCTTGGTCGCATTGGATG
AATTATTACCAGCGAATGAGCGTAAACAGCAGCTTATCAAGGCCGGTTATCAAATAGCACCCCGTCTATTTGCTCGCACC
GGAGAAACGGATATTTGGGTCGCCCAAAGTGGGTTTACTGACTATGGTGATGCCTCCCGCTTCTATCGCCCCATCAGTCA
ACGTAGTACGCAATTAGTCGGGAAAACCATTTTGGAATGGGATGCCACCTGCTGTGCTGTCAGTGACATTATATTGGCCG
ATTACAGCATCACCCATGCTGAATACGATTATCGCTTTATCACCCCTTATCTGCTGATAGATATCAATGATAATCAACAT
TATATCGAGCTAGATGCGCTGGGGCGGGTTACCTCCAGCCGTTTTGCGGGCACCGAGATTGATCCACAAACCAATAAAGT
CATTGAAACCGGTTTTCCTTCCATCGCGGAACAGCCCTTTAGCGCCCCCAATTCAGTTGATAAAGCTCTCAGCCTGGAGA
ATACCCGCATTCCCGTCGCGCAGTTCTCTGTCTATCAACCTCAGAGTTGGATGATTTCATTACAGCTTGATGACATTGAA
ATATGGATTAGAGCCAATAACATTACTCCAGAATATCTATTCCAAAATCATATTCTGATCGATAACTATTATCTTTGCCC
CCTTGCGCTACGCCGCTGGGGAAGACAAAACAACCTTCTCATCACTGAAGGTGTTGGCCTGACATTGAAAAATCCCATGC
GCCAACCGCCTCATATATTAACCGTCGTCGTGGATAACTACTTTTCTGCTTCTGAACCACAACAGCATCAACAAACTCTC
GCTTTCAGTGACGGCTTTGGCCGCGTATTACTCAGCGCACGACGGGTGGAGACAGGACCCTCTTACTCATTCGATCCGGA
AAACGGCCTCTTAGTTGATGACAAAGGCAATCTGGTGCAACTCGAAGTCGATCAACGCTGGGCGGTCTCTGGCCGTACCG
AGTACGACAATAAAGGTCTGCCACGCCGCCGTTATCAACCTTATTTTTTCGACAACTGGATCTGGCTTTATATCGCCAAT
AACAGAACCCTCAAAGAGGCTTACGCCGATACCCATATTTACGACCCACTCGGGCGAGAAATAAAAGTGATCACCGCGAA
AGGTTATCTGCGACGAACTCACTATTTCCCGTGGTTTGTTATCAGTGAGGATGAAAATGACACGGCGTCAGAAATCACGC
CGAATCCCTAA

Protein sequence :
MENSKQQVAVAPLSLPKGGGAITGMGDSLGPIGPSGMATLTLPLPISAGRGYAPSLTLSYSSGSGNGPFGLGWQLGTMAI
RRRTNAQVPRYDEYDEFLAPNGEVMVVAADPQGSIERTEQSLNGEQFSVIRYLPRIEGNFHRIEYWRPRTNNSQAPFWLV
HSSDGQKHCLGYSAAARIADPLHPEHIAEWLLEESVSLSGEHIGYQYQAEDEQGIDEPSIYKAEKQNHPAASAQRYLKRV
VYGNRQAAYELYCLTQQPAPTSWLFSLIFDHGEYSNIAEQVPVIIKGKSWNFRQDAFSHFNYGFEVRTRRLCQQVLMYHN
LSALKGDEPDAQATLVSRLRLHYQHDAYATQLVGCQQLAHEPDGTKRSLPPLEFDYQDFSTRDALGWQPLTDWAEFNYQY
QMVDLNGEGMPGMLYQDSGHWIYRPPVRQPGTADGITFGAAQRLPSLPAMRENAMLMDINGDGKLDWVISQPGLAGYFSR
DPDLSRDPDLSWTQFIPLSTLPAEYFHPQAQLVDLAGSGLSDLALVGPKSVRVYTNLCDSFAAATQVAQDDDITLPLPGV
HFTELVAFSDVMGSGQQHLVRIRHNSVTCWPNLGHGRFGHPLSLPGFNQPVEQFNPLAIYLADIDGSGTIDLIYATTSQL
LIYRNQSGNRFAEPLAIALPTGIRFDNSCQLSLADIQGLGVASIMLSVPHPTTQHWRYDFVASKPYLLCTTNNNMGAESQ
LLYRSSVQFWLDEKAQAAKQGRSLACQLPFPIHLLAQTTQFDEITGNSLSQTARYFHGFYDGVQREFSGFGRVDTLDTDT
SAQGSAAERTAPTKSSRWFHTGRAGNETLWQSEYWQGDDQAYSLLPTRLTKFINNTQGDELLSELDDNQTFWLHRALKGS
LLRSELYGLDDSELATQPYSVNSSRYQVRQIQSSADGISSPVALPMVLEQLSYHYERIAQDPQCSQQIVLRCNEYGHPLH
SVTINYPRRDKARISPYSWLAKEHWDSHFDEQQQQLRITESQQSYHHEISDKFYVLGLPAGQRSDVLTYPDNFVPTAGIH
WEELQQPEGLLGTKAERTFTAQQQVFYTSDTIPGLVAYSQQAEFDDQTLVALDELLPANERKQQLIKAGYQIAPRLFART
GETDIWVAQSGFTDYGDASRFYRPISQRSTQLVGKTILEWDATCCAVSDIILADYSITHAEYDYRFITPYLLIDINDNQH
YIELDALGRVTSSRFAGTEIDPQTNKVIETGFPSIAEQPFSAPNSVDKALSLENTRIPVAQFSVYQPQSWMISLQLDDIE
IWIRANNITPEYLFQNHILIDNYYLCPLALRRWGRQNNLLITEGVGLTLKNPMRQPPHILTVVVDNYFSASEPQQHQQTL
AFSDGFGRVLLSARRVETGPSYSFDPENGLLVDDKGNLVQLEVDQRWAVSGRTEYDNKGLPRRRYQPYFFDNWIWLYIAN
NRTLKEAYADTHIYDPLGREIKVITAKGYLRRTHYFPWFVISEDENDTASEITPNP

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
tcaC'(2) CAI77376.1 putative insecticidal toxin complex protein Not tested tc-PAIYe Protein 0.0 58
tcdB1 AAL18487.1 TcdB1 Virulence tcd island Protein 0.0 49
tcdB2 AAO17202.1 TcdB2 Virulence tcd island Protein 0.0 49