Gene Information

Name : YpAngola_A1189 (YpAngola_A1189)
Accession : YP_001605731.1
Strain : Yersinia pestis Angola
Genome accession: NC_010159
Putative virulence/resistance : Virulence
Product : RHS repeat family protein
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3209
EC number : -
Position : 1232691 - 1235549 bp
Length : 2859 bp
Strand : +
Note : identified by match to protein family HMM TIGR01643

DNA sequence :
ATGTCCACAAGTTTACCAACCCAGCTTTGCGCCAACACCCCCGCCCTCACCATCCACGATAACCGGGGGTTAGCTATTCG
TACACTGGCTTATAACCGCCGCGATCATAATGAAACCGTTGACGAACTGATCAGCCGCAACCGCTATAACGCCTCCGGTC
AGCTAATCGCCAGCCGTGATCCGCGCCTTGAGGTGGATAATTTCCGCTATCAATACAGCCTCAGCGGTGTTCCACTGCGC
ACCGACAGCGTCGATAGCGGCAGTACACTGCAACTGGCAGATAGTGCTGGCCGCACGGTGCTCACGCTCGATGCACACCA
CACCCGCCGCTGGGTGGAGTATGAGACCGGTGAACACAGTTTAGGCCGCCCGCTAAGTTACCACGAGCAAGCCAAAGGCG
GCCTGAAAACGGTTACCGACCGCTTTTTCTATGCCACAAACAGCGAGCAGGATAAAAACTGCAACCTGAACGGCCAGTGT
GTACGCCATTACGACAGCGCTGGTTTGCAGGCACTGATTAGCCAGTCGATTATTGGCGTACCACTGCAACAACAGCGCCG
TCTACTGACGAATCCCAAAGGCCCAGTTGACTGGTTTGGCGAAAAGGAAAACTGGGGCGCTCGCCTGAGCGAACAGCCGT
TTGTTAGCCATAGCACCACCGATGCTCTCGGCCAGTTACTCACGCAAACCGATGCCAAAGGCCATATCCAGCGCATGGCC
TATAACCGCGCCGGACAACTTATCGGTTCGTGGCTAACAATAAAAAATAGCGCTGAACAGGTGATCCTCCGTTCACTCAC
CTATTCGGCTGCTGGGCAAAAACTGCGCGAAGAGAGCGGCAACGGGGTGATTACCGAATACCGTTATGAACCCCAGACTC
AGCGCTTAATCGGCATTAAAACCACCCGTCCGGCGAAGAAAGACCGCCCGACCCGGTTACAAGACCTGCGTTACGATTAT
GACCCGGTCGGGAATATTCTCGCCATCCATAATGACGCCGAAACCACCCGCTTCTACCGTAATCAGAAAATCGTGCCGGA
AACCACTTACCGCTACGATGCACTGTATCAGCTTATCGAGGCCACTGGCCGTGAAGCGGATACCAACGGCATACAAAACA
GCCAGTTACCCGCGTTGGCGTCACTGAACGACAGCAACCAGTTCGTCAACTACACCCGCAGCTACCACTATGACCGCGCC
GGTAACCTGCTAAAAATTCAGCATACCGGTGCCAGCCAATACAGTACCCATATCACGGTGTCCGATTCGTCCAATCACGG
CATTCAGCAACAAGATGGCATCATCGCCCGTGATATTCGCTCCCAGTTTGATGCGGCGGGTAATCAGCAACAACTGCAAC
CCGGTCAGCCCCTGCGCTGGAACAGCCGCAATCAGTTACAGCAGGTGGAACCTGTGCCCCGCAACGATGGCATCAGTGAC
AGCGAAAGTTATCTCTATGATGGCAGCGGTAGGCGGGTGGCCAAAATCAGTCTCCATAAAACCCATAACGCCATCCAAAC
CCGTTCGGTCATTTATTTAGCGGGACTGGAACTGCGTGGCCAACATAATGACAATAATCTGACAGAAAGTTTTCAGGTGA
TAACCGTGGGTGCTGCGGGCCGTGCTCAGGTACGGGTATTACACTGGGAGAGCGGCCAACCCGTTGATATCGTCAATGAC
CAACTGCGTTACAGTTTCGATAATCACCTTGGCTCGGCGTTAATCGAATTAGACAGCGATGGCGATATTATCAGCCAGGA
AGAATATTACCCATTTGGCGGTACCGCGGTGTTAGCCTCCCGTAATACCGTGGAAGCCAAATATAAAACCGTTCGTTATT
CCGGTAAAGAGCGCGATGCCACCGGGCTGTATTATTATGGTTACCGTTATTACCAACCGTGGCTGGGCCGATGGTTAAGC
GCCGACCCCGCAGGCACTATAGACGGACTGAATTTATATCGGATGGTGAGGAATAACCCAATCAGATGGCGTGATAACAA
TGGGCTATTAACCGAAGAGCAAATTAATATGTACGTTAATTTGTTTAGTAATATTGGATTAAAAAATGATGATGAATTAA
AGAGTGAATTATTAAAATATGGTTTAAGTGAAGAAGAGCAAAACCAGATATACCTTAATATGTTAAGACCTATGCAGTCT
GGATCATCAAGCTCATTATTCTCCTTCCCTTCTGAAAGTAGTTCAAGTTCTGGGAGTACGCAAAGTGTTGATTCAGGTTA
TCTCAGTCCAGTAAGAAACTATCATTTTTTTGAAGATATTAAGTTAGCAACAATGCACCGTCCCTATCCAAAAAAACAAG
CCTCTAGTGACACAATAACATATTCAGCAGAAGATTTAACAGAAGCTAGCCCTATAAAAATTCTCATTGGTTTGGATTTG
ACCAGTAAAAACACCCAACCATATAAGTCAGCGCTTGCCGAAAAAGGAATTAAGTATATCACTAAAGAAAAATATGAAAT
AACAGACTTTTTTGAAGAAGGAGGATTATCGACTGAACAAATAGATTTAACAGTAAATAAAATATTAAAATTACAAAAAA
AGGATCTTGTAGGAATTCATTGTGGGGCAGGTAATGGAAGAAGTGGAGTTATTGCATCAGCATTATCCATTAATAAACAG
TATACAACAGATAAAATAAATAGTTTTGACGTAACTCATTCATTAAGAGGGTCAATACTTAAAGACACACAAACATACCA
AGTGGATACGGTAACCGCCAAGGCGGTTGGGATTATCAGAGAAATAAATCCTAAAGCAGTGGAACGTAATCAGGATGTTA
TTTCCCTATATAGATATTCTCATTTTTTATATACAAGAAAACACACTACATCATTATAA

Protein sequence :
MSTSLPTQLCANTPALTIHDNRGLAIRTLAYNRRDHNETVDELISRNRYNASGQLIASRDPRLEVDNFRYQYSLSGVPLR
TDSVDSGSTLQLADSAGRTVLTLDAHHTRRWVEYETGEHSLGRPLSYHEQAKGGLKTVTDRFFYATNSEQDKNCNLNGQC
VRHYDSAGLQALISQSIIGVPLQQQRRLLTNPKGPVDWFGEKENWGARLSEQPFVSHSTTDALGQLLTQTDAKGHIQRMA
YNRAGQLIGSWLTIKNSAEQVILRSLTYSAAGQKLREESGNGVITEYRYEPQTQRLIGIKTTRPAKKDRPTRLQDLRYDY
DPVGNILAIHNDAETTRFYRNQKIVPETTYRYDALYQLIEATGREADTNGIQNSQLPALASLNDSNQFVNYTRSYHYDRA
GNLLKIQHTGASQYSTHITVSDSSNHGIQQQDGIIARDIRSQFDAAGNQQQLQPGQPLRWNSRNQLQQVEPVPRNDGISD
SESYLYDGSGRRVAKISLHKTHNAIQTRSVIYLAGLELRGQHNDNNLTESFQVITVGAAGRAQVRVLHWESGQPVDIVND
QLRYSFDNHLGSALIELDSDGDIISQEEYYPFGGTAVLASRNTVEAKYKTVRYSGKERDATGLYYYGYRYYQPWLGRWLS
ADPAGTIDGLNLYRMVRNNPIRWRDNNGLLTEEQINMYVNLFSNIGLKNDDELKSELLKYGLSEEEQNQIYLNMLRPMQS
GSSSSLFSFPSESSSSSGSTQSVDSGYLSPVRNYHFFEDIKLATMHRPYPKKQASSDTITYSAEDLTEASPIKILIGLDL
TSKNTQPYKSALAEKGIKYITKEKYEITDFFEEGGLSTEQIDLTVNKILKLQKKDLVGIHCGAGNGRSGVIASALSINKQ
YTTDKINSFDVTHSLRGSILKDTQTYQVDTVTAKAVGIIREINPKAVERNQDVISLYRYSHFLYTRKHTTSL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
tccC CAI77380.1 putative insecticidal toxin complex protein Not tested tc-PAIYe Protein 0.0 67
TccC5 AAO17210.1 TccC5 Virulence tcd island Protein 2e-150 55
TccC3 AAO17204.1 TccC3 Virulence tcd island Protein 1e-152 55
TccC2 AAL18492.2 TccC2 Virulence tcd island Protein 1e-152 54
TccC4 AAO17196.1 TccC4 Virulence tcd island Protein 2e-152 54