Gene Information

Name : TY21A_21175 (TY21A_21175)
Accession : YP_007928406.1
Strain : Salmonella enterica Ty21a
Genome accession: NC_021176
Putative virulence/resistance : Unknown
Product : large repetitive protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4312245 - 4322096 bp
Length : 9852 bp
Strand : +
Note : -

DNA sequence :
ATGGGAAATAAAAGCATACAAAAGTTTTTTGCCGATCAAAATTCTGTAATTGATTTATCTTCTTTGGGTAATGCCAAAGG
CGCAAAAGTTTCTCTTTCCGGGCCAGACATGAACATTACCACGCCGCATGGGTCAGTGATCATTGTCAATGGCGCTCTTT
ATTCAAGTATCAAAGGCAATAACCTCGCTGTTAAATTTAAAGATAAGACTATTACCGGCGCTAAAATTCTGGGCAGCGTA
GATTTAAAAGATATTCAACTGGAGAGAATTGACAGCTCATTAGTTGATTCTGCTCAGGTAGAAAAGAAAGGTAATGGCAA
ACGACGAAATAAGAAGGAAGAAGAGGAATTAAAAAAGCAGCTTGACGAGGCTGAAAACGCAAAGAAAGAAGCTGATAAGG
CGAAGGAAGAAGCAGAGAAAGCTAAGGAGGCAGCAGAAAAAACGCTCAATGAAGCGTTTGAAGTACAGAACTCGTCAAAG
CAAATTGAAGAAATGCTGCAGAACTTTTTGGCTGACAATGTAGCAAAAGACAATCTGGCTCAGCAAAGCGATGCTTCCCA
GCAAAATACACAGGCTAAAGCAACGCAGGCTTCTAAACAGAACGATGCTGAAAAAGTTCTTCCTCAACCTATTAATAAAA
ATACCAGTACTGGCAAAAGTAATAGCAGTAAAAATGAGGAAAATAAGCTCGATGCCGAGTCTGTTAAAGAGCCGCTTAAA
GTCACATTAGCGCTTGCGGCCGAGAGTAACAGCGGTAGCAAAGATGATAGTATAACTAATTTTACCAAACCTCAGTTTGT
AGGTAGCACTGCTCCCAATGCCACGGTTATTATTAAAATTAATGGTATTGCTGTCGGTCAGGCTGTAGCGGATAGTTTGG
GTAACTTCACCTTCACAGCGCCTGAAACATTGACTGATGGAACATACAATCTGGAGGCAGAGGCCAAGACTGCTGATGGA
AGCGGTAGCGCCAAACTTGTCATTACTATCGATTCCGTTACCGATAAGCCAACATTTGAACTTTCGCCTGAAAGTAGTGT
GTCCGGTCATAAGGGCTTAACGCCGACCTTGACGCCTTCAATTGTTGGTACGGCGGAAGAGAATGCTAAGGTTGACATTT
ATGTAGATAATAAACTGGTTGCCAGCGTTGATGTCGATAAAGATGGAAACTGGAGTTATGAATTTAAAGATAATGAATTA
TCTGAGGGCGAAAATAGTATAAAAGTCGTTGCCGTGGATAAAGCAGGTAATAAAAACGAAACGACGGATAGTATCATAAC
CGACACCATTCCTCCGGAAAAGCCGACGATTGAGCTGGATGATAGTAGTGATTCCGGCATTAAAAATGACAACATTACAA
ATAGCACTCTGCCAACATTTATTGGTGTGGCGGAACCTGGTTCTACAGTCTCTATTTATCTTGGGCTTAAACATCTTGGT
GAGGTCATTGTTGCTAAAGATGGGACATGGAGCTATACGCTTACTACGCCGCTCAAGGATGGCGAATACAATATAACAGC
AACCGCTACTGATATTGCCGGGCATACCTCTGCGACGGCAAATCTGCCTTTTACTATTGATACCCGTATCAGCTATTTCA
GCGCTGAGATTGAAACGACTGATGATAGCGGTATTGTTGGAGATAACGTTACTAACAATACTCGCCCAACCTTTACAGGT
AAAACTGAGCCAAATGCTATTATCAGTGTCATAAATAGTGAGACTGGCGAAGAGGTTATTTTTAAAGCGAATGACAAGGG
CGAATGGACGTTCAATTTCACTTCAGACTCAGTGGAAGGGGTTAACAATCTTACGTTCACTGTTGAAGATGTCGCTGGCA
ACAAAAAGGATTTTTCCTTTAGTTACGTTATCGATACTGTTGCCCCTGTACCTCCGACGGTTTCTTTGGAGGATTTTGTT
GTTTTACCGAATGGTATAATTTTATCAGGGAATGATTTACCGGCTTTAGTCGGTACGGCAGAGCCAAAGTCTACCATCTT
ATTGATGCGAGATGGTAAATTATATGACAGCATTGAGGTTGACTCAAACGGGACCTGGAATTATCAGTTTAGTAATAAAT
TTCTTCAGGGCGCCTATGATATTGAAATCATTTCTCAGGATGCCGCCGGTAATAAATCCTCTACTGTTAAATATTCTTTT
ACTATTCAAACTGAAGTTGTACCTCCAAAAGCGGAACTCGATGCCAGTGATGATTCCGGTGCAAAAGGCGACTGGATTAC
CAATAAACATAATGCTCTGACATTACTGGGAACAGCGGATAGGTTTGCTACCATAAATATCCTTATCGATGGTAAAACGA
TAGGCGTGACGACTGCGGATGCAGACGGTAACTGGAATTTTGATATTTCCCGAAATCTGTCTGACAATGTTTATAAGATT
ACGGTTGAATCTATCGATCCTTTAGGAAGAACGTCATCTGTAGATTATCAGCTTACCATTGATAGCTTTACGCCGATCCC
TACTGTTATGTTGCATGATAGCGCTGGCTCTGGCGTTAAAGGCGATATGATTACTAAAATTAATACGCCGTTGTTTACCG
GGATGGCTGAAGCTAATGCTAAGGTTTCCATCTATGTTGACGGTGTGTTAAGTGGCGAGGCTATTGCTGGCGATGATGGT
GTATGGAATTTTCAATTTACCACAGCGTTGTCCGATGGCTCGCATGACGTAACGGTAAAGGTAGAAGATATTGCCGGTAA
TACTGCCTCCTCATCAGCGTATAATTTCCAAATCGTAACGCAAACGCAAAAACCAACAATAGAGTTGGTCAACGATACGG
GGGTTGATAATACAGACCATATTATTAATGAAAAGAATCCTGCACTGACAGGGACCGCTGCACCCTATTCAACGGTTAAA
CTCTATGTTGATGGCGCACTAATCGCTGAGGTCAGAACAAATAAAGATAGCAGATGGGAGTATACCCTGAAAGCCGATCA
AGGTTTGGTTGATGGCGATCATAGAATAACCGCTTCAGTTGAAGATATCGCTGGCAACATTGCGCATTCGGATCCTTTCT
TAATTAGCGTTGATACTGCTATTTCAATACCGATAGTTTCATTGAGCCCGGATTCAGATTCGGGAATCGCAGATGATAAT
TTAACGAATATCGTTAACCCTACCTTGCACCTAAAAGATATTGATCCGGACATTATCAGCGTTCAGGTATGGGATGCCGC
GTCTGATACGCAGATCGGTGTTGCCACGCAACAACCTGATGGTTCATGGACCTATACCTTTACTTCAGATTTAACGGAAG
GCTTGCATCAGGTTTATGTCAAGGTTGAGGACATTGCGGGTAATAAAGCGAACAGCGCGGTATTCGATTTTACTATCGAT
ACCACAGTATCAACGCCGGTGATTTCCCTGCTTTCTAAGGATGATACGGGGGTGACAGGCGATAACCTGACCAATATCAA
TAAGCCAGGTTTTGCTATTTCCGGTGTTGATGCCGATGCGCATCGGGTCGTCGTACAGGTGATGCACAATGGCGTGAGCG
AAGAGATCGAACTTTCCCACCTCAATGGTAGTTGGTTATTTACACCAGGGAATACGTGGGCGGATGGCAGCTACACGTTA
ACGGTGAAAGTAGAAGATAAGGCAGGAAATACCAGCTATTCAGCGCCGCTGACGGTCGTTATCGATACCCAAATCGCCAT
TGATGGGGTGGAACTGGTCAACGATAGCGGCGTGAAAGGCGACAATATGACCAACGACGACCGTCCCCACTTTCGTGTGA
CGGTACCTACGGATGTCAATGAAGTCCGTCTGAGCATTGACGGTGGTAATTCGTGGGTTCAGGCAACCCCGGGCGTGGCA
GGAAGCTGGGAGTATATCTGGCCGACAGACCTGGCAGATGGTCAATACACGCTAACGGTGGAAGCGACTGATAAAGCAGG
CAATACAGTGACGAAGACCATCGATTTCGCGGTGGATACCACGCTGTCAGTGCCGGTCATCGTACTGAATAGCGCGGACG
ATACCGGTGTCCAGGGCGATAACATGACGAATCGCACCCAGCCGACATTTGCCCTGCAGCATATTGATGATGATGCCGTT
CGCGTTACGGTCAGCGTGGAGCATGGCGGCGTCACCACCACATTTGACGCCACGAAAGGCACAGGCGGATGGACCTTTAC
GCCGCCGACATCATGGGCGGATGGTGATTATACCCTGAGTGTGTCAGTCGAAGATAAAGCGGGGAACACCAGCCATTCTG
CATCGCTGACGGTGACGGTGGACACGCAAATCGCCATTAATAACATTGAACTGGTCAATGACAGCGGTATTCCCAACGAT
AATCTGACTAATAATGTGCGTCCACACTTCCAGGTGACGGTACCGACGGATGTCAACGTGGTGCGCCTGAGCATTGACGG
CGGCAAGACGTGGTTCAACGCGACCCAGAGCGCGACGCCGGGCGTCTGGGATTATATCTGGCCGGATGATGTGGCCGACG
GAGGCTATACCCTGACGGTAGAAGCGACCGATGAGGCGGGAAATAAGGCAACACAGACGCTCGATTTCACCATCGATACC
ACTCTGTCTGTGCCGACCCTCTCGCTGGACAGCGCAGATGACAGCGGCATCGCGGGCGATAATATCACCAGTGTTAAAAC
GCCGGGCTTTACCCTCAACAATATTGATACCGATGTCAGCCGGGTGATAGTGGAGGTAATGCACAATGGCATTAAACAGG
AGGTACCACTGGTTCAGACCGGCGGACAGTGGCGCTTTGCGCCGACCAGCGACTGGGCGGACGGCGGCTATATCCTGACG
GTGAAGGTAGAAGACAGGGCCGGAAATGTGAAGCAGTCCGCGCCGTTGACGGTGACAGTGGACACGCATATCGCCATTGA
CCGTATTGAACTGGTTAACGACAGCAGTATCCCCGACGATAATCTGACCAATGAAGCGCGCCCGCACTTTCAGGTGACAG
TACCGGCGGATGTTAACGGTGTAAGACTGAGCATTGATGGCGGCAAAACGTGGTTTGACGCCACGCAGAGCGCGACGTCG
GGCGTCTGGGATTATACCTGGCTGACGAATGTGGCTAACGGCCCTCACACCCTGATGGTGGAAGCGACCGACAAGGCGGG
AAACAAAACGACGCAGAAACTGGACTTCATCATCGATACCCTGCTGTCAGAACCGATTATTACCCTGGACAGTGCGGATG
ACAGCGCCGCTGGCGATAACATCACCAACGTTAAGATGCCAGGCTTTACCCTCGGTAATATCGACGCCGACGTGACCAAA
GTGGTGGTGACGGTGGCGCATGATGGTAAGAACCAACAGATAGAGTTGATTAAGAACGGCGGTGTGTGGCGCTTTACGCC
GGGCGCAGCCTGGACCGATGGCGACTATACGCTGACGGTAAAGGTAGAAGATAAGGCGGGTAATACAAATTATTCTGCGC
CGCTGACGGTGACTATCGATACGCAAACGTCTATTGATCGCATTGGGCTTCTTAATGACACGGGTATTGTCGGGGATAAC
CTGACCAATGAAGCACGTCCACAGTTTCATATTACGGTACCGACGGACGTGAACTCTGTGCAACTGAGTCTTGATGGCGG
CATCAACTGGGTTAACGCAACGCTGACGTCTGACGGCGTTTGGGAGTATATATGGCCGACAGATCTGGTCGAAAATACGT
ATACCCTGACAGTGAAAGCAACCGATGTTGCAGGCAACACGGCGACGGAAACGCTCAATTTTATCATTGATACCACATTG
TCGACACCGACCATCACGCTGGATAGCGCAGATGATAGCGGCACCGCCAACGATAATAAGACTAACGTTAAAACGCCGGG
TTTTATTATCGGCGGTATTGATTCTGACGTGACTCAGGTCGTCGTGCAGGTGATGCGCGATGGTCACAGCGAGGAGGTGG
AGCTGACGCAGACTAACGGGCAGTGGCGTTTTGTACCCGGCAGCGCGTGGACTGATGGCGACTATACGCTGACGGTAACG
GTGAAAGATGAGGCGGGTAATATTCGCCACTCAGCGCCGTTGACGGTCACCATCGATACGCAAATCACCATTGACCATAT
TGAACTGGTCAATGACAGCGGTATTCCCGACGATAATCTGACTAATAATGTGCGTCCGCACTTCCAGGTGACGGTACCGA
CGGATGTCAACGTGGTGCGCCTGAGCATTGACGGCGGCAAGACGTGGTTCAACGCGACCCAGAGCGCGACGCCGGGCGTC
TGGGATTATACCTGGCTGGCTGATGTGGGAGAGGGTAAGCATACCCTGACAGTGGAGGCGACCGACAAGGCGGGAAACAA
AACGACGCAGCAACTGGACTTCATCATCGATACCCTACTGTCAGAACCGACTATCGTGCTGGACAGCACGGACGACAGCG
GAACAAAAGGCGATAACCTGACCAACGTAAATAAGCCGACGTTTTTACTGGGCAATATTGACGCAGACGCGCGGTATGTC
ACAGTTGAGGTACAGCATGGCGGCACGAAAGAGGTGCTGACGGCCACCAAAGACGCGACCGGCAACTGGAGCGTGACACC
GATCGGCACATGGGCAGATGGCGACTATACGCTGACAGTGAGGGTGGAAGATGAGGCGGGGAACGAAAAACACTCAGCGT
CGCTGACGGTCACTGTTGATACCCAAATCACCATTGATGTTATTGAACTGGTTAATGATAACGGTATTCCCGGCGACAAT
ATGACTAACGACGCCCATCCGCAGTTCCGCGTGACGGTACCGGGGGATGTTAACGAAGTCAGTCTGAGCATTGACGGCGG
TGTAACCTGGGTTAAAGCGATGCAAAGCGCGACGCCGGGCGTCTGGAATTATACCTGGCCAAAGACAGTGGCAGATGGTG
ACTACACGTTAACGGTGAAAGCGACTGATAACGCAGGCAATACGGTGACCAGGACGCTCGACTTCACTATTGATACTACG
TTGTCGACGCCGGTTATCGTACTGGATAGCGCGGACGACAGTGGTGTCCATGGCGATAACATGACCAATCGCACCCAGCC
GACATTTGCCCTGCAGCATATTGATGATGATGCCGTTCGCGTTACGGTCAGCGTAGAGCATGGCGGCGTCACCACCACAT
TTGACGCCACGAAAGACGCAGGCGGATGGACCTTTACGCCGACAGGGGCGTGGGCGGATGGTGATTATACCCTGAGTGTG
TCAGTCGAAGATAAAGCGGGGAACACCAGCCATTCTGCATCGCTGACGGTGACGGTGGACACGCAAATCGCCATTAATAA
CATTGAACTGGTCAATGACAGCGGTATTCCCAACGATAATCTGACTAATAATGTGCGTCCACACTTCCAGGTGACGGTAC
CGACGGATGTCAACGTGGTGCGCCTGAGCATTGACGGCGGCAAGACGTGGTTCAACGCGACCCAGAGCGCGACGCCGGGT
GTCTGGGATTATACCTGGCTGGCTGATGTGGGAGAGGGGAAGCATACCCTGACAGTGGAGGCGACCGACAAGGCGGGAAA
CAAAACGACGCAGCAACTGGACTTCATCATCGATACCCTACTGTCAGAACCGACTATCGTGCTGGACAACACGGACGACA
GCGGAACAAAAGGCGATAACCTGACCAACGTAAATAAGCCGACGTTTTTACTGGGCAATATTGACGCAGACGCGCGGTAT
GTCACGGTTGAGGTGCAGCATGGCGGCACGAAAGAAGTGCTGACGGCCACCAAAGGCGCGACCGGCATCTGGAGCGTGAC
ACCGACCGGCACATGGGCAGATGGCGACTATACGCTGACGGTGAGGGTGGAGGATGATGCGGGGAACGTAAAATACTCAG
CGCCGCTGACGGTCACGGTTGATACCCAAATCACCATCGATGTTATTGAACTGGTTAATGATAACGGTATTCCCGGCGAC
AACCTGACCAATGATGTTCGTCCACACTTCCGCGTCACGGTGCCAGGGGATGTCAACGAGGTACGTCTGAGTATCGACGG
CGGTAATACGTGGGTTCGTGCAACACAGGGCACGGCAGGGATCTGGGATTACACCTGGCCGAAAGATGTGACCGACGGGC
TACATACCCTGACGGTAGAAGCGACCGATAAGGCGGGAAATAAGACGACGCAGACGCTCGATTTTACCATTGATACCCGG
CTGTCAACGCCTACCATCGCTATGGACAGCAGGGACGATACAGGTGCCATTGGCGATCATATTACGAGCGTCAAAAGACC
GGGCTTTACCATTGGCAATATTGACGCCGATGCGCACTCGGTCATTTTGCGGATCACACAGGGCGGCAATAGCCAGGAAG
TGACACTAACCCAGGTTGGAGGACAGTGGCGCTTTACGCCAGATGCTGACTGGGCGGACGGTAGCTATACGCTGACGGTA
GAGGTAACGGATAACGCAGGAAACGTTCGTCAGTCCACGCCGCTGGTGGTGACGGTGGACACGCAAACCAGCATTACTGA
TATTACATTGGTCAATGATCATGGCGTGCCTGATGACAATCTAACTAATAGCACCCGTCCGCAGTTTGAGATCACGGTGC
CGGCGGATGTGAATTCTGTGCAACTGAGCATTGATGGGGGCGCAAACTGGGTGAGCGCGACGCAGGGTATCGAAGGCGTC
TGGGGCTATACCTGGCCAACGGATATGGGCGATGGAAAACACACCCTAACCGTCATGGTCACCGACAGAGCGGGCAATAC
GGCGACGCAAACGCTTGAATTTTTCATCGACACCCGGTTGTCGACGCCGACCATTGCGCTGGATAGCACGGATGATACCG
GTACGCCTGGCGATGATATGACCAATCGCACCCGGCCGACTTTTATTCTGCAGAATATCGATTCGGATGTTATCAACGTT
ACAGTCAGCGTCACGCATAATGGAACGACAACCTCGTTTACCGCGACACAGGGGGCTGGAGGATGGAGCTTTACACCGCC
AGCGCCGTGGGGCGACGGTGATTATACGCTGACGGTGACAGTGGAGGATCGGGCGGGAAATACGCGTCCGTCTACGCCGC
TGACGGTGACAGTGGATACGCAAATAGCCATTGATCATATTGAATTAGTCAACGATAGCGGCGTCCCTGGCGATAATGTG
ACAAAACATGTGCGTCCGCAGTTCCAGATCTCGGTACCGGATGATGTGGAAAAGGTTCTTCTGAGTATTGACGGCGGCAC
GACCTGGGTTACTGCAATCAAGAGTTCGACGGTTGGCATTTGGGATTACACCTGGCCGACGGATATGCCAGAGGGACAGC
ATACCCTGATCGTGGAAGTGACTGACGGTGCGGGTAATAAGATGACGGGGACGCTCGATTTCACTATCGACATCACGTTG
TTGACGCCAACCATTGAGCTAGCGCCCGATCAGGATACCGGACAGAATAAGAACGATAATCTGACCAGCGTCACTCAGCC
GGTATTTGTGTTGGGGAGTATCGATAAAGATGTTCGACACGTGGAATTGAGTATTGAGCATAACGGCACGTTTAAAACGG
TGGTACTCACCGAATCAGCCGACGGCTGGCGCTATCGACCGGATTCTGCTTTGGCGGACGGTAGCTACACATTCACCGTG
ACGGTAACAGATGTGGCCGGTAATCAGCAAACATCCGCGCCTTTAAAGGTGACGATAGACGGTACGTTGACTACGCCGGT
GATTGAGCTGGCGGCCGGCGAAGATAGCGGTACTGTTGGCGATCGCCTCACCAATCACGATCGGCCTGTGTTCGACATAC
GCCAGATTGATTCTGACGTTACGCGCGTGATGGTCAAAGTAACTTACAACGGTAAAACGCACGAGGAAGCGGCGGTATTC
ACCAATGGTTAA

Protein sequence :
MGNKSIQKFFADQNSVIDLSSLGNAKGAKVSLSGPDMNITTPHGSVIIVNGALYSSIKGNNLAVKFKDKTITGAKILGSV
DLKDIQLERIDSSLVDSAQVEKKGNGKRRNKKEEEELKKQLDEAENAKKEADKAKEEAEKAKEAAEKTLNEAFEVQNSSK
QIEEMLQNFLADNVAKDNLAQQSDASQQNTQAKATQASKQNDAEKVLPQPINKNTSTGKSNSSKNEENKLDAESVKEPLK
VTLALAAESNSGSKDDSITNFTKPQFVGSTAPNATVIIKINGIAVGQAVADSLGNFTFTAPETLTDGTYNLEAEAKTADG
SGSAKLVITIDSVTDKPTFELSPESSVSGHKGLTPTLTPSIVGTAEENAKVDIYVDNKLVASVDVDKDGNWSYEFKDNEL
SEGENSIKVVAVDKAGNKNETTDSIITDTIPPEKPTIELDDSSDSGIKNDNITNSTLPTFIGVAEPGSTVSIYLGLKHLG
EVIVAKDGTWSYTLTTPLKDGEYNITATATDIAGHTSATANLPFTIDTRISYFSAEIETTDDSGIVGDNVTNNTRPTFTG
KTEPNAIISVINSETGEEVIFKANDKGEWTFNFTSDSVEGVNNLTFTVEDVAGNKKDFSFSYVIDTVAPVPPTVSLEDFV
VLPNGIILSGNDLPALVGTAEPKSTILLMRDGKLYDSIEVDSNGTWNYQFSNKFLQGAYDIEIISQDAAGNKSSTVKYSF
TIQTEVVPPKAELDASDDSGAKGDWITNKHNALTLLGTADRFATINILIDGKTIGVTTADADGNWNFDISRNLSDNVYKI
TVESIDPLGRTSSVDYQLTIDSFTPIPTVMLHDSAGSGVKGDMITKINTPLFTGMAEANAKVSIYVDGVLSGEAIAGDDG
VWNFQFTTALSDGSHDVTVKVEDIAGNTASSSAYNFQIVTQTQKPTIELVNDTGVDNTDHIINEKNPALTGTAAPYSTVK
LYVDGALIAEVRTNKDSRWEYTLKADQGLVDGDHRITASVEDIAGNIAHSDPFLISVDTAISIPIVSLSPDSDSGIADDN
LTNIVNPTLHLKDIDPDIISVQVWDAASDTQIGVATQQPDGSWTYTFTSDLTEGLHQVYVKVEDIAGNKANSAVFDFTID
TTVSTPVISLLSKDDTGVTGDNLTNINKPGFAISGVDADAHRVVVQVMHNGVSEEIELSHLNGSWLFTPGNTWADGSYTL
TVKVEDKAGNTSYSAPLTVVIDTQIAIDGVELVNDSGVKGDNMTNDDRPHFRVTVPTDVNEVRLSIDGGNSWVQATPGVA
GSWEYIWPTDLADGQYTLTVEATDKAGNTVTKTIDFAVDTTLSVPVIVLNSADDTGVQGDNMTNRTQPTFALQHIDDDAV
RVTVSVEHGGVTTTFDATKGTGGWTFTPPTSWADGDYTLSVSVEDKAGNTSHSASLTVTVDTQIAINNIELVNDSGIPND
NLTNNVRPHFQVTVPTDVNVVRLSIDGGKTWFNATQSATPGVWDYIWPDDVADGGYTLTVEATDEAGNKATQTLDFTIDT
TLSVPTLSLDSADDSGIAGDNITSVKTPGFTLNNIDTDVSRVIVEVMHNGIKQEVPLVQTGGQWRFAPTSDWADGGYILT
VKVEDRAGNVKQSAPLTVTVDTHIAIDRIELVNDSSIPDDNLTNEARPHFQVTVPADVNGVRLSIDGGKTWFDATQSATS
GVWDYTWLTNVANGPHTLMVEATDKAGNKTTQKLDFIIDTLLSEPIITLDSADDSAAGDNITNVKMPGFTLGNIDADVTK
VVVTVAHDGKNQQIELIKNGGVWRFTPGAAWTDGDYTLTVKVEDKAGNTNYSAPLTVTIDTQTSIDRIGLLNDTGIVGDN
LTNEARPQFHITVPTDVNSVQLSLDGGINWVNATLTSDGVWEYIWPTDLVENTYTLTVKATDVAGNTATETLNFIIDTTL
STPTITLDSADDSGTANDNKTNVKTPGFIIGGIDSDVTQVVVQVMRDGHSEEVELTQTNGQWRFVPGSAWTDGDYTLTVT
VKDEAGNIRHSAPLTVTIDTQITIDHIELVNDSGIPDDNLTNNVRPHFQVTVPTDVNVVRLSIDGGKTWFNATQSATPGV
WDYTWLADVGEGKHTLTVEATDKAGNKTTQQLDFIIDTLLSEPTIVLDSTDDSGTKGDNLTNVNKPTFLLGNIDADARYV
TVEVQHGGTKEVLTATKDATGNWSVTPIGTWADGDYTLTVRVEDEAGNEKHSASLTVTVDTQITIDVIELVNDNGIPGDN
MTNDAHPQFRVTVPGDVNEVSLSIDGGVTWVKAMQSATPGVWNYTWPKTVADGDYTLTVKATDNAGNTVTRTLDFTIDTT
LSTPVIVLDSADDSGVHGDNMTNRTQPTFALQHIDDDAVRVTVSVEHGGVTTTFDATKDAGGWTFTPTGAWADGDYTLSV
SVEDKAGNTSHSASLTVTVDTQIAINNIELVNDSGIPNDNLTNNVRPHFQVTVPTDVNVVRLSIDGGKTWFNATQSATPG
VWDYTWLADVGEGKHTLTVEATDKAGNKTTQQLDFIIDTLLSEPTIVLDNTDDSGTKGDNLTNVNKPTFLLGNIDADARY
VTVEVQHGGTKEVLTATKGATGIWSVTPTGTWADGDYTLTVRVEDDAGNVKYSAPLTVTVDTQITIDVIELVNDNGIPGD
NLTNDVRPHFRVTVPGDVNEVRLSIDGGNTWVRATQGTAGIWDYTWPKDVTDGLHTLTVEATDKAGNKTTQTLDFTIDTR
LSTPTIAMDSRDDTGAIGDHITSVKRPGFTIGNIDADAHSVILRITQGGNSQEVTLTQVGGQWRFTPDADWADGSYTLTV
EVTDNAGNVRQSTPLVVTVDTQTSITDITLVNDHGVPDDNLTNSTRPQFEITVPADVNSVQLSIDGGANWVSATQGIEGV
WGYTWPTDMGDGKHTLTVMVTDRAGNTATQTLEFFIDTRLSTPTIALDSTDDTGTPGDDMTNRTRPTFILQNIDSDVINV
TVSVTHNGTTTSFTATQGAGGWSFTPPAPWGDGDYTLTVTVEDRAGNTRPSTPLTVTVDTQIAIDHIELVNDSGVPGDNV
TKHVRPQFQISVPDDVEKVLLSIDGGTTWVTAIKSSTVGIWDYTWPTDMPEGQHTLIVEVTDGAGNKMTGTLDFTIDITL
LTPTIELAPDQDTGQNKNDNLTSVTQPVFVLGSIDKDVRHVELSIEHNGTFKTVVLTESADGWRYRPDSALADGSYTFTV
TVTDVAGNQQTSAPLKVTIDGTLTTPVIELAAGEDSGTVGDRLTNHDRPVFDIRQIDSDVTRVMVKVTYNGKTHEEAAVF
TNG

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY4458 NP_458558.1 large repetitive protein Not tested SPI-4 Protein 0.0 99