Gene Information

Name : TY21A_21885 (TY21A_21885)
Accession : YP_007928537.1
Strain : Salmonella enterica Ty21a
Genome accession: NC_021176
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4459894 - 4462674 bp
Length : 2781 bp
Strand : -
Note : COG5283 Phage-related tail protein

DNA sequence :
ATGAGTGATAATAACCTGCGACTGCAGGTAGTTCTGGGGGCGGTGGATAAGTTAACCCGCCCATTTAAAAATGCACAGGC
TGGCTCTAAGGAGCTGGCATCAGCTATTAGACAAACCCGCGATCAGATTAAAAAGCTGAGTGATGCTGGAGGTCAGCTTA
AATCTTTCGATCAGCTAACTCAAAGTGTTAGCCGTACTGGTGCCGAACTGGATCAGGCGAGGCTACGCGCTCAAATGATG
ACGCGCGAAATGTCTTCTTTGGAATCCCCGACAAAAAAACAAACGCAGGCGCTTGAAGCTCAGTGGCGTGCTGTTTCACG
TCTTGAACAAAAACAGCAACAGGAAACTCGCCAGATGGCGGCAGCCAGAGCTGAGCTTTATCGGCTGGGGTTATCTGCTG
GGGGCGGAGCGCGTGAGACGGCACGGATTGCACGAGAAACTGAGCGGTATAACCGACAGTTGGCTGAGCAGGAGCGCAGG
CTGCGTGAAGTTGGCGAGCGTCAGCGAAAGCTCAACGCCATCAAAGCCAAGGCTGAAAAGACCCGCGAGTTAAGGAACTC
TCTGGCAGGTAATGGTGCAGGGGCGATGGCGGCTGGGGTAACTACTGGCATGACGTTGCTGGCTCCAGTAAAAGCCTATT
CAGAATCAGAAAATGCAGCGAATCAACTCGCCGGTTCCATGATGGGGCCGGGCGGAAAGGTAGCGCCTGAATTTGAAAAA
ATTAACCGGCTTGCAGTTGCTTTGGGTGATAAGCTGCCGGGAACAACAGCCGACTTTCAGAACATGATGACTATGCTACG
CCGTCAGGGTATGTCGGCGCAGGTCATCCTGGGAGGCTTGGGAGAGTCAGCAGCTTATCTTGGCGTGCAGTTACAGATGG
CTCCCACTGCAGCAGCTGAGTTTGCGGCTAAGTTACAAGATGCTACTCAGACCTCCGAAAAAGACATGATGAATCTGATG
GACGTGATCCAGAAAGGATTCTACGCGGGGGTAGATTCAGGAAATATGCTGCAGGGTTTCTCAAAAATCAGCAGCGCGAT
GAATATCATCAATAAGAAGGGGTTGGAAGCGGTCAAAACTTTCGCGCCTTTGTTGGTTATGGCTGATCAGGGGAGTATGG
CTGGTGAGTCTGCCGGTAATGCATACCGAAAGATTTTTCAGGCCGCTCTGGATGCTGACAATATTAAGGCGGTTAACGAT
GACCTGAAAGAAAAGGGCGCGGGTATTAAATTCAACTTCTCTGACGGGAAGGGTGGGTTTGGTGGTCTGGAAAATATGTA
TGCCCAGCTGGAGAAACTTAAAAAATTAAATCCAGAAACCCAAATGGCAACCATGAAAGATTTGTTTGGCAATGATGCGG
AAACGCTACAAGCGCTGAACATTATGCTATCTAAAGGGATTGAAGGATATAGAGAAACTGCAGCAAAGCTTGAAAATCAG
GCATCGCTTAGGGAGCGTGTTGATGCATCGCTGAACACCCTGGGGAATAAATGGGAGGCTGCTACCGGGACATTCACTAA
TGCGATGGCGAGTATTGGTGAAACCGTAGCACCTGCATTAAAGAACCTCGCCGATTGGCTGGGGGAGTTAGCCTCTCGTT
TGGACGGTTTTGTTAAACGCCATCCCTCGCTTACGTCTGCCTTATTCAAGATGGCTGCAGGATTTGCCGTTGCTGCAACG
GCTGTAGGGGCTATCTCTTTGGCGCTGGCATCAATCCTTGGTCCGATGGCAATCATCAGAGTGAGCGCGGGCATGTTGGG
CCTGAAGTTTGCGTCGGTGGGTGGCTTGGTTCGTGCTGCGCTGGGAGGGCTCGGGAAATCAGTTTTATGGCTGGGCCGAT
TGATGTTTGCAAACCCTATACTGGCTGTCATAGGGCTGATCGCCGCTGGTGCTATTTATATCTGGCAGAACTGGGACACG
CTTGGGCCAAAGTTCAAGGCCATGTGGGATGCCGTATGTAATGCCACAGGTACGGCATGGGATTGGATTAAAGAAAAGGC
CAGCGCCGCGTGGGAGGGGATTAAGTCACTGTTCTTTAATTATACCTTGCCGGGATTAATAGCTAAAAATTGGGATGCAA
TAAAATCTGGCGTTTCTGAGGCGTGGGCCAATATCAGACAATCTATCAGTGATAAATGGAATGCGATCCTGGCTGATGTT
TCCGCGCTTCCTGCGAAGTTTCAGGACACGGGCAGCGCCATTATTGACAGCATTCTCGATGGAATTAATGCCAAATGGGA
GACACTCAAAAGCAAGCTTTCCTCAGTCACCGATTATCTGCCTGACTGGATGACTGAAAATAATAAAACACAAGACAAAG
CACAGGTGCAGGTGGTTGGTGGCGCAGCGGCTGCTGCTGTTCCGTTTGCCGGGATGTATGACAGCGGTGGGATTATTCCG
CGCGGTCAGTTCGGTATTGTTGGGGAGAACGGCCCTGAAATTGTGAACGGCCCCGCAAAAGTGACCAGTAGGCGGCGCAC
TGCCGCGTTGGCTTCCATCGTTGCAGGTGTCATGGGCGTAGCGGCAGCGCCTGCAGAGGCTGCTCCACTACATCCTTACA
GTCTGCCTACTGCGGCATATAAACAAAGCCAGCCTGCGAAATCTGCCAGCGCGCCGCCAGTGATGCACTTTGAAACTCAC
GCGCCGATCACTATCTATGCTCAGCCAGGGCAGAGTGCGCAGGATATTGCCCGTGAAGTTGCCCGACAGCTTGACGAACG
CGAGCGCAAGACCAGGGCTAAAGCACGCAGTAATTTCAGTGATCAAGGGGGATATGAATAA

Protein sequence :
MSDNNLRLQVVLGAVDKLTRPFKNAQAGSKELASAIRQTRDQIKKLSDAGGQLKSFDQLTQSVSRTGAELDQARLRAQMM
TREMSSLESPTKKQTQALEAQWRAVSRLEQKQQQETRQMAAARAELYRLGLSAGGGARETARIARETERYNRQLAEQERR
LREVGERQRKLNAIKAKAEKTRELRNSLAGNGAGAMAAGVTTGMTLLAPVKAYSESENAANQLAGSMMGPGGKVAPEFEK
INRLAVALGDKLPGTTADFQNMMTMLRRQGMSAQVILGGLGESAAYLGVQLQMAPTAAAEFAAKLQDATQTSEKDMMNLM
DVIQKGFYAGVDSGNMLQGFSKISSAMNIINKKGLEAVKTFAPLLVMADQGSMAGESAGNAYRKIFQAALDADNIKAVND
DLKEKGAGIKFNFSDGKGGFGGLENMYAQLEKLKKLNPETQMATMKDLFGNDAETLQALNIMLSKGIEGYRETAAKLENQ
ASLRERVDASLNTLGNKWEAATGTFTNAMASIGETVAPALKNLADWLGELASRLDGFVKRHPSLTSALFKMAAGFAVAAT
AVGAISLALASILGPMAIIRVSAGMLGLKFASVGGLVRAALGGLGKSVLWLGRLMFANPILAVIGLIAAGAIYIWQNWDT
LGPKFKAMWDAVCNATGTAWDWIKEKASAAWEGIKSLFFNYTLPGLIAKNWDAIKSGVSEAWANIRQSISDKWNAILADV
SALPAKFQDTGSAIIDSILDGINAKWETLKSKLSSVTDYLPDWMTENNKTQDKAQVQVVGGAAAAAVPFAGMYDSGGIIP
RGQFGIVGENGPEIVNGPAKVTSRRRTAALASIVAGVMGVAAAPAEAAPLHPYSLPTAAYKQSQPAKSASAPPVMHFETH
APITIYAQPGQSAQDIAREVARQLDERERKTRAKARSNFSDQGGYE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY4603 NP_458686.1 hypothetical protein Not tested SPI-7 Protein 0.0 100
t4297 NP_807894.1 hypothetical protein Not tested SPI-7 Protein 0.0 100