Gene Information

Name : SARI_02733 (SARI_02733)
Accession : YP_001571731.1
Strain : Salmonella enterica RSK2980
Genome accession: NC_010067
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3517
EC number : -
Position : 2647477 - 2649024 bp
Length : 1548 bp
Strand : -
Note : 'KEGG: bpm:BURPS1710b_A0534 1.0e-115 hypothetical protein K00356; COG: COG3517 Uncharacterized protein conserved in bacteria; Psort location: cytoplasmic, score: 23'
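
The Position and Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the stated length of 1548 bp; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the Position field of this record.
start, end = 2647477, 2649024

# Assumption: 1-based inclusive coordinates, so both endpoints count.
length_bp = end - start + 1

# A complete coding sequence should be a whole number of codons.
codons, remainder = divmod(length_bp, 3)

# The final codon is the stop codon and is not translated.
residues = codons - 1

print(length_bp, codons, remainder, residues)
```

Under that assumption the computed length matches the Length field, and the residue count matches the number of amino acids in the protein sequence listed in this record.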

DNA sequence :
ATGCTGATGTCTGTAAAAAATGAGAATGCGGCCGGTGGCGAGAGTGTGGTGCTGGAACGTCCTGCTGCAGGTGGCGTTTA
TGCGTCCCTGTTTGAAAAAATCAACCTGAACCCGGTCTCTGAGCTGAGTGCGCTGGATATCTGGCAGGACGCGCAGGCGA
TGTCGGATGCGACTGCGGATGAACGTTTAACGGCCGGCATGCAGGTGTTCCTGGAGTGCCTGACGAAAGCGGGCTCAAAG
GTTGAAAAACTCGACAAAAACCTGATTGACCACCATATTGCTGAGCTGGACTACCAGATTAGCCGTCAGCTGGATGCGGT
CATGCACAGCGAAGAATTCCAGGCGGTAGAGAGTCTGTGGCGCGGCGTTAAATCGCTGGTCGACAAAACCGATTTTCGCC
AGAACGTGAAGGTTGAACTGCTGGATATGTCCAAAGAAGACCTGCGTCAGGACTTCGAAGACAGCCCGGAAATCATCCAG
AGTGGACTGTATAAACATACCTATATTGATGAATATGACACCCCGGGCGGTGAGCCGATTGCCGCGCTGATTTCTGCTTA
CGAGTTTGATGCCTCTGCGCAGGATGTTGCCCTGCTGCGTAATATATCAAAAGTGGCCGCTGCTGCCCATATGCCGTTTA
TCGGTTCTGCGGGTCCGGCGTTCTTTTTGAAAAATTCGATGGAAGAAGTCGCCGCCATTAAAGATATCGGTAACTACTTT
GACCGCGCTGAATATATCAAGTGGAAATCCTTCCGCGAGACGGACGATGCCCGCTATATCGGTCTGGTGATGCCGCGTGT
GCTGGGCCGCCTGCCGTATGGCCCGGACACCGTGCCCGTACGCAGCTTCAACTACGTCGAAGAAGTGAAGGGCCCGGATC
ACGACAAATACCTGTGGACTAACGCCTCGTTTGCCTTTGCATCCAACATGGTACGCAGTTTCATCAACAACGGCTGGTGT
GTGCAGATCCGTGGCCCGCAGGCCGGTGGTGCCGTTCAGGATTTGCCTATCCATCTGTACGATCTGGGCACCGGCAATCA
GGTCAAGATCCCGTCAGAAGTGATGATCCCGGAAACCCGTGAATTTGAATTTGCGAATCTCGGCTTCATTCCGCTGTCCT
ACTACAAGAACCGCGATTTTGCCGCATTCTTCAGTGCAAACTCCACGCAGAAGCCAGCTATCTACGATACCGCTGATGCC
ACAGCAAACAGCCGCATCAATGCCCGTTTGCCGTACATTTTCCTGCTGTCGCGTATTGCGCACTACCTGAAGCTGATTCA
GCGTGAAAACATTGGTACCACCAAGGACCGTCGTTTGCTGGAGCTGGAACTCAATACCTGGGTGAAAAGTCTGGTCACCG
AAATGACCGATCCGGGCGATGAACTGCAGGCGTCACACCCGCTGCGCGATGCCAAAGTGGTGGTGGAAGACATCGAAGAC
AACCCGGGCTTCTTCCGCGTGAAGCTGTATGCCGTGCCGCACTTCCAGGTGGAAGGGATGGACGTCAACCTGTCACTGGT
TTCCCAGATGCCGAAAGCCAAATCCTAA

Protein sequence :
MLMSVKNENAAGGESVVLERPAAGGVYASLFEKINLNPVSELSALDIWQDAQAMSDATADERLTAGMQVFLECLTKAGSK
VEKLDKNLIDHHIAELDYQISRQLDAVMHSEEFQAVESLWRGVKSLVDKTDFRQNVKVELLDMSKEDLRQDFEDSPEIIQ
SGLYKHTYIDEYDTPGGEPIAALISAYEFDASAQDVALLRNISKVAAAAHMPFIGSAGPAFFLKNSMEEVAAIKDIGNYF
DRAEYIKWKSFRETDDARYIGLVMPRVLGRLPYGPDTVPVRSFNYVEEVKGPDHDKYLWTNASFAFASNMVRSFINNGWC
VQIRGPQAGGAVQDLPIHLYDLGTGNQVKIPSEVMIPETREFEFANLGFIPLSYYKNRDFAAFFSANSTQKPAIYDTADA
TANSRINARLPYIFLLSRIAHYLKLIQRENIGTTKDRRLLELELNTWVKSLVTEMTDPGDELQASHPLRDAKVVVEDIED
NPGFFRVKLYAVPHFQVEGMDVNLSLVSQMPKAKS
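
The relationship between the DNA and protein sequences above can be checked with a short translation sketch. It assumes the DNA sequence is already given in coding orientation (it begins with ATG even though the gene is on the minus strand) and uses the standard codon assignments, which the bacterial table shares for all sense codons. The function name `translate` is hypothetical, and only the first 30 and last 12 nucleotides of the record are used as input.

```python
# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt and last 12 nt of the DNA sequence in this record.
print(translate("ATGCTGATGTCTGTAAAAAATGAGAATGCG"))  # MLMSVKNENA
print(translate("GCCAAATCCTAA"))                    # AKS
```

Both results agree with the start (MLMSVKNENA) and end (AKS) of the protein sequence listed above.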

• Homologs from PAI DB

Gene    GenBank Accn  Product               Virulence or Resistance  PAI or REI      Alignment Type  E-val   Identity (%)
aec18   YP_851426.1   hypothetical protein  Not tested               PAI II APEC-O1  Protein         9e-100  44
aec18   AAQ96712.1    Aec18                 Not tested               AGI-1           Protein         7e-100  44
HH0247  NP_859778.1   hypothetical protein  Not tested               HHGI1           Protein         2e-96   41

• Homologs from VFDB (virulence genes)

Gene        GenBank Accn    Product               ID of source DB  Alignment Type  E-val   Identity (%)
SARI_02733  YP_001571731.1  hypothetical protein  VFG2475          Protein         4e-111  48
SARI_02733  YP_001571731.1  hypothetical protein  VFG2093          Protein         9e-109  47
SARI_02733  YP_001571731.1  hypothetical protein  VFG2070          Protein         5e-88   44