Gene Information

Name : YpsIP31758_0778 (YpsIP31758_0778)
Accession : YP_001399763.1
Strain : Yersinia pseudotuberculosis IP 31758
Genome accession: NC_009708
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3517
EC number : -
Position : 933399 - 934952 bp
Length : 1554 bp
Strand : +
Note : identified by similarity to GB:AAN81831.1; match to protein family HMM PF05943; match to protein family HMM TIGR03355
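
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the listed length; the function name `feature_length` is hypothetical and used only for illustration.

```python
# Coordinates copied from the Position field of this record.
# Assumption: 1-based, inclusive (start and end bases both counted).
start, end = 933399, 934952

def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1

length_bp = feature_length(start, end)
codons, remainder = divmod(length_bp, 3)

print(length_bp)          # 1554, matching the Length field
print(codons, remainder)  # 518 0, so the ORF is a whole number of codons
print(codons - 1)         # 517 residues if the final codon is a stop
```

A remainder of 0 indicates the open reading frame is complete, and 517 residues agrees with the protein sequence listed further down.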

DNA sequence :
ATGCTGATGTCTGTACAGGAAAACAGTGCGCAGGGTAGTGCGACGACCGTGCTAGAGAAGAACCCTGCCGTAGCTCAAGG
GGTTTATGCCTCTTTATTTGAAAAGATCAACCTGAATCCCGTCTCCTCATTGGCTGGCCTTGACGCTTTCCAGAACAGTG
ATGCACTGGCGGAAGCGACTACGGATGAGCGCGTTACCGCCGCCGTCAGTGTCTTTTTAGACCTGCTGAAGCAATCGTCA
AAAAAAGTGGAGAAGCTGGATAAAACCTTGCTGGATGGTCACATTGCGGCACTGGATGACCAAATCAGTCGCCAGTTGGA
TGCGGTGATGCATCACGCTGATTTTCAACGTGTCGAGTCCACGTGGCGTGGTGTAAAATCCCTTATTGATCAAACCGACT
TCCGTCAAAATGTCCGTATTGAACTGCTGGATATCAGCAAGGATCATCTGGTACAGGATTTTGAAGACGCCCCTGAAATC
GTGCAAAGCGGTCTCTACACCCAGACCTACATTCAGGAATACGATACTCCCGGTGGCGAGCCAATTGCCGCGGCTATCTC
CAACTACGAGTTTGACCGCAGCCCGCAAGATATCGCCTTGTTGCGCAATATCTCCAAGGTGGCGGCGGCGGCTCACATGC
CATTTATCGGTTCTGTTGGTCCTGAGTTCTTTGGCAAAGAGAACATGGAAGAAGTGGCGGCCATCAAAGATATCGCTAAC
TACTTTGACCGTGCCGAATACATCAAGTGGAAAGCTTTCCGCGACTCGGATGATTCCCGCTATATCGGCTTGACCATGCC
GCGCGTGCTGGGGCGTTTGCCGTACGGGCCAGACACTGTGCCAGTGCGTAGCTTCAACTACGTTGAGCAGGTGAAAGGGC
CAGACCATGATCGCTATTTGTGGACCAATGCCTCGTTTGCCTTTGCTGCCAACATGGTGAAAAGCTTCATTAAAAATGGC
TGGTGCGTACAGATCCGTGGGCCACAAGCCGGTGGTGCAGTGACCAATCTGCCAATCCACCTGTATGACTTGGGTACCGG
TAATCAGGTCAAAATCCCGTCAGAAGTGATGATCCCGGAAACGCGCGAATTTGAGTTTGCCAACCTGGGCTTTATCCCGC
TGTCATACTACAAAAACCGTGATTACTCGTGCTTCTTCTCGGCTAACTCCGCCCAGAAACCGGCGCTGTATGATACCGCC
GATGCCACCGCCAACAGTCGCATCAATGCCCGTCTGCCGTATATCTTCCTGCTGTCACGCATTGCCCATTACCTGAAGTT
GATTCAGCGTGAAAACATTGGCACCACCAAAGATCGTCGTTTGCTGGAGCTGGAGCTGAATAACTGGATCCGTGGGCTAG
TCACTGAAATGACTGATCCGGGTGATGATTTACAGGCATCCCATCCACTGCGTGATGCGAAAGTCACGGTAGAAGATATC
GAAGACAACCCCGGCTTCTTCCGCGTCAAGCTGTATGCCGTGCCACATTTCCAGGTGGAAGGCATGGATGTGAATTTGTC
ATTGGTTTCCCAGATGCCGAAGGCGAAAGCCTGA

Protein sequence :
MLMSVQENSAQGSATTVLEKNPAVAQGVYASLFEKINLNPVSSLAGLDAFQNSDALAEATTDERVTAAVSVFLDLLKQSS
KKVEKLDKTLLDGHIAALDDQISRQLDAVMHHADFQRVESTWRGVKSLIDQTDFRQNVRIELLDISKDHLVQDFEDAPEI
VQSGLYTQTYIQEYDTPGGEPIAAAISNYEFDRSPQDIALLRNISKVAAAAHMPFIGSVGPEFFGKENMEEVAAIKDIAN
YFDRAEYIKWKAFRDSDDSRYIGLTMPRVLGRLPYGPDTVPVRSFNYVEQVKGPDHDRYLWTNASFAFAANMVKSFIKNG
WCVQIRGPQAGGAVTNLPIHLYDLGTGNQVKIPSEVMIPETREFEFANLGFIPLSYYKNRDYSCFFSANSAQKPALYDTA
DATANSRINARLPYIFLLSRIAHYLKLIQRENIGTTKDRRLLELELNNWIRGLVTEMTDPGDDLQASHPLRDAKVTVEDI
EDNPGFFRVKLYAVPHFQVEGMDVNLSLVSQMPKAKA
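
The protein sequence above should be the translation of the DNA sequence on the + strand. A minimal sketch of that check follows, using only the Python standard library and the standard codon assignments (which the bacterial translation table shares). The helper name `translate` and the variables `head` and `tail` are hypothetical; `head` and `tail` are the first 30 and last 18 bases of the DNA sequence in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()   # drop line breaks and spaces
    usable = len(dna) - len(dna) % 3     # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

head = "ATGCTGATGTCTGTACAGGAAAACAGTGCG"  # first 30 bases of the gene
tail = "CCGAAGGCGAAAGCCTGA"              # last 18 bases of the gene

print(translate(head))  # MLMSVQENSA
print(translate(tail))  # PKAKA*
```

Both results agree with the start (MLMSVQENSA...) and end (...PKAKA) of the listed protein. Passing the full 1554-base sequence to `translate` would allow the whole product to be compared in the same way.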

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
aec18 | YP_851426.1 | hypothetical protein | Not tested | PAI II APEC-O1 | Protein | 5e-102 | 44
aec18 | AAQ96712.1 | Aec18 | Not tested | AGI-1 | Protein | 4e-102 | 44

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
YpsIP31758_0778 | YP_001399763.1 | hypothetical protein | VFG2093 | Protein | 5e-110 | 46
YpsIP31758_0778 | YP_001399763.1 | hypothetical protein | VFG2475 | Protein | 4e-112 | 46
YpsIP31758_0778 | YP_001399763.1 | hypothetical protein | VFG2070 | Protein | 2e-88 | 42