Gene Information

Name : y3368 (y3368)
Accession : NP_670667.1
Strain : Yersinia pestis KIM
Genome accession: NC_004088
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3517
EC number : -
Position : 3712431 - 3713975 bp
Length : 1545 bp
Strand : -
Note : residues 68 to 512 of 514 are 44.78 pct identical to residues 43 to 492 of 492 from GenPept : >gb|AAF96022.1| (AE004353) conserved hypothetical protein [Vibrio cholerae]

DNA sequence :
ATGCTGATGTCTGTACAGGAAAACCGTGCGCAGGGTACTGCGACGACCGTATTAAAGAATAGCCCTGCGGCACAAGGGGT
TTACGCCGCCTTGTTTGAAAAAATCAACCTGAGCCCGGTCTCCTCACTGGCCGGTATCGAGGCGTTCCAGAACAACGATG
CGCTGGCGGAAGCCACCACCGATGAGCGCGTTACCGCCGCCGTCAGTGTCTTTTTAGACCTGCTGAAGCAGTCGGCGAAG
AAAGTAGAAAAACTGGATAAAACCCTGCTGGACGGCCATATTGCCGCACTGGATGACCAAATCAGCCGCCAGTTGGACGC
GGTAATGCACCACCCCGATTTCCAGCGGGTGGAATCGACCTGGCGTGGTGTGAAGTCGCTGATCGATCAGACCGATTTCC
GCCAGAACGTGCGCATCGAGCTGCTGGATATCAGTAAAGATCATCTGGTGCAGGATTTTGAAGATGCCCCGGAAATCTCA
CAAAGCGGTTTATACGCCCAGACCTATATTCAGGAATACGACACCCCCGGCGGCGAGCCGATTGCCGCGGCTATCTCCAA
CTACGAGTTCGACCGTAGCCCGCAAGATATCGCGTTATTGCGCAATATCTCCAAGGTCGCGGCGGCAGCCCATATGCCGT
TTATCGGTTCCGTTGGCCCCGAGTTCTTTGGTAAAGAGAACATGGAAGACGTGGCAGCCATCAAAGATATCGCCAACTAC
TTTGACCGTGCTGAATACATCAAGTGGAAAGCCTTCCGCGACTCGGATGATTCCCGCTATATCGGCCTGACCATGCCGCG
CGTGCTGGGGCGTTTGCCTTACGGCCCGGACACGGTGCCGGTACGCAGCTTCAACTACGTTGAGCAGGTGAAAGGGCCGG
ATCATGATCGCTACTTGTGGACCAATGCCTCATTTGCCTTTGCTGCTAACATGGTGAAAAGCTTCATTAAAAATGGCTGG
TGTGTGCAGATCCGGGGCCCGCAGGCCGGTGGTGCGGTCACCAACTTACCGATCCACCTGTACGATCTGGGTACCGGTAA
TCAGGTCAAAATTCCGTCAGAGGTGATGATCCCGGAAACCCGCGAGTTCGAGTTTGCCAATCTGGGCTTTATCCCGCTGT
CGTACTACAAAAACCGCGATTACTCGTGCTTCTTCTCGGCCAACTCGGCGCAGAAACCGGCGCTGTATGACACCGCGGAT
GCCACGGCCAACAGCCGCATCAACGCCCGTTTGCCGTATATCTTCCTGCTGTCGCGCATTGCCCATTATCTGAAACTGAT
CCAGCGCGAGAACATCGGCACCACCAAAGATCGCCGTCTGCTGGAGCTGAACAACTGGGTGCGCGGGCTGGTGACGGAAA
TGACTGATCCGGGTGATGATTTGCAGGCGTCCCATCCCCTGCGTGATGCCAAAGTGACGGTCGAAGATATCGAAGACAAC
CCCGGCTTCTTCCGCGTCAAGCTGTATGCGGTGCCGCATTTCCAGGTGGAAGGGATGGACGTCAATCTGTCATTGGTTTC
CCAAATGCCAAAGGCTAAGGCATAA

Protein sequence :
MLMSVQENRAQGTATTVLKNSPAAQGVYAALFEKINLSPVSSLAGIEAFQNNDALAEATTDERVTAAVSVFLDLLKQSAK
KVEKLDKTLLDGHIAALDDQISRQLDAVMHHPDFQRVESTWRGVKSLIDQTDFRQNVRIELLDISKDHLVQDFEDAPEIS
QSGLYAQTYIQEYDTPGGEPIAAAISNYEFDRSPQDIALLRNISKVAAAAHMPFIGSVGPEFFGKENMEDVAAIKDIANY
FDRAEYIKWKAFRDSDDSRYIGLTMPRVLGRLPYGPDTVPVRSFNYVEQVKGPDHDRYLWTNASFAFAANMVKSFIKNGW
CVQIRGPQAGGAVTNLPIHLYDLGTGNQVKIPSEVMIPETREFEFANLGFIPLSYYKNRDYSCFFSANSAQKPALYDTAD
ATANSRINARLPYIFLLSRIAHYLKLIQRENIGTTKDRRLLELNNWVRGLVTEMTDPGDDLQASHPLRDAKVTVEDIEDN
PGFFRVKLYAVPHFQVEGMDVNLSLVSQMPKAKA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec18 AAQ96712.1 Aec18 Not tested AGI-1 Protein 3e-101 44
aec18 YP_851426.1 hypothetical protein Not tested PAI II APEC-O1 Protein 4e-101 44

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
y3368 NP_670667.1 hypothetical protein VFG2093 Protein 2e-108 45
y3368 NP_670667.1 hypothetical protein VFG2475 Protein 6e-111 45
y3368 NP_670667.1 hypothetical protein VFG2070 Protein 5e-88 42