Gene Information

Name : YpsIP31758_0326 (YpsIP31758_0326)
Accession : YP_001399320.1
Strain : Yersinia pseudotuberculosis IP 31758
Genome accession : NC_009708
Putative virulence/resistance : Unknown
Product : lipoprotein
Function : -
COG functional category : S : Function unknown
COG ID : COG3523
EC number : -
Position : 377928 - 381461 bp
Length : 3534 bp
Strand : +
Note : identified by match to protein family HMM PF06744; match to protein family HMM PF06761; match to protein family HMM TIGR03348
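The Position and Length fields can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive (the usual convention for this kind of record, and the only one under which the listed length of 3534 bp fits) and that the final codon is a stop codon. The function and variable names are illustrative only and are not part of the record.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


# Values copied from the Position field above.
start, end = 377928, 381461

length_bp = feature_length(start, end)

# A complete coding sequence should contain a whole number of codons.
codons, remainder = divmod(length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
residues = codons - 1

print(length_bp, codons, remainder, residues)
```

Under these assumptions the computed length agrees with the Length field, and the residue count can be compared with the length of the protein sequence listed further down.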

DNA sequence :
ATGAAATTGCCGTTCTTTATACGCATAGCCAAGCCCGCCATTCCTCGCCTTAAAGCGTCAATACCGGTGGTATTGGCGTT
GATGGCTTGTGCGGCACTCATTTGGGTATGGATTTACGGACCCGAGTGGCAATTGGGGGAAAATTACCCGTTTGAGACAT
TATTGAGCCGTTGGTTGGTCACGGCGGTATTTGTCTTGGTGGCCGTTTGTTGGCTCAGCCTGAAAGTCATGAGGCGAGTA
CAACACCTGGAAAAACTCCAACTGCAAACGAAAATTCAGCTCGACGACCCCGTCAGTGCTGATATCGAACAACAAAACCA
CTACCTCAATGGCTGGAAGCACCAACTTCAGCGCCATTTGAACACCCCGGAATATCTGTATCACCTGCCCTGGTACATGG
TCATTGGTGCGCGCAATAGCGGCAAAAGTACACTGATCAAAGAAGGGTATAAACTGACAGAGATCTCCGCATCCGAGCGG
CTCCATGTGGAAGGCGCAGCAGATCTGCGCGTCCGCTGCTGGTTGGGGGAACAAGCGGTTATTATCGATCCTGCGGGGGT
GCTGATTGAACAACCTACGACACCGATCGCGGGCAAAGCATCACTCAACAGCCGCTTATGGCAAAGTCTATTATCCTGGC
TGATTGAACAGCGCCAGCGTCAGCCGCTAAACGGCATTATTCTCACCGTCGATCTTCATCAGATGATGACCGCGAATAAA
GCGCAGCGGGAGACGTACGTCGCTGATATTCATCAACGGCTGCAAGAGATACGGCTGTCTTTGCACAGCCAGGTGCCGCT
GTATGTGGTGTTCACCAAAATGGACCTGCTGTACGGCTTCGAGGCCATGTACCAATCGCTGGATAAAGCCGAACGTGAAG
CGGTTCTGGGGGTGACGTTCAGCCTCAATGCGGCGGATCCGGACGTGTGGCGGACGGAGTTGAAACAGTTCTGGCAGCAA
TGGGTCGCACAACTGAATGGCGCGATGCCGGACATGATGCTGAACAGTGTGGATGCCGGGCAGCGTAGCCAGTTGTTTAG
CTTCACTCGCCAGATGCAGGGCCTGCACGATTACGTCGTGCAACTGCTGGAAGGCATTCTGTACCGTGGAGAACATGCCC
AGCCGCTGTTACGGGGGGTTTATCTCACGTCTGCCCAGCAACGCGGGCAAATGGACGATATCTTTACCCAATCTGCCGCG
GTGCAATATCACCTTGCGCCACAGGCCTTCCCGACCTGGCCGGTCTCGGATACGACGCCGTATTTTACCAAAGCGCTGTT
TAATCAGGTGCTGTTGGCCGAACCCAATCTGGCCGGAGAGAACGGTATCTGGCTGCAAAAAACGAGAAAGCGAATGTTCA
TTTTCTCGGGGGTGGGTGCCCTGGCCGCGCTGACCTTATGGGGCTACTGGCACTACTATCACCAGCTTAACTACCGGGCT
GGCGAAGAGGTATTAACCCAGGCAAAAACCTTCTTATCGATCCCACCGCCAGAAGGCGATGACCGTTATGGCAATCTGCA
ACTGCCGCTGCTAAACCCCATCCGTGACGCCACGCTGGCTTACGGTAATTACCATGAACGCAGCCCGTTCCTGGCGGATA
TGGGGTTATATCAGGGCAATAATATCGGGCCTTATGTCGAAAGCACTTACTTGCAACTCCTACAGCAACGTTTTGTCCCG
GCACTGATGAGCGGGCTGTTGGAACAACTGAACGCCGCACCGAAGGGCAGTGAAGAGAAACTGGAAATTCTGCGGGTGAT
GCGGATGTTGGAAGACGGCAGTGGCCGCAATGCCGCGCTGGTTGAACAATATATGAGCCACCGCTGGAGCCAACAATTCA
ACGGCCAGCGCGAGTTGCAGGAGCAACTGTCGAGCCACCTGAACTACGCACTGAAACACACCGACTGGCACGGCGCACGG
GAAAGTGGCGATCAATATGCGATCAAGAGCTTTGTCCCGTACCTCAGCCCGATCCAGTCAGCGCAGCAAGAGTTGAGTAA
ACTCTCGATCTACCAACGGGTGTATCAGAACCTGCGGATCAAAGCACAGGATGCGTTGCCGCCCGCGCTCGATTTGCGCG
ATCAGATCGGCGCAAGTTTTGACGACATCTTTGTCTCCGGTAACGATCGCCTGCTGGTGATCCCGCAATTCCTGACCCGT
AGCGGCTTACAGAGTTACTTCATCAAACAGAACGATCAACTGGTTGATCTCACGGTGATGGACAGTTGGGTACTTAATCT
CACCAAGAACGTCGAATACAGCGAGGCAGACCGCAAAGAGATCCATCGCCAGGTGACCGAACAGTATCTTGGCGATTATA
CCGCCACCTGGCGTGCGGCCATGAATAACCTATCGGTCAGTGATTTTGAAGGCTTGCCACAGGCGATCAGCGCCATTGAA
CAGGTGATCAGCGGGGAACAACCTTTCCGCCGTGCACTGCAAACGCTGAGTGACAACACCCGCTTGCCGGTTATCTCGGA
TCTGATCCCCGCGAAGGAGCAGCAAGAGCTGCTGCAAAAACCCGATTACCTGCTGCTGACCCGCATCAACCGTGAGTTCT
CCCCCGAAACGGCGGTGCTGGTGGAGAATGGCGATAAAGGCAGCGTGATCCAAAGTGTTTACCAAAAACTGACCGAGCTA
CACCGCTATCTGTTGGCGATCCAAAACTCGCCCGCGCCGGGCAAAGCGGCGTTGAAGGCGGTGCAATTACGTCTGGATCA
AAACAACAGTGATCCGATTTTCGAAGTCCAGCAACTGGCTAAAAACCTACCGGAGCCGCTAAACCGCTGGGTGGGCGAAC
TGGCAGAGCAAGCCTGGCGGGTGGTGATGATGGAGGCGATCCAGTCACTGGAAGTGGAGTGGAATGAGACGGTGATCAAA
CAGTATCAAACCTACCTGGCCGGACGTTATCCCTTCGATCCTCACGCGAAACAGGATGTACCACTCAGTGAGTTTGAACG
CTTCTTCGGGCCGAAAGGCACGCTCGATGCATTCTATCAGCAGAACCTGAAACCGTTTGTCGAAAACAACCTGACCGGTG
GCAGCGATGGCGAATTGCTGATCCGGCCTGATGTGTTACAGCAACTGGCGCAGGCGCGGAAAATTCGCGACACCTTCTTC
TCGGCCCAAAACGGCCTGGGCACACAGTTTGCGATTGAACCGGTGCTGTTAAGTGGCAACAAGCGCCGCAGCGTATTGAA
TCTGGATGGGCAATTACTGGATTACGCCCATGGCCGCAGCGGCGTAGTGCATCTGGTTTGGCCAAACTCGATGCGTGCAG
GAGTGGAAAGCAAACTGACGTTAGTCCCGGATGAGAGCGGCAAATCACCGCGCACCCTCAGCTTCAGCGGTCCTTGGGCG
CAGTTGCGGCTGATCAACGCCGGTGAACTGACCAATGTGGGCACCAACTCCTTCGATGTCCGCTTCAAGGTTGATGGCGG
CGAGATGACGTACCGCATCTTTGTTGATGAATCCGACAACCCATTCGCGGGCGGTTTGTTCAGTAAATTCAGTCTGCCAG
ACACTTTGTATTAA

Protein sequence :
MKLPFFIRIAKPAIPRLKASIPVVLALMACAALIWVWIYGPEWQLGENYPFETLLSRWLVTAVFVLVAVCWLSLKVMRRV
QHLEKLQLQTKIQLDDPVSADIEQQNHYLNGWKHQLQRHLNTPEYLYHLPWYMVIGARNSGKSTLIKEGYKLTEISASER
LHVEGAADLRVRCWLGEQAVIIDPAGVLIEQPTTPIAGKASLNSRLWQSLLSWLIEQRQRQPLNGIILTVDLHQMMTANK
AQRETYVADIHQRLQEIRLSLHSQVPLYVVFTKMDLLYGFEAMYQSLDKAEREAVLGVTFSLNAADPDVWRTELKQFWQQ
WVAQLNGAMPDMMLNSVDAGQRSQLFSFTRQMQGLHDYVVQLLEGILYRGEHAQPLLRGVYLTSAQQRGQMDDIFTQSAA
VQYHLAPQAFPTWPVSDTTPYFTKALFNQVLLAEPNLAGENGIWLQKTRKRMFIFSGVGALAALTLWGYWHYYHQLNYRA
GEEVLTQAKTFLSIPPPEGDDRYGNLQLPLLNPIRDATLAYGNYHERSPFLADMGLYQGNNIGPYVESTYLQLLQQRFVP
ALMSGLLEQLNAAPKGSEEKLEILRVMRMLEDGSGRNAALVEQYMSHRWSQQFNGQRELQEQLSSHLNYALKHTDWHGAR
ESGDQYAIKSFVPYLSPIQSAQQELSKLSIYQRVYQNLRIKAQDALPPALDLRDQIGASFDDIFVSGNDRLLVIPQFLTR
SGLQSYFIKQNDQLVDLTVMDSWVLNLTKNVEYSEADRKEIHRQVTEQYLGDYTATWRAAMNNLSVSDFEGLPQAISAIE
QVISGEQPFRRALQTLSDNTRLPVISDLIPAKEQQELLQKPDYLLLTRINREFSPETAVLVENGDKGSVIQSVYQKLTEL
HRYLLAIQNSPAPGKAALKAVQLRLDQNNSDPIFEVQQLAKNLPEPLNRWVGELAEQAWRVVMMEAIQSLEVEWNETVIK
QYQTYLAGRYPFDPHAKQDVPLSEFERFFGPKGTLDAFYQQNLKPFVENNLTGGSDGELLIRPDVLQQLAQARKIRDTFF
SAQNGLGTQFAIEPVLLSGNKRRSVLNLDGQLLDYAHGRSGVVHLVWPNSMRAGVESKLTLVPDESGKSPRTLSFSGPWA
QLRLINAGELTNVGTNSFDVRFKVDGGEMTYRIFVDESDNPFAGGLFSKFSLPDTLY
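The protein sequence above corresponds to a forward-frame translation of the DNA sequence. As a minimal sketch, the snippet below translates the first 30 and last 24 nucleotides of the DNA sequence using the standard codon assignments and prints the results for comparison with the start and end of the listed protein. The helper `translate` and the fragment names are hypothetical, written only for this illustration; the tail fragment is in frame because it begins at nucleotide 3511.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Fragments copied from the DNA sequence above.
head = "ATGAAATTGCCGTTCTTTATACGCATAGCC"   # first 30 nt
tail = "AGTCTGCCAGACACTTTGTATTAA"         # last 24 nt, in frame

print(translate(head))  # MKLPFFIRIA
print(translate(tail))  # SLPDTLY*
```

The same function could be applied to the full 3534 bp sequence once its lines are joined into a single string.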

Homologs from PAI DB

Gene   GenBank Accn  Product               Virulence or Resistance  PAI or REI                             Alignment Type  E-val  Identity
pmt1   AAN64194.1    Pmt1                  Not tested               macrophage toxin pathogenicity island  Protein         0.0    69
aec30  AAQ96724.1    Aec30                 Not tested               AGI-1                                  Protein         0.0    58
aec30  YP_851415.1   hypothetical protein  Not tested               PAI II APEC-O1                         Protein         0.0    58