PAI Gene Information


Name : YE3482 (YE3482)
Accession : YP_001007641.1
PAI name : YAPI
PAI accession : NC_008800_P2
Strain : Yersinia enterocolitica 8081
Virulence or Resistance: Not determined
Product : hypothetical protein
Function : -
Note : Similar to Salmonella typhi hypothetical protein Sty4573 SWALL:Q8Z1J2 (EMBL:AL627282) (807 aa) fasta scores: E(): 1.7e-214, 66.5 38d in 806 aa, and to Xanthomonas axonopodis hypothetical protein Xac2274 xac2274 SWALL:Q8PKA1 (EMBL:AE011864) (897 aa) fasta
Homologs in the searched genomes :   139 hits    ( 139 protein-level )  
Publication :
    -Delihas,N., "Annotation and evolutionary relationships of a small regulatory RNA gene micF and its target ompF in Yersinia species", BMC Microbiol. 3, 13 (2003) PUBMED 12834539 REMARK Publication Status: Online-Only.

    -Delihas,N., "Direct Submission", Submitted (19-JAN-2007) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Thomson,N.R., "Direct Submission", Submitted (30-JUN-2006) Thomson N.R., Pathogen Sequencing Unit, The Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, CB10 1SA, UNITED KINGDOM.

    -Thomson,N.R., Howard,S., Wren,B.W., Holden,M.T., Crossman,L., Challis,G.L., Churcher,C., Mungall,K., Brooks,K., Chillingworth,T., Feltwell,T., Abdellah,Z., Hauser,H., Jagels,K., Maddison,M., Moule,S., Sanders,M., Whitehead,S., Quail,M.A., Dougan,G., Parkh, "The complete genome sequence and comparative genome analysis of the high pathogenicity Yersinia enterocolitica strain 8081", PLoS Genet. 2 (12), E206 (2006) PUBMED 17173484.


DNA sequence :
ATGCCTTGGGGGGAATATCTCCGGGAAGAGCGCTGTTTGTTACTGGATGATGGCCGGTCGGTCGGTGCAGTTTTTGAAGT
GATCCCAGTCGGAACTGAAGGGCGGTCAACTGAGCGTCTTGAGGAAATCCGAGACGTTGTGGAAGATGCATTGCAGGACA
GTTTACCTGAACTTAATGACCACCAATGGGTGGTGCAGTTCTATTGTCAAGATGAGACGGATGTCACGGCGTATATGGAT
AAGCTGCGTAGCTATGTGAAACCTTGGGCGCAGGGTACGCCGTTTACACAGGCCTGGCTGGCTGAAACAGAAAACCATAT
GAAAAGTATCTCAGTGGAAAAAGGCCTGTTTGAGGACAAGGTTGTCACTGGCGCGCCGTGGCGTGGTCAGACGCGCCGCA
CGCGGATGGTGATTTACCGCTATGTTGAAAAACATGGGCATGATCCGCGGTCAACGGCAGTAATGTTAAATCAAGTTTGT
GATCGCCTTACCTCTGCGTTAGCCGGAGCGGATATTCGCTGTGAGCGACAAAATGGGGAGCAAATTCATAGCTGGTTATT
GCGCTGGTTTAATCCTAAACCTGAATGGGTTGCACCGAATGTACTTTACCGTACTGCCCGGTACGATGATGGTGCGCTGG
ACTCTTTGCCTATTCTTAATGATTTCAGCGAAACCCTGTGGTTTACCCGCCCACGTAGTGATGCTGAACGTGGCGTATGG
TGGTTTGATAACATGGCGCATAAAGTGGTGCCGGTAGAGCGACTGCGCCGTGCTCCTCAGACAGGGCATCTGACTGGTGA
AGTCAAACGCGGTGAAAATATCAATACGTTGATGGATCTGATGCCGGAGGGCACCATGATTGCCATGACATTGATTGTGC
AACCGCAAGATGTGCTGGAAGATGCGTTTAACCGCCTTGGCCGAAATGCGATGGGGGAAAATATTGAATCGATGCGCGCT
CGTGAAGATGCGCATACCGCTCGTGAATATCTGGGCGAACGCCATAAACTGTACCGAGCCACACTGACGTTCCTGATCAA
AGCATCGGGTCTGGATATTCTGGACAAACATTATCTCGAAATTAGTGCCAAACTGCTTAATGCTGGGTTACAGCCGGTCA
ATCCCGAGCACGACGTGGCTCCACTTAACGGCTATTTGCGGGCATTACCCATGTGTTTTAACCCCAATGCAGATAAAAGT
CATTGGTACAGTCGGCTAACCTTTGTGCAGCATTTTGCCTGCCTGTTGCCGGTATTTGGACGGGACACGGGCACCGGTAA
TCCGGGTTTTTCGTTCTTCAATCGTGGCGGTGCACCGTTAACCTTTGATCCACTCAACAAAGACGATCGCACGCAAAATG
CCCATTTGCTGTTGTTCGGGCCAACCGGCTCAGGGAAATCAGCCACGCTGTGCAGTTCGTTGTCTCAACTGATGGCGGTT
CACCGTCCACGTCTGTTTCTACTGGAAGCCGGTAACTCTTTTGGCCTGTTTGCCGACTATTGCAGCTCGTTGGGGCTGAC
GGTCAATAAAGTTAGCGTTAAACCCGGTAAAGGGGTTTCATTGGCTCCGTTTGCCGATGCACATCTGCTATTGCAGGTCT
CTCCCGATGAATGGGTCACGGATGAAGCAGAATTACCCGATATTGATACAACCGACGACAACGATGATAAACGCGATATT
CTCGGTGAAATGGAGATTGCCGCTCGCCTGATGGTTACCGGTGGGGAAGCGGCTGAAGAGGCTCGCATGACGCGGGCTGA
TCGGGGGATGTTGCGCGAAGCTATTCTGGCCGCAGCACGTACTTCCTGTGATGCGGGGCGTCAAATGTTGCCGGAAGATT
TAATGCGTGAGCTGGAAAATATTGCTCGTGACAACAATACAGATGAAAACGGACGTGAGCGCAGAACAGCCGGTCGGCGG
GCCAGGGCAGAAGAGATGTCACAAGCGCTGCGAATGTTTACCGGAGGATTTGAAGGTGAGCTGTTCAACCGCCCTGGAGC
GCCTTGGTCAGAAGCCGATGTCACGCTTATTGACCTCGGCACCCTTGCCCGCGAAGGCTATGAAGCACAGATGGCGGTCG
CCGTTATCGCTCTGGTAAACACCGTCAATAACATCGCAGAGCGTGATCAGTATCTCGATCGAGAAATCAATATGGTCATT
GATGAAGCTCATATTGTGACCACTAACCCGCTGTTATCGCCTTATATGACTAAGGTCGTGAAGATGTGGCGTAAGCTCGG
TGCCTGGTTGTGGCTGGCAACACAGAATTTGGCTGATTACCCTGATACCGCAGAAAAAATGCTGAACATGGCGGAGTGGT
GGCTGTGCCTAGCGATGCCACCAGATGAGGTCGAACAGATTTCGCGCTTCAAAAAACTCACTGAGGAGCAAAAAAACATG
CTGCTATCGGCAACAAAACTCCCCAGATGCTACACGGAAGGTGTGGTGTTGGCCAAACGTGTTGAGGCTTTGTTCAGGGT
GGTACCACCCAGCCTTTATCTAGCGCTGGGAATGACGGAGAAAGAAGAGAAAGCAGAGCGCCGTCAGTTAATGACTGAGT
ATAATTGCAGTGAACTAGAAGCGGCATTCCATGTGGCGCGAAAGCTAGATAGGGCGCGGGGTATTACAGCAAAAGAAAAG
TAA

Protein sequence :
MPWGEYLREERCLLLDDGRSVGAVFEVIPVGTEGRSTERLEEIRDVVEDALQDSLPELNDHQWVVQFYCQDETDVTAYMD
KLRSYVKPWAQGTPFTQAWLAETENHMKSISVEKGLFEDKVVTGAPWRGQTRRTRMVIYRYVEKHGHDPRSTAVMLNQVC
DRLTSALAGADIRCERQNGEQIHSWLLRWFNPKPEWVAPNVLYRTARYDDGALDSLPILNDFSETLWFTRPRSDAERGVW
WFDNMAHKVVPVERLRRAPQTGHLTGEVKRGENINTLMDLMPEGTMIAMTLIVQPQDVLEDAFNRLGRNAMGENIESMRA
REDAHTAREYLGERHKLYRATLTFLIKASGLDILDKHYLEISAKLLNAGLQPVNPEHDVAPLNGYLRALPMCFNPNADKS
HWYSRLTFVQHFACLLPVFGRDTGTGNPGFSFFNRGGAPLTFDPLNKDDRTQNAHLLLFGPTGSGKSATLCSSLSQLMAV
HRPRLFLLEAGNSFGLFADYCSSLGLTVNKVSVKPGKGVSLAPFADAHLLLQVSPDEWVTDEAELPDIDTTDDNDDKRDI
LGEMEIAARLMVTGGEAAEEARMTRADRGMLREAILAAARTSCDAGRQMLPEDLMRELENIARDNNTDENGRERRTAGRR
ARAEEMSQALRMFTGGFEGELFNRPGAPWSEADVTLIDLGTLAREGYEAQMAVAVIALVNTVNNIAERDQYLDREINMVI
DEAHIVTTNPLLSPYMTKVVKMWRKLGAWLWLATQNLADYPDTAEKMLNMAEWWLCLAMPPDEVEQISRFKKLTEEQKNM
LLSATKLPRCYTEGVVLAKRVEALFRVVPPSLYLALGMTEKEEKAERRQLMTEYNCSELEAAFHVARKLDRARGITAKEK