PAI Gene Information


Name : PMI2576 (PMI2576)
Accession : YP_002152295.1
PAI name : Not named
PAI accession : NC_010554_P1
Strain : Proteus mirabilis BB2000
Virulence or Resistance: Not determined
Product : hypothetical protein
Function : -
Note : -
Homologs in the searched genomes :   141 hits    ( 141 protein-level )  
Publication :
    -Pearson,M.M., Sebaihia,M., Churcher,C., Quail,M.A., Seshasayee,A.S., Luscombe,N.M., Abdellah,Z., Arrosmith,C., Atkin,B., Chillingworth,T., Hauser,H., Jagels,K., Moule,S., Mungall,K., Norbertczak,H., Rabbinowitsch,E., Walker,D., Whithead,S., Thomson,N.R., , "Complete genome sequence of uropathogenic Proteus mirabilis, a master of both adherence and motility", J. Bacteriol. 190 (11), 4027-4037 (2008) PUBMED 18375554.

    -Pearson,M.M., Sebaihia,M., Churcher,C., Quail,M.A., Seshasayee,A.S., Luscombe,N.M., Abdellah,Z., Arrosmith,C., Atkin,B., Chillingworth,T., Hauser,H., Jagels,K., Moule,S., Mungall,K., Norbertczak,H., Rabbinowitsch,E., Walker,D., Whithead,S., Thomson,N.R., , "Direct Submission", Submitted (25-AUG-2008) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Sebaihia,M., "Direct Submission", Submitted (18-FEB-2008) Sebaihia M., Sulston Laboratories, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UNITED KINGDOM.


DNA sequence :
ATGGGATTTTCATTATTCCGCTCACGCCGGAAAGCGCAGGCGGTTTCTGCTGAAAAAACGGATGAAAACAAGGCTGACAA
AACCGTACACCGCAGTACACCGGCAGAGAATGCCGGTTTAGGCCGTCAGCCGCTTTCCCGGCCGGGAAAAATGACCCGGG
CGGATGAGCAGAAGATTTACCATGCGAATCCGTCCTTTATTGATTACCTGCCCTGGGTGGAATTTCTGGATGAGGAGCAA
TGCCTGCTGCTTGATGACGGGGTGTCGGTCGGGGCGGTGTATGCCGTTGAGCCGGCGCCGACGGAAGGGCGCTCTGCCGG
CCGTCTGCTCGAAATCCGGGATGCGGTGGAGGATGCGCTTCAGGACAGTCTGGATGAGCATGATGATAACCCGTGGGTGG
TGCAGTTTTTTTGTCAGGATGAGGACGACGTGAGCGTCACACTGGCCACATTGCGGGACTATATCCTGCCGGCGGCCCGG
GGCACCGCGTTTACCGAAGCCTGGCTGACCGAAATGGCGCGTCATTTACGGGGGATTGCCCGTCCGGAAGGGCTGTTTAA
GGACACGGAGGTGACCGGACAACCCTGGCGCGGACAGCAGCGCCGGACCCGGATGGTGGTTTACCGCCGGCTGGGGAACA
GCTATACCGATCCGATGCCGCCGGTGGCGATGCTCAATCAGATGTGTGACCGGCTGGTCAGCAGTCTGGGCAGTGCCGGC
GTGCAGTGTGTCCGCCAGGACGGGAGGCAGGTGCATCAGTGGCTGCTGCGTCATTTCAATCCGGCCCCTGACTGGGTGGA
CCGGGAGACGCTGTACCGGGAAGGGGCGTACTGGCCGGCGGCGGAGGAAGCCCCGGGCACCCTGCCGGTGCTGAAGGATT
TTGCGGAAACGCTGTGGTTTACCCCGCCCCGGTCGGATGTGGAAAACGGCGTGTGGTGGTTTGATAACCTGGCGCACGGT
GTGGTGAATATCGAAAAACTGCGTAAAGCCCCGGGGCCGGGGCACCTGACCGGGGAGCAGAAGCGCGGTGACAAAAATAT
CAATGCGCTGATGGACCTGTTTCCGGAAGGGACGATGCTGTCGATGACCATCGTCACACAGCCTCAGGATACGCTGGAGC
AGGAATTTACCCGGCTGGGGAAAAACTCGGTGGGGGAAAACACCGACTCGCAACGTACCCGCCGGGACGTGGAAACCGCC
AGAGGTTTACTGGGGCACCGCCATAAGCTTTACCGGGCCTCCCTGTCATTGCTGCTGCGTGCCGGCTCGGTGGAGGAACT
GAACAAAAAGCGCGTGGATCTGACGGCCACGCTGCTGAATGCCGGTCTGGAGCCGGTGAAGCCGGAATATGATGTGGCGC
CGCTGAACGGGTATCTGCGTGCCCTGCCGATGTGTTTTGACCCGCAGACGGATAAAAAACACTGGTATACCCGGCTGACC
TGGGTACAGCACCTGGCCGGATTACTGCCGGTGACCGGGCGCACCAGCGGCACCGGGCATCCCGGCTGGAGTTTCTTTAA
CCGGGGCGGCGGGGTGCTGACCTTTGATCCGCAGAATAAACAGGATCGTACCCAGAATGCGCATCTGCTGTTATTCGGTC
CCACCGGGGCCGGTAAATCCGCCACCTTATGCGCACAGTTATCGCAGCTGATGGCGGTCCGCCGTCCGCGTCTGTTTATT
GCGGAGGCCGGTAACTCCTTTGGTTTACTGGCGGACTATTACGCCAGTCTGGGGCTGAGTGTCAATAAGGTCAGTGTCAA
GCCCGGCAGCGGGGTGCGTCTGCCGCTGTTTGCGGATGCGCATCAGCTGCTGATGATGGATGAACAGCGTCTGGCACTGA
ATGCGGATGAACTGCCAGAGACGGACAGTGAATCTGAGGATGATGACGGGCAGCGTGACATTCTGGGGGAAATGGAAATC
TCCGCCCGCATGATGATCACCGGCGGGGATGTGCGGGAAGAAGCGGAGCTGAAGCGGGCTGACCGGGCGATGATCCGCCG
GGCGCTGATGGCGGGGGCCCGGGCGTCTTATGCCGCCGACCGGCAGATGCTGCCGGCGGATTTGCAGGAGGCGCTGTATG
CCCTCTCCCGGGGGGAAGGTGAAGAGGAAGGGCGGGCATTAAATCCGCAGCGCCGGGCGAAAGCGGCCGAAATGGCGGAA
TCCCTGGGGATGTTTACCGAACCGGGCAGTTTTGAGCATGAGCTGTTTAATCTGCCGGGGGAGCTGTGGCCGGAGGCGGA
TGTGACCCTGATTGATTTGGGGCACCTGGCGCGGGAAGGGTATGAGGCCCAGATGGCGCTGACGATGGTGAGTCTTATCA
ACTCCATCAACAATATTGCCGAGCGGGATCAGTACCGGGCCCGGGACATTAATTTTGTGGTGGATGAGGCGCATATCGTG
ACGGTCAACCCGCTGTTATCCCCCTATGTGACCAAAGTGGTGAAAATGTGGCGGAAACTGGGGGCCTGGCTGTGGCTGGC
GACCCAGAACCTCGATGACTATCCGGACTGTGCCGAGAAAATGCTCAATATGGCGGAGTGGTGGGTGTGTCTGACGATGC
CGCAGAAAGAGGTGGAGGCAATCGCCCGCTTTAAGACGCTCAGTGAGGAGCAGAAACAGGTGCTGCTGTCCGCAGCCAAA
CTCTCCGGCTGTTACACGGAAGGGGTGGTGCTGGCGCGGAAAGTGGAGGCGCTGTTCCGGGTGGTACAGCCGAGCTTATA
TCTGGCGCTGGGGATGACGGAAAAAGAGGAGAAGCAGCAACGGCGGGTGCTGATGGAGCAATACGGATGCAGTGAGCTGG
AAGCGGCGTTCAGGGTGGCAGATGAACTTGACCGGAAACGGGGGTTACCGCCCCGGGTGCCGGGATGA

Protein sequence :
MGFSLFRSRRKAQAVSAEKTDENKADKTVHRSTPAENAGLGRQPLSRPGKMTRADEQKIYHANPSFIDYLPWVEFLDEEQ
CLLLDDGVSVGAVYAVEPAPTEGRSAGRLLEIRDAVEDALQDSLDEHDDNPWVVQFFCQDEDDVSVTLATLRDYILPAAR
GTAFTEAWLTEMARHLRGIARPEGLFKDTEVTGQPWRGQQRRTRMVVYRRLGNSYTDPMPPVAMLNQMCDRLVSSLGSAG
VQCVRQDGRQVHQWLLRHFNPAPDWVDRETLYREGAYWPAAEEAPGTLPVLKDFAETLWFTPPRSDVENGVWWFDNLAHG
VVNIEKLRKAPGPGHLTGEQKRGDKNINALMDLFPEGTMLSMTIVTQPQDTLEQEFTRLGKNSVGENTDSQRTRRDVETA
RGLLGHRHKLYRASLSLLLRAGSVEELNKKRVDLTATLLNAGLEPVKPEYDVAPLNGYLRALPMCFDPQTDKKHWYTRLT
WVQHLAGLLPVTGRTSGTGHPGWSFFNRGGGVLTFDPQNKQDRTQNAHLLLFGPTGAGKSATLCAQLSQLMAVRRPRLFI
AEAGNSFGLLADYYASLGLSVNKVSVKPGSGVRLPLFADAHQLLMMDEQRLALNADELPETDSESEDDDGQRDILGEMEI
SARMMITGGDVREEAELKRADRAMIRRALMAGARASYAADRQMLPADLQEALYALSRGEGEEEGRALNPQRRAKAAEMAE
SLGMFTEPGSFEHELFNLPGELWPEADVTLIDLGHLAREGYEAQMALTMVSLINSINNIAERDQYRARDINFVVDEAHIV
TVNPLLSPYVTKVVKMWRKLGAWLWLATQNLDDYPDCAEKMLNMAEWWVCLTMPQKEVEAIARFKTLSEEQKQVLLSAAK
LSGCYTEGVVLARKVEALFRVVQPSLYLALGMTEKEEKQQRRVLMEQYGCSELEAAFRVADELDRKRGLPPRVPG