PAI Gene Information


Name : DIP0586 (DIP0586)
Accession : NP_938962.1
PAI name : Not named
PAI accession : NC_002935_P4
Strain : Corynebacterium diphtheriae 241
Virulence or Resistance: Virulence
Product : siderophore biosynthesis-like protein
Function : -
Note : Similar in its N-terminal region and C-terminal region to Rhizobium sp hypothetical 71.0 kDa protein Y4xN SW:Y4XN_RHISN (P55706) blast scores: E(): 3e-23, score: 277 22% id and also to Rhizobium meliloti rhizobactin siderophore biosynthesis protein RhbC o
Homologs in the searched genomes :   30 hits    ( 29 protein-level,   1 DNA-level )  
Publication :
    -Cerdeno-Tarraga,A.M., "Direct Submission", Submitted (03-OCT-2003) Cerdeno-Tarraga A.M., submitted on behalf of the Pathogen Sequencing Unit, Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA E-mail: amct@sanger.ac.uk.

    -Cerdeno-Tarraga,A.M., "Direct Submission", Submitted (08-APR-2002) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Cerdeno-Tarraga,A.M., Efstratiou,A., Dover,L.G., Holden,M.T., Pallen,M., Bentley,S.D., Besra,G.S., Churcher,C., James,K.D., De Zoysa,A., Chillingworth,T., Cronin,A., Dowd,L., Feltwell,T., Hamlin,N., Holroyd,S., Jagels,K., Moule,S., Quail,M.A., Rabbinowits, "The complete genome sequence and analysis of Corynebacterium diphtheriae NCTC13129", Nucleic Acids Res. 31 (22), 6516-6523 (2003) PUBMED 14602910.


DNA sequence :
GTGTTAGAAAATGGTTATATATCCCAGAGTAATTGTGAATCCGACATCCGCGGTCGCCTCCTAGCCGCACTCATTGACGA
ACACCTGATCACCCCTGAGACACACCACAAACTCACCAACACCACAACCCCGCCAGCCGACCTGCTTCGTCAACTAGCGC
AGGACAACCAGCTGACCATCACGCCCACCAACCTCGAACGTGCCTGCGCCGAAATCGCCGACAGCGTGGTGGGTCTCCAC
CGCGCACGCACCGCTATCACACAACGCTGGGAACAAGCGCTGAATGCACAAACCGCAACCAATACATCCCAGACTCCCTA
CAGCGACCTCATCCAAGCCCTGCGACAGCGTTGCCGACTAGAAGACCGCGGATCGTCTGCCATGTTGGCACGCTGCGAAC
AACTCGTCTGCGACGGGCATCCTGCACATCCTGCGGCTAAGACTTCGTTAGGAATCGGCGATTCTTTCCTACACGTCCTC
CCAGAACAAACAGAAACGATCCAGCTACGCTTCGTCGCCGTCGACACCGACCATGCCGTCGTCGTAGGAGGACACCCAGT
AGAAACCATCAGTGAAGCCATGCCGCTATTGGGCGCGCGACTAAACGCTGAGCTAGAGCGTTGTGAATTACACCATCACA
GCGTCATCCCCGTGCACCCATTCCAATGGGACAACGTGATTAGCAGCGAATTTGCTGAAGAAATCGCCTCTGGAACGATC
GTGCTCCTCGAAACGACTGCCACCGCAGAACCCCTCATGTCCGTGCGCACACTGCGTGTCAGTGACGCTACTGGTTCCAT
GCACATCAAAGTGGCGCTCGAAATCCAGCTCACCGGCGCAGTGCGAGGGGTATCGGCAGGAGCCGTCGCAGCGCCCGCCA
TCGCCAGCATTATCGACGACGCCTGCACCCTCGACGCCGGGTTCATTCCCCGAACCGACACCGACCAACCAGCATTTAGC
GTTGCCTATGACCGCAGCGCCATTCGCTGGAATGCCGACAGCGGAATCCGCGCCCACTGTTTTGGCGCCGTGCTTCGCGA
CGACCCGACAGGAAACGCAGATGACGAGATCGCGATGCCCGTCGCAACGCTGCTAGCACGCAACCCACTCACCGGCGCCA
CCATCGCAGCAGACCTCATCGACGAACTCAGCCACCGCCACAACCGCCACCGCGACGAAATCGCCACCGACTGGTTTACC
GCACTCGGAAAGTTCCTATTCGTCCCAGCCGTAGCCCTCATCGCACGATGGGGGATAGCCCTCGAACCACACCCCCAAAA
CACCGTCATCATTCTTCGCGACGGCATGCCCCACCGCATCGTCGTCCGTGACCTAGGCGGCTGTCGACTCTGGGCAAACG
GACCCCTTGCCGCGCACCCCATCGTCGACAAGCTGCGCGCCACCGCACTGATAGAAAACGACCTCATTCGACTCATCGAC
AAAGTCTTCTATCCACTCGTAGCCAACCTACACCGCAACCTCATCACAGCCGCTGCTATAACCAAACCAGCCCAGCAGCG
CATTAACCTCGCACTCTCAACACACATCGCCCGAGAATATTGGCGTACCACCGCCTCCCACCTGCACCCAGCCAACACGG
TCGCCACCGTCTTCCAACGCATCTTAGGGCCCGTTTTGCCAGTTAAACGAGTCCTCGGCATGCGGCTATCGGGAGCTGTC
ACCGAACAAGAATATGTAGCAGACATCAGCCCCCTCGAAAACCTCGAATTACTCACACCAGAGTCCCTCCGCGCCGCATG
CGCACCATACTCCGAGTGGGCCCACGAAACCTTAGCGACCCGACTCCACGATGCTGCTGTTAGGGAAAGAATCGACGATT
CCTTCCCAACCCTTCGAGACGACATCGCCAATGCGGAAGAAAACCTAGCCCTCGTTCGTGCCCAAGTCACCTCCCGTGTG
AACACCCCAGAAAGCTACTGGGATCTTCTCAAGGGCCTACCACCACACGCAGCCATGATCGCCGCCGACTCCTACGCCAT
CAGCGGTCACAACGTACACCCCCTCGCAAAACTACGGCGCGGATTCAGCATCGAAGAATCCGCAGCTTACGGGCCCGAAG
CTGGCATGAGTACCGACCTGCGTCTGGTGGGCGTCGATAAGCGAATGATCGACACCTCCACCACAGCCGACTGTGTCCGA
CTGATCGCCCACCACTTCCCACAGCACATCGCATACGCGCGTACTCACCTGCACGAACACGGACTCGACGCAGATTCCTA
CGCCATCATCCCCGTGCACCCCTGGCAACTAGAACATGTCATCCGCGAAGCATTTGCCGAGGACATTGCTGACCACACGA
TGGTGCCGATTCCCAATATCGCCATCGCCGCACATCCCACCATCTCGTTGCGCACCTTAGTTCCGCATGCACCTACCCCC
AGCGGGACTCGACCCTTTATCAAGTGCGCTGTCGATGTCACGTTGACCTCCACACGACGCTCGATCTCCCAAGACAGCGC
ACTCGGCACGCCCCGAGTTGCTGGCCTAGTTGCCACTGCATTGGAACAACTTCGTAGAGAAACCAATGTGCAACCGCGCG
CAGTGGTGGTCCCAGAATTGAGCGGATTAGCCCTTAGCCGCGATGAAAGGTCGGAAGGAATCGACGATTCCTTCCGAAAA
ACACGCCAAAGAGGGCTCTCGGTGCTCCTGCGTGACGACGCCACAGCGTATCTCGCGCCGGGCGAGATCGCGATGAGCGC
GTGTGCATTGCGCGGGCATGAAGGTGTGGTCCCAAGTCCGCTGCGCGATATTAATGAGGAGTTTTTCGACGACTATGTGT
ACGACCTGATGTCGACAGTCCTCGGACTCATGATGGTGAAAGGCATTGCGTTGGAGCAGCACCTGCAAAATACGCTGGTG
CGCATTGACCTCAGCGGTAAGACACCGGTGTATCGCGGCATAATGCTGCGCGACTTTTCGGGCTTGCGGGCGTGGGCTCC
TCGGCTACAGCAGTGGGCCAGTGATCAGGTTTTTGAACCCGGCGCGATCACGTTGACTGATGACCATGAGGAGTTTGTGA
ATAAGGGCTTTTATGCGTCGGTGTTTGGCAACCTTGACGGCATTGTCGACGAATATTCCCAGGCGCGGGGTGTGGATGCG
CAGAGTTTGTGGGAGCGGGTGCATGTGCAGATCAATCGGTTTGTGCAGGAGGCTGCGGGGATGTTGCCGGCTGTGGATAT
GGAGTGGATGCGGCGAGAAACGATCCGGCGTAAGGGGTTTGTGTCGATGAGTTTGCAGGGGTCTAGTGCGGATATTTATG
TAGAGGAGCGTAATCCGTTGGCGGCTAATCCTGCGTGGGCCTAG

Protein sequence :
MLENGYISQSNCESDIRGRLLAALIDEHLITPETHHKLTNTTTPPADLLRQLAQDNQLTITPTNLERACAEIADSVVGLH
RARTAITQRWEQALNAQTATNTSQTPYSDLIQALRQRCRLEDRGSSAMLARCEQLVCDGHPAHPAAKTSLGIGDSFLHVL
PEQTETIQLRFVAVDTDHAVVVGGHPVETISEAMPLLGARLNAELERCELHHHSVIPVHPFQWDNVISSEFAEEIASGTI
VLLETTATAEPLMSVRTLRVSDATGSMHIKVALEIQLTGAVRGVSAGAVAAPAIASIIDDACTLDAGFIPRTDTDQPAFS
VAYDRSAIRWNADSGIRAHCFGAVLRDDPTGNADDEIAMPVATLLARNPLTGATIAADLIDELSHRHNRHRDEIATDWFT
ALGKFLFVPAVALIARWGIALEPHPQNTVIILRDGMPHRIVVRDLGGCRLWANGPLAAHPIVDKLRATALIENDLIRLID
KVFYPLVANLHRNLITAAAITKPAQQRINLALSTHIAREYWRTTASHLHPANTVATVFQRILGPVLPVKRVLGMRLSGAV
TEQEYVADISPLENLELLTPESLRAACAPYSEWAHETLATRLHDAAVRERIDDSFPTLRDDIANAEENLALVRAQVTSRV
NTPESYWDLLKGLPPHAAMIAADSYAISGHNVHPLAKLRRGFSIEESAAYGPEAGMSTDLRLVGVDKRMIDTSTTADCVR
LIAHHFPQHIAYARTHLHEHGLDADSYAIIPVHPWQLEHVIREAFAEDIADHTMVPIPNIAIAAHPTISLRTLVPHAPTP
SGTRPFIKCAVDVTLTSTRRSISQDSALGTPRVAGLVATALEQLRRETNVQPRAVVVPELSGLALSRDERSEGIDDSFRK
TRQRGLSVLLRDDATAYLAPGEIAMSACALRGHEGVVPSPLRDINEEFFDDYVYDLMSTVLGLMMVKGIALEQHLQNTLV
RIDLSGKTPVYRGIMLRDFSGLRAWAPRLQQWASDQVFEPGAITLTDDHEEFVNKGFYASVFGNLDGIVDEYSQARGVDA
QSLWERVHVQINRFVQEAAGMLPAVDMEWMRRETIRRKGFVSMSLQGSSADIYVEERNPLAANPAWA