PAI Gene Information


Name : DIP0753 (DIP0753)
Accession : NP_939126.1
PAI name : Not named
PAI accession : NC_002935_R2
Strain : Corynebacterium diphtheriae 241
Virulence or Resistance: Resistance
Product : lantibiotic modifying enzyme
Function : -
Note : Similar to Staphylococcus aureus lantibiotic modifying enzyme TR:Q9S4D1 (EMBL:AF147744) (965 aa) fasta scores: E(): 5.3e-32, 21.471% id in 1006 aa, and to Lactococcus lactis Plasmid pMRC01 lacticin 481/lactococcin biosynthesis protein LcnDR2 TR:O87238 (EM
Homologs in the searched genomes :   5 hits    ( 5 protein-level )  
Publication :
    -Cerdeno-Tarraga,A.M., "Direct Submission", Submitted (03-OCT-2003) Cerdeno-Tarraga A.M., submitted on behalf of the Pathogen Sequencing Unit, Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA E-mail: amct@sanger.ac.uk.

    -Cerdeno-Tarraga,A.M., "Direct Submission", Submitted (08-APR-2002) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Cerdeno-Tarraga,A.M., Efstratiou,A., Dover,L.G., Holden,M.T., Pallen,M., Bentley,S.D., Besra,G.S., Churcher,C., James,K.D., De Zoysa,A., Chillingworth,T., Cronin,A., Dowd,L., Feltwell,T., Hamlin,N., Holroyd,S., Jagels,K., Moule,S., Quail,M.A., Rabbinowits, "The complete genome sequence and analysis of Corynebacterium diphtheriae NCTC13129", Nucleic Acids Res. 31 (22), 6516-6523 (2003) PUBMED 14602910.


DNA sequence :
GTGCTTGTGGGTAGCATTAGGAAATTTTTCCCAGAGATCACAGATGATCTGTTCCAAGAACTATTGACTAGTGCTGCCTA
CGAGCACGGTGTGGCCTCGCTCCAAGGAGAGATCGAAAAATTCACACAAAAGAACAGTGAGACTGACGAAGAGGTGCTCG
CTGACAATCTTGCAGAACTCTACCGATTCTTCGAACGCGAACCTATCCCCTTCCTTGATTCAACATGGGCACTATCCGAC
TTGCAAATATCATCCATCAGGGACGTGATTGATGCTTTTCCTCACAGAATCGACTGCGACGCTGTAATTAAATCATTCGT
ATCTGCAAATTGTTCAGCACTTCAACGCCTTCTTCTGAGATGTCAACTCGAGTTCGTTGAAAAGGAAATAAGTAGCAACG
GACCAGAACTGACTTATGAAGAGTCCTCTCAAGCACTTTTCCGAAGCTTGGCACAATTTAAGGAACGCTATCCAATTGCG
TGGTACCTGGCAAACCGTAAGTGTGAACGCAGTTTAAGGTACCTGTCAGAGATACTAGACCATTTGAAGGAAGACTGGTC
TAGACTGTCAAGGTTCGGACTGACAGAGGACTCACAAGTATCAAACATCACCTTTGAACTGGGTGATACGCATGATGGCA
AAAGCGTTGCAGTTGTGACCTTGGATGATGGACAACAGATTTTTCACAAACCGCGACCTCTAGATGTAGAGGAGTCTTGC
TCGAGGTTTGCCGAGCAGTTGGGCCGAATGTTTGGATTTACCTGCCCGTTCGTCGGAAAGGTAATCACTCGGGGCTCATA
CGGCTGGGCAGAGCATGTCCCACACGTCGAAGAAAGCCGCTTTGATAATCCAAGGGCAGCAGCCGAGTTTGCCCTACTCC
TGAAGCTTCTAAGTTTTACCGATGTTCATTACGAGAACGTGCGCTTTAGCGCCGATGGTATCCCAATCCTGGTAGACGCT
GAGACCGCCCTTACCTCGGGTCTCTGTCGTCGTGACAGTGAGGGTATACCAATACACGCTGCACTATCCGAGACTGTCAC
CTCAACAGGCTTTTTCCCGTCACCGCTAGTTATCCCCAAAAAGCGCGGAGAAACTTTTGTAGATGTTGGCGTACTTGGTA
GACGCGATAAGAACCTTATTACTGAGCGCCAGCTAGTATTAAAGAATCCTTTTACAAATAAGATGCACTTGGTGTATGAG
GATGTAGCCAAGCATATGTCTGATAACGCGTCTAGTTTCCGTTTTTCATCCGACTACGTCAGAAGTTTAACAGAACGCTA
CAGAGAACTTACTAAAGCTGTCGTTGACAAAAAGGCATCGATCTCTAACTTGTTACGAGAGTGCTTTTCTAAGTCGTGTT
TTCGTGTAGTGGTGCAGGACACCATAAAATACGTTAATGCTATTCAGCTGGCAACAAATCAGCAGTGTCTAAGTAGTCCT
AGTCTTTATGTTGGAGCCCTGCTACGCTTTGCTATTGGGCGTTTCGACAGTGACCGGCTACTATTGCGTAACGAGCTTGC
AAGCCTCATTTCTGGAGATATTCCTCGATATGTGGTCTCAGCAACATCGAGTGACCTCGAAGGGTCTGTTCAAACAGTCA
AAAAGAACTACTTTATCGAGTCGCCCATCGAAAATGCAATAGGTTGTGTGCAAAAGATGGATTCCAAAACCATTGAGCTG
GACTGCTGGCTAATCGAGATTTCATTCGCGAGTTATTATGATGAATCATCCAATGCCACACAGTTTAAATATTCCGACAG
CCTTTTTAACGTTTCACATAATATCGAAGTGTCACTAAATGAGGCACTTACATCTTTGTTGAATGGGTATGTTTACGGTT
CAGGAAGTGCCCCGGCTACTTGGATTGGCGCTAGACTGAGCAACCAGGCTCATCAGTACTGGTACGTCGACGAAGTCTCC
ATGGATCTATATGCCGGATCCTCCGGCTTGGCTTTACCGCTTGTGTTGTCGGGGCCTGCGGGTCTTTCCTGTCGAGGAAG
AAAAGAGGCACATGACTACTTTGATGGACTAGTCACCAAGCTTGAATCTCTAGATACCACACAGTTGGCTACGCTAAATA
CGGGAGCCTTGTCTGGTGCGCACTCTGTGTTGTGGTCATTGCATTGTCATTATTCCGCCGCGGGGGATAACGGTGGACTC
GAGCGACTAAAGTCACTAGCCAATAAAATGATTTCGTGTGCCTCGCACGACGGATTCGATTTCACTGGAGGCACTATCGG
TTTATCGGTTTTGGCAAAGGCCTTAGGTGTATTCGAATCAACCAAAATTGAAGATGACTATTTAAATGCTCTAGACAGTT
TGGCTGAATCGACCTCGCGTGGTTGTAGTTGGTTATCCGGCTATGCACATGGTCATGCCGGTGCTTTAGCCTCCGCTGCT
ATGCTCTGCGACCACATTTCGAATAGGGATCGGCTCGAGAGCTCAGTAAGTAGATTGTTTAGTCAGTTCTTGAGTTTCCG
TGAGAGTGAATCGAGATTATGGCCAATCGGCTTTGAACAAACTGGCATCGGTAGAGGTTGGTGTTCTGGTACTCCAGGTG
TCCTTTTAGCGTTAGCTCAATTCCACGAGAGCGAGTTGGGAAAATGTTATGAGCTTGCTCCCACGATTGAATTTCTCACC
GATGTAGTTAAGAAGGAAACCTTCGGCGGTAATCCCACATTGTGCCATGGGGACGTGGGCAATCTCTGGATTCTACAGAA
GGTTGGTGAGTTGATAGGGGACCATGCCTTGGTTACAAGAAGTCGCGAGGCCGGCCTCTTCTGGCTCCAAAACGTTCTCC
CTAGTTTTCTACGGTCGCTTTCAAGATTTTCGATTTCACACAGCTTCTTCGCTGGAATTGCTGGAACAGCGCTATATGGA
GAGTACCTGTTGTCGTCTGAGGAAGTTGTGAGGTGTCCGCTATGGTTAGAGTGA

Protein sequence :
MLVGSIRKFFPEITDDLFQELLTSAAYEHGVASLQGEIEKFTQKNSETDEEVLADNLAELYRFFEREPIPFLDSTWALSD
LQISSIRDVIDAFPHRIDCDAVIKSFVSANCSALQRLLLRCQLEFVEKEISSNGPELTYEESSQALFRSLAQFKERYPIA
WYLANRKCERSLRYLSEILDHLKEDWSRLSRFGLTEDSQVSNITFELGDTHDGKSVAVVTLDDGQQIFHKPRPLDVEESC
SRFAEQLGRMFGFTCPFVGKVITRGSYGWAEHVPHVEESRFDNPRAAAEFALLLKLLSFTDVHYENVRFSADGIPILVDA
ETALTSGLCRRDSEGIPIHAALSETVTSTGFFPSPLVIPKKRGETFVDVGVLGRRDKNLITERQLVLKNPFTNKMHLVYE
DVAKHMSDNASSFRFSSDYVRSLTERYRELTKAVVDKKASISNLLRECFSKSCFRVVVQDTIKYVNAIQLATNQQCLSSP
SLYVGALLRFAIGRFDSDRLLLRNELASLISGDIPRYVVSATSSDLEGSVQTVKKNYFIESPIENAIGCVQKMDSKTIEL
DCWLIEISFASYYDESSNATQFKYSDSLFNVSHNIEVSLNEALTSLLNGYVYGSGSAPATWIGARLSNQAHQYWYVDEVS
MDLYAGSSGLALPLVLSGPAGLSCRGRKEAHDYFDGLVTKLESLDTTQLATLNTGALSGAHSVLWSLHCHYSAAGDNGGL
ERLKSLANKMISCASHDGFDFTGGTIGLSVLAKALGVFESTKIEDDYLNALDSLAESTSRGCSWLSGYAHGHAGALASAA
MLCDHISNRDRLESSVSRLFSQFLSFRESESRLWPIGFEQTGIGRGWCSGTPGVLLALAQFHESELGKCYELAPTIEFLT
DVVKKETFGGNPTLCHGDVGNLWILQKVGELIGDHALVTRSREAGLFWLQNVLPSFLRSLSRFSISHSFFAGIAGTALYG
EYLLSSEEVVRCPLWLE