Gene Information

Name : irp2F (CD31A_2086)
Accession : YP_005158780.1
Strain : Corynebacterium diphtheriae 31A
Genome accession: NC_016799
Putative virulence/resistance : Virulence
Product : putative siderophore biosynthetic protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2154806 - 2162377 bp
Length : 7572 bp
Strand : +
Note : Non-ribosomal peptide synthetase modules and related proteins

DNA sequence :
ATGATCAAAGAGGAACAAATTCGCGAAGAACTCCTCGCCTCGCTCCACCAGATCCTCGGCGAAGATGCCGAAATCGGCAT
CGACGACAACCTGCTCTCCCACGGCCTGGAGTCACTTCCAACAGTCCGTCTGCTCGCTGATTGGATGAAACAGGGACACC
GCGTCTCCTTTGGAGATTTCATGCGCGCCCCAACGGTAAGGCAGTGGGCGAAGATGCTTGCAGAATCCACACCAAACCAC
AGCACTGACGCCATTGAGTCCCCCAAAGATGGGTTTGCCGCTCCCATCGACGATTCCGTACCCTTTGACCTCACAGATGT
CCAATACGCCTATTGGATTGGCCGCAATTCCTCGCAGCAACTGGGCGGGGTGGGCACCCACGGCTACGTAGAGGTCGAAT
CCCGCAGTATCAATATCGACCGATTGCAACAAAGCTGGCTGACGCTGCTACGTTCCCACCCGATGTTGCGCGCCTGCTAC
ACCGAAGATGGCAAGCAATATGTGCTGCCAGAGCCGCCCCACCCCACCATCCTCGTTCATGACCTCACCAAGATGGACGA
GTCCACGCGCGAAGAAGCTCTGCTATCCACGCGCGAACGCCTTTCGCACCGACTGTTGGACATAGCTACCGGCCATGTGG
TATCTTTAGAAGTCAGCCTCCTCCCTCAGGATGTAGCAGTTATTCACTTCGATATCGACCTGCTTGTGTGCGATGTGCAG
AGTTTCCAAATTATCTTGCACGACCTCGCCCACCATTACGCCACCGGTGAAGCCCCCGACGCCGACCCGTCGTGGAGTTT
CGCTCGCTACCTCGCTGGACACGCTCGCGAAGGCGTCGCCGATATCGACCGTGACCAGGCATACTGGCGCAATCGCCTCT
CCGAGTTGCCGGGTGCCCCAACCTTGCCGATGAGCCACGGCACCAACGAGGAACAAGCACACCGGTTTGTACGCCGCAGT
CGATCATTCGATTCGGCAACTTGGTCGCGCTTGCGTGAGGTGTGTGAACACCACGCTACTACCCCAGCAATGGTGTTGCT
GACCGCCTACGCCCGCACCATCGGCCAATGGAGCGAGAATAAAAAATTCCTCATGGCCGTACCTCTGTTCAACCGCGGTT
CAGACACAGCGATAAAGAATGTCGTCGCAGACTTTACCGCCCTGACACTCACCAGTATCGATCAGTCCACACGCCGCACT
TTCAGCGAGGACCTCAAGGATATACAGGCCTCTTTCTATGAGGATTCGTCACATTCACAGTATTCGGCGGTGCGGGTGCT
GCGAGACCTACGTGCTTCCAGAGGTGAGCAAGTTTTGGCTCCCGTGGTGTTCTCCTGCAACCTGGGAGACCCACTGGTTG
GCCAAGAGTTTATAGATACCTTCGGAGAGATTAGCTACATGATCTCCCAGACCCCCCAGGTGTGGATTGACCTCCAAGTC
TTCACCACCGTCAACGGCTTCCTAATCGTCTGCGACGCAGTTGAACAGTTGTTCCCCGAGAAGATGCTGGACGATCTCTT
CGCCACCCTAGTGATGGAGATTGACAAGGCAATCACCGACGATCTCTCGCATTCAGACCCGGTAGAGAGCCCAGGAGCGC
AGGCCCGTCGAGCGTCACGTGCCGAAGTTGCCTCTTGGCGGCTGCCTGACACCACCCTTGTCGATGAGGTGATTGCCGCT
GCACGGTGCCACCCCCAGGCCACGGCCATTCGCAGTGCAAGCGGCGACGTTATTACCTACCAAGATTTGGAAGAGCAAGC
CACCACCATCGCCTCTGCTTTGGTGAATTCCGGGGTTGGGCGAGGCGCTTTGATTGCAGTGATGGTGGAGCGTGGTCCTC
GCCAGATTATTGGTGCGCTAGCCGCGATGATGGCGGGCGGAGCATATGTACCGGTGAGTCTACAGCAACCGGAATCTCGT
ATTGCCGCTCTCTTGGGGGCAAGCCAAGTAACGCATCTGATTACTGACCGACCCGATAAGGTTCTGAGCGAAACTGCAGT
TCAGGTAGTAGATTTCACGAGCGCTACCGGCACAGCCAATCTTCCGCAGCTGCATCCACAAGACCCGGCCTATGTAATCT
ACACCTCTGGGACCACTGGAACACCAAAAGGAGTCGAGATTTGCCACGGTGCTGCATGGAACACCATTAGCGATATCAAC
CGTCGCCTGGGCGTTGGTCCGACAGACCGCCTACTAAGCGTATCTTCCTTTGATTTCGATCTCTCGGTGTACGACGCCTT
CGGGCTCCTTTCTGCTGGCGGCGAACTGGTAACCATCCCAGATGATGCGCGCCGCGACGCCAAGAAGTGGGTATCGCTGG
TGGATAGTTTAGGCATAACTATCTGGAATTCGGTGCCTACTTTGTTTGAGATGCTTTTATCTGCGGCTGATCGGACACCG
AGCAAGCTCAGCAGCATACGCCATGTTCTGCTATCTGGCGACTGGATTGATACCAGCCTGCCTGAGCGCATGCGCACGGT
TACCCCGCAGGCACACCTATTAGCTATGGGCGGTGCGACGGAGGCATCTATCTGGTCCAATGGTCTCGACCTCGATGTAG
TGTCACCCGAGTGGACCTCAATTCCCTATGGTCGGCCGCTTGCCAAACAGATGTACCGGGTGGTGTCCAGCAATGGCCAG
GACTGTCCCGATTATTCTGTGGGGGAGCTGTGGATTGGCGGCCTGGGCGTAGCTACACAATATGTCGGCGACCCCGGACT
CACCGAAACTAAGTTCGTAATCTCGGAGGACTCACGTTGGTACCGCACAGGCGATATGGGCCGGTTCTGGGCCGATGGGA
CGATCGAGTTTTTAGGCCGCTCGGATAACCAAGTCAAAGTTCGTGGCCACCGCATCGAACTCGGCGAGATCGAGTCCGCG
TGTGAAGCGTTGCTGCCAATAGAGCGTGCCGTGTGTATCACGCACCAAGGTGCATCATCGTCACCATCTTTGGTTACGTT
TGCTCAGTTCACACCGTCACATGTTGCCCGAACTACTCCGGAGCAGTTTGCGACGTCATTGCGCGCCAAAGTCAACGACG
TGCTGACCGAGGGCGACATCCGCACGTCCGTCGAGCACGATGAGCATTTACAGACCGCATATGCCTTCTCAGTGATGCGA
CGCTGGGAGGAGCAACTCACCGGCGTGGGGACCCCAAACCACCTCCGCGAACACCGTAATCGCTGGCAGACATGGCTAGG
CAAAGCAGATGAACACCCGGCTACCGCAGACTTGCTTCTAGACGACGAGTCGTTCGGCGCGTTAGAGCGTTTTGTCACCC
CCTTCGAGCAAGCCTTCGTAATGGCGGAGAAGCAGCGCAGTATCGCTGAGTTCATCCAAAGCCCAGATTCGATGTCGGTG
GAGCAATTCCTCGCAACCCGTCCGTTGGGTCGGTTGGTTCACCGGGTCCTCGGCGCGGTTGTGCGTGAATGCAGTACCCA
CTCAACCAGCGAGCTCAAGATTCTTGAGATCGGGTCACGCCGGCCCGAGGCCTCTGCGGACTACGCGGCTATCGCGGGGA
CGAGCGCATACGTGCTCGCTGATCCCTATCGTCATCACCTCGAACATGCCGGACAACGTGTTGGTAACACCTTCACGTAC
CGCCAGCTCGGAGTCACTAGCACGCCCCAACCGACTCCTGGGGAGGCAGTGACGAAGGCGGACTTGGTGCTGTGCAACCA
GACGCTGCACCAGAGCGAGGACATCGAGAAAACGCTCTGTGAGGCCTGGGGCCTCAGCGCCCCTGGAGCAACGATGGTGG
TGGTAGAGCCCACTGCCCCCTCCCCGATGTCCGATATCACCGCCGCATTTATTGCCAACAATACCACGGATGCTCGGGCC
GAAACCGGTACAGTACTACTCAGCGCCCGCAGCTGGAAAGAAATCCTGCAGCGCACTGGGTGGAAACCTGTAGAGCATGT
AGAGATCACTAAAACGACGGCGCTTATCATCGCCGAACGCGCTAGTTCCAACGAATCGGTCACACTATGCGATTCTGACT
ATGCCAAGGCTACAAACCTACTGGCCACCCGCCTTCCGGAGTACATGCTACCGAAACGCATCCTGGAGCTAGCGAAGTTT
CCACTCACCAGCAACGGCAAGATTGACCGCAAAGCGCTCACAGCATTAGTGCCGGAATACTTCGACAACGAGCCCGCTGT
CACCGAACTTCCGCACACCGCCACCGAGAAACGGCTCATCGATATCTGGGATGAGCTGCTGCACACCAGCTCCAACGTGA
ACTCAGACTACTTTCGCTTGGGTGGTGATTCTCTCACAGCAACACGCCTGCGCCGTACCATCGAACAGTGTTTTGGCGTG
GAATTTCCGTTAGAGAATATCTTTGACGTTCCGTTGCTGCGCGACATGGCCGCCCGAATTGACCAGATAGCCGAAGTCCC
CCACCAGCAATCGGATCTTCCCAAGATTGTTCACGGGTCTGAACAGTACGCTCCGTTCCCGCTGACTGAGGTGCAGCAGT
CCTATCTCATCGGCAGCTCTGGAGCCATCGAACTCGGCGACGTTTCCAGCCACTGTTACTTTGAAATGTCCACTGCCTGC
CTAGACCCGGAGCGAGTGGAAGACGCGTTCAATGCCCTGATTAAACGCCATCCCATGCTGCGCACCGTCGTGTGTGAGGA
CGGATTGAGTCAGCGTGTGTTGCCCGAAGTACCACGCTACCGCATTGCCCTCATTCGCTCCGGCAACGCCGACAATGAGG
ATACTCTCGATGAAATCCGAGAGGAAATGTCCCGCCAGAAGTTTGATCCCACTCAGTGGCCATGCTTTGACGTGCGCTAC
GTAGCCGAGCCCGACGCGGGTCGACTGCTTTTAAGTTTCGATAACTTGTTTATCGACGGCTGGAGCATGTTCCACATTTT
CCGTGAGTGGAAGCAGGCCTACGACCACGGCGTAGACAGTCTCGATCCGGCAATCCCCTATTCATTTAAAGACTATGTCG
AAGCCACGATTGAACTGTCACACAGCGACATCCATAAGCGTGACCAAGCCTATTGGGAATCGGCAGTGGATACTATTTAT
CCTGCACCGCAGTTGCCGGTGACCGACACAAACGGCGCTAATACCTCGCAGTTTTGCCGCCACCACGCCCTGGTGGACGC
TGCGAAGTGGCGCCGGATTAAGCAACGGGTACGTGAGGAGGGGATGACCGAAGCCGTATTCCTAGCCGAGGTATACGCCG
AAGTCCTAGCGAGGTACAGCGATGAGCCGCGACTTAGCATCAACCTGACCCGGTTCGACCGCACTCGGTTCGCCCCCGAG
GTTGACCACATAGTTGGCGACTTCACCAGCCTATCTATTCTCAGCGTGGATACCCAGTGTGCACCATCGTTCCGGGACCG
CGCTGCAGCGCTACACAGGCGCATGTTCAGCAATCTCGATCACGGCAGTGTCTCCGGCGTGTCGGTGCAGCGAATGCTTA
CTAAGCAACGCGGTGCTCGGGTAACCATGCCGGTGGTCTTCACGTGTGGCCTGGGCGTCGTGGAGCACCCCGAGTCAGAT
CAGAGTCCTTACCTAGGCGTAATCGATCATGGCCTATCGCAGACGCCACAGGTATGGATGGACCTCCAGGTCTATGAACA
CGATGGTGGCCTGATGCTGAATATGGACGCGGTCGAGGCGATCTTCCCCGACGACATGGTGGCGGAGCTTTTCACCAGCC
TTACAGCCACTCTCTCGCACCTCGCAGAGTCCCCTGAACTCTGGAACGCGCCCACCAGTACCATAGCTCCGACGACCAAC
GCGCCCACCGCAGATCGAATCAACGACACAGACCGGGAACTCCCCGGAGCGGATAAGAGCCTACTGGGGCTGTATCAGAA
GGGACTTGCCGAGCATGGTAACAACCTCGCGGTGATAGACGCCACCACGCAATGGACATACGAGCAACTCAATGAGCAGT
CGGATAAGTGGGCCCAGCTTATTGCTGCAACTGACCCGGCGCCTGGTGATCTCGTTGGCATCATGATGGAAAAGTCCGCA
CAACAGATCGCAGCGGTACTAGGCGCAATGAAGGCCGGATGCGCCTACTTGCCGCTGAGCGTGGACCAACCGGTGGGGCG
TAACACTTCCATCATCAACGATGCGGGTGCATCCATTGTTGCGATGGACCATCCGGACGATGATTTCGCTGCGTTAGCCG
AGCACTGCACAGTCATCACTCTGGCCGATGTCGCCCGGCATCGGCCAGGTGACCAGGCCTTAAGCGAATCGAGTCCGACA
CCCAGCTCGCTTGCGTACGTCATCTACACGTCTGGTACGACGGGCACTCCCAAAGGAGTAGCCATCACCCACGAGTCGGC
GGTCAATACGATTGTCGATGTCAACGAACGCCTCGGGGTGACTCCGACCGACCGGATATTGGGGATTTCAGAGCTAAACT
TCGATCTATCGGTATACGACATCTTCGGGATGTTCGCGCGGGGTGCTACCTTGGTGTTGCCATCTCCAGCAGACAAACGT
GATCCGCAGTGCTGGGCAGATGCCGTAACTACGCACTCGGTGACGCTGTGGAACTCTGTGCCAGCACTTTTTTCGATGTA
CGTGGACCACCTCCGCGAACGGAGCCTCATTGGTTCCTCCGTACGGTCGGCGCTACTCAGCGGAGATTGGATACCTGTAA
ATATCGCTTACCAGGTCAGCACCCTCTTCCGGGACTGCACAGTTTTCGCTGCTGGGGGTGCTACCGAGGCATCGATATGG
TCAAACTGGTACGAGGTAGGCGTCGATGATGCCTCGCGTACCAGCATTCCCTATGGAACGCCGCTGGCAAATCAGCGAAT
GTACATCCTCGATGAGGCCCTGAACCCCCGACCCACGCATGTGCCGGGAGACCTTTACATTGCCGGACGTGGCCTGGCCA
TGGGGTACTGGAAGGACCCAGAGAAGACCGCAGCATCATTCATCACCCACCCCCGCACCGGCGAACGAATGTACCGCACG
GGCGACAAGGCCCTCTACAATCACCTTGGTCACATCATCTTCCTGGGGCGCGAGGACGGTCAGGTTAAGGTCAACGGCTA
CCGCATCGAACTCGGTGAGATCGAGTCAACAGCACGAAAATTCAACGAACTCCGCGATTGCGTCGCTGTCAACGACCATG
GCATTGTTCTCTACGTTGTTACGCATGAGGGCTTTAACATGGCAGCACTCAATAACCACCTCGCCGAGTCTCTCCCTGCC
TATATGCGTCCACGCGTGATATCACGCATTGATGGGCTACCACGGTCTTGGAACGGCAAGATTGATCGCAAGAGTTTAGA
AGGAAAAACCTTTGAACAACCACAAACAAGGGAACGCTCTCGTAATCACCGAGATTCGGGTATCATTACCATCTTGCAAG
AACTCCTCGGGCCCAAGGAGATCAGCATCGACGATGACTTCTTCACCATCGGTGCCGACTCGCTCACGGCCGTGCGGCTT
ACTAACTCGATTCGTCGAGAAATGTCAGTAGAGATCTCCATACGTGACGTATTCAACCATCCAACGGTCCGAGAATTGTC
CGACCTGATCGCCGACATCGTCGGCAGCGACGTAGAAGAAGGCGAAATTTAG

Protein sequence :
MIKEEQIREELLASLHQILGEDAEIGIDDNLLSHGLESLPTVRLLADWMKQGHRVSFGDFMRAPTVRQWAKMLAESTPNH
STDAIESPKDGFAAPIDDSVPFDLTDVQYAYWIGRNSSQQLGGVGTHGYVEVESRSINIDRLQQSWLTLLRSHPMLRACY
TEDGKQYVLPEPPHPTILVHDLTKMDESTREEALLSTRERLSHRLLDIATGHVVSLEVSLLPQDVAVIHFDIDLLVCDVQ
SFQIILHDLAHHYATGEAPDADPSWSFARYLAGHAREGVADIDRDQAYWRNRLSELPGAPTLPMSHGTNEEQAHRFVRRS
RSFDSATWSRLREVCEHHATTPAMVLLTAYARTIGQWSENKKFLMAVPLFNRGSDTAIKNVVADFTALTLTSIDQSTRRT
FSEDLKDIQASFYEDSSHSQYSAVRVLRDLRASRGEQVLAPVVFSCNLGDPLVGQEFIDTFGEISYMISQTPQVWIDLQV
FTTVNGFLIVCDAVEQLFPEKMLDDLFATLVMEIDKAITDDLSHSDPVESPGAQARRASRAEVASWRLPDTTLVDEVIAA
ARCHPQATAIRSASGDVITYQDLEEQATTIASALVNSGVGRGALIAVMVERGPRQIIGALAAMMAGGAYVPVSLQQPESR
IAALLGASQVTHLITDRPDKVLSETAVQVVDFTSATGTANLPQLHPQDPAYVIYTSGTTGTPKGVEICHGAAWNTISDIN
RRLGVGPTDRLLSVSSFDFDLSVYDAFGLLSAGGELVTIPDDARRDAKKWVSLVDSLGITIWNSVPTLFEMLLSAADRTP
SKLSSIRHVLLSGDWIDTSLPERMRTVTPQAHLLAMGGATEASIWSNGLDLDVVSPEWTSIPYGRPLAKQMYRVVSSNGQ
DCPDYSVGELWIGGLGVATQYVGDPGLTETKFVISEDSRWYRTGDMGRFWADGTIEFLGRSDNQVKVRGHRIELGEIESA
CEALLPIERAVCITHQGASSSPSLVTFAQFTPSHVARTTPEQFATSLRAKVNDVLTEGDIRTSVEHDEHLQTAYAFSVMR
RWEEQLTGVGTPNHLREHRNRWQTWLGKADEHPATADLLLDDESFGALERFVTPFEQAFVMAEKQRSIAEFIQSPDSMSV
EQFLATRPLGRLVHRVLGAVVRECSTHSTSELKILEIGSRRPEASADYAAIAGTSAYVLADPYRHHLEHAGQRVGNTFTY
RQLGVTSTPQPTPGEAVTKADLVLCNQTLHQSEDIEKTLCEAWGLSAPGATMVVVEPTAPSPMSDITAAFIANNTTDARA
ETGTVLLSARSWKEILQRTGWKPVEHVEITKTTALIIAERASSNESVTLCDSDYAKATNLLATRLPEYMLPKRILELAKF
PLTSNGKIDRKALTALVPEYFDNEPAVTELPHTATEKRLIDIWDELLHTSSNVNSDYFRLGGDSLTATRLRRTIEQCFGV
EFPLENIFDVPLLRDMAARIDQIAEVPHQQSDLPKIVHGSEQYAPFPLTEVQQSYLIGSSGAIELGDVSSHCYFEMSTAC
LDPERVEDAFNALIKRHPMLRTVVCEDGLSQRVLPEVPRYRIALIRSGNADNEDTLDEIREEMSRQKFDPTQWPCFDVRY
VAEPDAGRLLLSFDNLFIDGWSMFHIFREWKQAYDHGVDSLDPAIPYSFKDYVEATIELSHSDIHKRDQAYWESAVDTIY
PAPQLPVTDTNGANTSQFCRHHALVDAAKWRRIKQRVREEGMTEAVFLAEVYAEVLARYSDEPRLSINLTRFDRTRFAPE
VDHIVGDFTSLSILSVDTQCAPSFRDRAAALHRRMFSNLDHGSVSGVSVQRMLTKQRGARVTMPVVFTCGLGVVEHPESD
QSPYLGVIDHGLSQTPQVWMDLQVYEHDGGLMLNMDAVEAIFPDDMVAELFTSLTATLSHLAESPELWNAPTSTIAPTTN
APTADRINDTDRELPGADKSLLGLYQKGLAEHGNNLAVIDATTQWTYEQLNEQSDKWAQLIAATDPAPGDLVGIMMEKSA
QQIAAVLGAMKAGCAYLPLSVDQPVGRNTSIINDAGASIVAMDHPDDDFAALAEHCTVITLADVARHRPGDQALSESSPT
PSSLAYVIYTSGTTGTPKGVAITHESAVNTIVDVNERLGVTPTDRILGISELNFDLSVYDIFGMFARGATLVLPSPADKR
DPQCWADAVTTHSVTLWNSVPALFSMYVDHLRERSLIGSSVRSALLSGDWIPVNIAYQVSTLFRDCTVFAAGGATEASIW
SNWYEVGVDDASRTSIPYGTPLANQRMYILDEALNPRPTHVPGDLYIAGRGLAMGYWKDPEKTAASFITHPRTGERMYRT
GDKALYNHLGHIIFLGREDGQVKVNGYRIELGEIESTARKFNELRDCVAVNDHGIVLYVVTHEGFNMAALNNHLAESLPA
YMRPRVISRIDGLPRSWNGKIDRKSLEGKTFEQPQTRERSRNHRDSGIITILQELLGPKEISIDDDFFTIGADSLTAVRL
TNSIRREMSVEISIRDVFNHPTVRELSDLIADIVGSDVEEGEI

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
irp2F YP_005134498.1 putative siderophore biosynthetic protein Not tested Not named Protein 0.0 99
irp2F YP_005163486.1 putative siderophore biosynthetic protein Virulence Not named Protein 0.0 99