Gene Information

Name : CFSAN001921_24640 (CFSAN001921_24640)
Accession : YP_008258185.1
Strain :
Genome accession: NC_021815
Putative virulence/resistance : Virulence
Product : peptide synthetase
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 181370 - 187477 bp
Length : 6108 bp
Strand : -
Note : Derived by automated computational analysis using gene prediction method: GeneMarkS+.

DNA sequence :
ATGATTTCTGGCGCACCATCTCAGGATTCGCTGTTACCGGACAACCGCCACGCGGCTGATTACCAACAATTACGCGAGCG
GCTTATACAGGAACTGAATTTAACGCCGCAGCAGTTACATGAAGAGAGCAATCTGATCCAGGCCGGCCTGGATTCCATAA
GGTTGATGAGATGGTTACACTGGTTTCGTAAAAATGGCTACCGCCTTACCCTTCGCGAGCTGTATGCCGCCCCCACGCTG
GCGGCATGGAACCAGTTAATACTCAGCCGGTCGCCGGAGAACGCGGAAGAAGAAACGCCGCCCGACGAATCATCCTGGCC
GAACATGACCGAAAGTACCCCCTTCCCATTGACGCCGGTACAGCACGCCTACCTGACGGGCCGCATGCCGGGGCAGACGC
TTGGCGGCGTGGGTTGCCACCTGTATCAGGAGTTTGAAGGCCATTGTCTGACAGCGTCGCAACTGGAGCAGGCCATTACG
GCCTTGCTGCAACGCCACCCAATGCTGCATATCGCCTTTCGCCCCGACGGGCAGCAGATCTGGCTACCGCAACCTTACTG
GAACGGCGTCACCGTTCATGATTTACGCCATAACGACGCTGAAAGCCGCCAGCCCTATCTGGAAGCACTGCGCCAGCGCC
TGAGCCACCGTCTTTTACGCGTGGAGATCGGTGAAACGTTTGATTTTCAGCTGACGCTCTTGCCGGACAATCGCCACCGC
CTCCATGTCAATATTGACCTGCTGATTATGGATGCCTCCAGCTTTACGCTTTTCTTCGATGAGCTTAACGCCCTGCTGGC
CGGAGAATCGCTGTCGGCTATAGACACCCGCTATGATTTCCGTTCGTATTTGCTGCACCAGCAGAAGATCAATCAACCAC
TGAGAGACAACGCACGCGCTTACTGGCTGGCGAAAGCATCGACGCTTCCCCCCGCACCCGTCTTGCCGCTGACCTGCGAA
CCCGCCACACTGCGCGAAGTCCGTAATACCCGGCGCCGCATGATTGTCCCGGCAACACGCTGGCACGCCTTTAGCAACCG
GGCCGGCGAGTATGGCGTGACGCCGACAATGGCGCTGGCGACCTGTTTTTCTGCCGTGCTGGCTCGCTGGGGCGGCCTGA
CGCGTCTGCTGCTTAACATCACCTTATTCGACCGCCAGCCGCTGCACCCGTCGGTTGGCGCGATGCTTGCCGATTTCACC
AATATTCTTCTGCTGGATACCGCCTGCGATGGCGATACCGTCAGCAACCTGGCGCGTAAAAACCAGCTCACGTTTACGGA
GGACTGGGAGCATCGCCACTGGTCCGGCGTCGAATTACTCCGTGAACTCAAACGCCAGCAGCGCTACCCCCACGGTGCCC
CGGTGGTATTTACCAGCAATCTGGGACGTTCCCTCTACAGCAGCCGCGCAGAATCGCCGTTGGGCGAGCCGGAATGGGGC
ATCTCGCAAACGCCGCAGGTCTGGATAGATCATCTGGCGTTCGAGCATCACGGCGAGGTCTGGCTGCAATGGGACAGCAA
CGACGCGCTGTTCCCTCCGGCGTTAGTCGAAACGTTGTTCGACGCCTACTGCCAGTTGATTAACCAACTCTGCGATGACG
AAAGCGCCTGGCAAAAGCCGTTTGCAGATATGATGCCCGCCCGCCAGCGCGCAATACGCGAACGGGTCAACGCCACCGGT
GCCCCCATTCCCGAAGGCTTGCTGCATGAAGGCATTTTCCGTATCGCTCTGCAACAGCCGCAGGCGCTGGCGGTAACGGA
CATGCGTTATCAGTGGAATTATTATGAGCTGACAGACTATGCCCGCCGTTGCGCGGGCAGGTTAATCGAGTGCGGGGTTC
AGCCCGGCGATAATGTGGCTATCACGATGTCGAAAGGCGCAGGACAACTTGTTGCGGTTCTGGCCGTCCTGCTGGCCGGG
GCGGTTTACGTTCCGGTTTCGCTGGACCAGCCTGCCGCACGGCGCGAGAAAATCTACGCTGACGCCAGCGTCCGGCTGGT
ACTCATTTGCCAGCACGACGCCAGCGCCTGGTCAGACGATATTCCCGTCCTTGCCTGGCAGCAGGCCATTGAGGCGGAGC
CGATCGCCAACCCGGTGGTACGCGCCCCCACGCAACCGGCCTACATTATCTACACCTCCGGCTCTACCGGCACGCCGAAA
GGGGTGGTCATTTCTCACCGGGGAGCGCTCAACACCTGTTGCGATATCAATACCCGCTATCAGGTTGGCCCCGGTGACAG
GGTGCTGGCCCTCTCCGCCCTGCATTTTGATTTATCGGTTTACGACATTTTTGGCGTACTGCGCGCGGGCGGCGTGCTGG
TGATGGTGATGGAAAATCAACGGCGCGATCCTCATGCATGGTGTGAGCTGATCCAGCGCCATCAGGTCACGCTCTGGAAC
AGCGTCCCGGCGCTGTTCGATATGCTGCTGACCTGGTGTGAAGGTTTCGCCGACGCCACGCCGGAAAACCTGCGTGCAGT
GATGCTTTCCGGCGACTGGATCGGGCTTGACCTCCCCGCCCGTTATCGGGCCTTCCGGCCACAAGGACAATTTATCGCGA
TGGGCGGCGCCACCGAGGCGTCTATCTGGTCTAACGCCTGTGAAATTCACGACGTCCCCGCCCACTGGCGTTCCATCCCT
TACGGTTTCCCGCTAACCAACCAACGCTACCGGGTGGTGGATGAATGGGGCCGGGACTGCCCTGACTGGGTACCGGGCGA
ATTATGGATTGGCGGCATCGGGGTCGCGGAAGGCTATTTCAACGATCCCCTGCGCAGCGAGCAGCAATTTTTGACGCACC
CGGACGAGCGCTGGTATCGCACCGGCGATCTCGGCTGCTACTGGCCAGACGGCACAATAGAGTTCCTCGGTCGTCGCGAC
AAGCAGGTCAAAGTCGGAGGATATCGCATCGAGCTGGGCGAAATCGAAAGCGCACTCAGCCAGTTGGCGGGGGTGAAACA
AGCAACCGTTCTGGCGATCGGCGAAAAAGAAAAAACGCTGGCGGCATACGTGGTTCCTCAGGGCGAGGCTTTTTGCGTTA
CCGATCATCGGGACCCGGCACTGCCGCAGGCGTGGCACACGCTTGCGGGAACGTTGCCCTGTTGTGCTATCTCGCCAGAG
ATCTCCGCAGAACAGGTAGCCGATTTCCTTCAGCATCGCCTGTTAAAACTGAAGCCGGGTCACACCGCTGGCGCCGATCC
TCTCCCCCTGATGAACTCACTCGCTATCCAGCCGCGCTGGCAGGCCGTGGTGGAACGCTGGTTAGCATTTCTGGTGACGC
AACGGCGACTGAAGCCCGCTGCTGAAGGTTATCAGGTCTGCGCTGGTGAAGAACGCAAGGATGAGCACCCGAATTTCAGC
GGACATGATTTAACGTTATCGCAAATTCTTCGCGGCACCCGCGACGAACTGTCGTTACTGAACGACGCGCAGTGGTCGCC
GGAAAGCCTGGCCTTCAACCATCCGGCCAGCGCCCCATATATTCAGGAACTGGCGACAATTTGCCAACAGCTTGCACAGA
GCTTACAGCGCCCGGTACGCCTGCTTGAGGTGGGAACCCGCACCGGCCGCGCCGCAGAATCGCTGTTAGTACAGCTCAAC
GCCGGACAGGTTGAGTATGTCGGGCTTGAGCAGAGCCAGGAGATGCTCCTGAGCGCCCGGCAGAGGCTCGTCCCCCGGCT
TGGTGCCCGTCTGTCCCCCTGGAATGCAGACACGCTGGCGGTGCACGCTCACTCGGCGGACATTATCTGGCTCAATAACG
CCCTGCATCGTCTGCTGCCGGAAGATCCCGGGCTCCTTGCGACATTACAACAGCTTGCCGTTCCCGGCGCGCTGCTCTAC
GTGATGGAGTTTCGCCAGTTAACGCCGTCCGCCTTGCTCAGCACGCTCCTGTTAACCGATGGACAGCCGGAGGCCCTGCT
GCATAACAGCGCCGACTGGGCGGCGTTATTTAGCGCAGCCGCCTTCAACTGTCAGCATGGCGATGAGGTCGCGGGGTTAC
AACGCTTCCTCGTACAATGTCCCGATAGCCAGGTGCGCCGCGATCCCCGTCAACTTCAGGCCGCCCTCGCCGGACGCCTG
CCGGGGTGGATGGTGCCGCAACGGATCGTCTTCCTCGACGCCTTACCGCTGACGGCTAACGGGAAGATTGACTATCAGGC
GCTGAAGCGTCGTCATACTCCTGAAGCGGAAAACCAGGCCGAAGCGGATTTACCCCAGGGCGACACTGAAAAACAGGTTG
CCGCCCTCTGGCAGCAACTCTTATCGACTGGCAATGTCACCAGAGAAACCGACTTCTTCCAGCAAGGCGGCGATAGCCTG
CTGGCGACCCGTCTGACCGGACAACTTCATCAGGCAGGTTATGAAGCGCAATTAAGCGACCTGTTTAATCATCCCCGGCT
GGCGGATTTTGCCGCCACGCTGCGTAAAATCGACGTCCCGGTCGAACAACCATTCGTCCACTCTCCTGAAGATCGCTACC
AGCCCTTTGCGCTTACCGACGTGCAGCAGGCTTACCTGGTGGGGTGTCAGCCGGGCTTTGCCCTGGGCGGCGTCGGCTCA
CATTTCTTTGTTGAATTTGAAATTGCCGATCTGGACCTCACCCGGCTGGAGACGGTCTGGAACCGATTAATCGCCCGCCA
CGATATGCTCCGCGCCGTCGTGCGTGATGGACAGCAACAGGTGCTAGAACAGACGCCCCACTGGGTGATACCCGCACACA
CCCTCCATACGCCTGAAGAGGCGTTGCGGGCGCGCGAAAAACTGGCACATCAGGTACTCAACCCCGAAGTGTGGCCGGTA
TTCGATCTCCAGGTCGGATACGTGAACGGGATGCCCGCCCGCCTGTGGCTGTGTCTGGATAACCTGTTGCTTGACGGTCT
GAGCATGCAGATCCTGCTGGCGGAGCTGGAGCACGGCTACCGCTACCCGCAGCAGTTGCCTCCGCCGCTGCCCGTCACCT
TCAGGGATTATCTGCAACAACCCTCGCTACAGTCGCCCAATCCAGATTCTCTGGCATGGTGGCAGGCACAGCTTGATGAT
ATTCCTCCGGCGCCTGCGTTGCCGCTGCGCTGCTTGCCTCAGGAGGTTGAAACACCGCGCTTCACCCGCCTGAACGGCGC
GCTGGACAGCACGCGCTGGCATCGTCTGAAAAAACGGGCGGCTGACGCCCATCTCACCCCGTCGGCCGTACTGTTGTCGG
TGTGGTCAACGGTTCTCTCTGCATGGAGTGCACAGCCTGAGTTCACGCTTAACCTTACGCTTTTCGACAGGCGGCCGCTG
CACCCGCAAATCAACCAGATTCTGGGCGATTTCACCTCGCTGATGCTGCTGAGCTGGCATCCCGGCGAAAGCTGGCTGCA
CAGCGCGCAGTCACTACAGCAGCGGCTGAGCCAGAACCTCAACCACCGCGATGTGTCGGCCATCCGCGTGATGCGTCAAC
TGGCGCAACGGCAAAACGTGCCCGCCATTCCGATGCCCGTCGTCTTTACCAGCGCGCTGGGCTTTGAGCAGGATAACTTC
CTCGCCCGGCGTAATCTGCTCAAACCGGTCTGGGGCATCTCCCAGACGCCGCAGGTCTGGCTCGATCACCAGGTTTATGA
ATCCGAAGGCGAACTGCGCTTTAACTGGGATTTTGTCGCCGCGCTGTTTCCTGCCGGGCAGGTGGAGCGCCAGTTTGAAC
AGTATTGCGCATTGCTAAACCGAATGGCCGAGGATGAAAGCGGCTGGCAACTGCCGCTCGCCGCGCTGGTGCCTCCCGTT
AAATTCGCAGGGCAATGCGCAGAGCGCTCACCGCGCGTATGCCCTGAGCACTCTCAGCCACACATTGCGGCGGACGAGAG
CACCGTCAGCCTGATTTGCGACGCCTTCCGCGAGGTGGTTGGCGAGTCTGTCACGCCCGCAGAAAACTTCTTTGAGGCGG
GCGCAACGTCACTGAATCTGGTGCAACTGCACATTTTGTTACAACGTCACGAATTTTCCACCCTGACGTTGCTTGACCTC
TTCACCCATCCTTCTCCTGCCGCCCTGGCCGATTATCTGGCCGGCGTCGTCACGGTGGAGAAAACAAAACATCCTCGCCC
TGTTCGCCGTCGTCAGCGGCGGATATAG

Protein sequence :
MISGAPSQDSLLPDNRHAADYQQLRERLIQELNLTPQQLHEESNLIQAGLDSIRLMRWLHWFRKNGYRLTLRELYAAPTL
AAWNQLILSRSPENAEEETPPDESSWPNMTESTPFPLTPVQHAYLTGRMPGQTLGGVGCHLYQEFEGHCLTASQLEQAIT
ALLQRHPMLHIAFRPDGQQIWLPQPYWNGVTVHDLRHNDAESRQPYLEALRQRLSHRLLRVEIGETFDFQLTLLPDNRHR
LHVNIDLLIMDASSFTLFFDELNALLAGESLSAIDTRYDFRSYLLHQQKINQPLRDNARAYWLAKASTLPPAPVLPLTCE
PATLREVRNTRRRMIVPATRWHAFSNRAGEYGVTPTMALATCFSAVLARWGGLTRLLLNITLFDRQPLHPSVGAMLADFT
NILLLDTACDGDTVSNLARKNQLTFTEDWEHRHWSGVELLRELKRQQRYPHGAPVVFTSNLGRSLYSSRAESPLGEPEWG
ISQTPQVWIDHLAFEHHGEVWLQWDSNDALFPPALVETLFDAYCQLINQLCDDESAWQKPFADMMPARQRAIRERVNATG
APIPEGLLHEGIFRIALQQPQALAVTDMRYQWNYYELTDYARRCAGRLIECGVQPGDNVAITMSKGAGQLVAVLAVLLAG
AVYVPVSLDQPAARREKIYADASVRLVLICQHDASAWSDDIPVLAWQQAIEAEPIANPVVRAPTQPAYIIYTSGSTGTPK
GVVISHRGALNTCCDINTRYQVGPGDRVLALSALHFDLSVYDIFGVLRAGGVLVMVMENQRRDPHAWCELIQRHQVTLWN
SVPALFDMLLTWCEGFADATPENLRAVMLSGDWIGLDLPARYRAFRPQGQFIAMGGATEASIWSNACEIHDVPAHWRSIP
YGFPLTNQRYRVVDEWGRDCPDWVPGELWIGGIGVAEGYFNDPLRSEQQFLTHPDERWYRTGDLGCYWPDGTIEFLGRRD
KQVKVGGYRIELGEIESALSQLAGVKQATVLAIGEKEKTLAAYVVPQGEAFCVTDHRDPALPQAWHTLAGTLPCCAISPE
ISAEQVADFLQHRLLKLKPGHTAGADPLPLMNSLAIQPRWQAVVERWLAFLVTQRRLKPAAEGYQVCAGEERKDEHPNFS
GHDLTLSQILRGTRDELSLLNDAQWSPESLAFNHPASAPYIQELATICQQLAQSLQRPVRLLEVGTRTGRAAESLLVQLN
AGQVEYVGLEQSQEMLLSARQRLVPRLGARLSPWNADTLAVHAHSADIIWLNNALHRLLPEDPGLLATLQQLAVPGALLY
VMEFRQLTPSALLSTLLLTDGQPEALLHNSADWAALFSAAAFNCQHGDEVAGLQRFLVQCPDSQVRRDPRQLQAALAGRL
PGWMVPQRIVFLDALPLTANGKIDYQALKRRHTPEAENQAEADLPQGDTEKQVAALWQQLLSTGNVTRETDFFQQGGDSL
LATRLTGQLHQAGYEAQLSDLFNHPRLADFAATLRKIDVPVEQPFVHSPEDRYQPFALTDVQQAYLVGCQPGFALGGVGS
HFFVEFEIADLDLTRLETVWNRLIARHDMLRAVVRDGQQQVLEQTPHWVIPAHTLHTPEEALRAREKLAHQVLNPEVWPV
FDLQVGYVNGMPARLWLCLDNLLLDGLSMQILLAELEHGYRYPQQLPPPLPVTFRDYLQQPSLQSPNPDSLAWWQAQLDD
IPPAPALPLRCLPQEVETPRFTRLNGALDSTRWHRLKKRAADAHLTPSAVLLSVWSTVLSAWSAQPEFTLNLTLFDRRPL
HPQINQILGDFTSLMLLSWHPGESWLHSAQSLQQRLSQNLNHRDVSAIRVMRQLAQRQNVPAIPMPVVFTSALGFEQDNF
LARRNLLKPVWGISQTPQVWLDHQVYESEGELRFNWDFVAALFPAGQVERQFEQYCALLNRMAEDESGWQLPLAALVPPV
KFAGQCAERSPRVCPEHSQPHIAADESTVSLICDAFREVVGESVTPAENFFEAGATSLNLVQLHILLQRHEFSTLTLLDL
FTHPSPAALADYLAGVVTVEKTKHPRPVRRRQRRI

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
irp2 YP_070124.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 98
irp2 YP_002346902.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 98
irp2 NP_669706.1 HMWP2 nonribosomal peptide synthetase Virulence HPI Protein 0.0 98
irp2 NP_993007.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 98
irp2 CAA21390.1 - Virulence HPI Protein 0.0 98
irp2 YP_853075.1 yersiniabactin biosynthetic protein Virulence PAI IV APEC-O1 Protein 0.0 98
irp2 YP_001006815.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 97
PMI2599 YP_002152317.1 non-ribosomal peptide synthase Not tested Not named Protein 0.0 41