Gene Information

Name : clp (HMPREF0573_10539)
Accession : YP_003718352.1
Strain : Mobiluncus curtisii ATCC 43063
Genome accession: NC_014246
Putative virulence/resistance : Virulence
Product : endopeptidase Clp
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 637645 - 640401 bp
Length : 2757 bp
Strand : -
Note : COG: COG0542; Pfam: PF02861,PF00004,PF07724; InterPro: IPR003959

DNA sequence :
ATGATATCTATGGATCAAAAATTTACGACTAAGTCTCAGGAGGCTCTGGCAACGGCACTGCGTACCGCGGGGGCGGCTGG
CAATGCCATGCTTGAACCGATACACCTGCTTTCTGCGCTACTGGAAGACCGTGAAGGAATTGCCTTCGAAGTGCTCAGCA
GTGTGGCTGATGCTGACGCTATTGGTCGGGAGGTACGCCGTGAACTGGCGTCTTTACCGGGGGCGACCGGTGAATCTGTG
GCGGAACCGCAGCCTTCCCAAGCTATGCTAAACGTCATTAGTGCAGCTGGTGCGAAAGCGCAAGAGCGGGGGGATGATTA
TGTCTCTACCGAACACTTACTGTTGGGGTTGGCTGAAAATGAGGGCAGAGTCGGCAAGATTCTGCGCGATGCGGGAGCTA
ACGACAAAAAACTGCGCCAAGCTATTAAAAAGATTCGCGGAGACGCCAAAGTGACCTCCCCGAACCCCGAAGCTACGTTT
AAAGCCCTGGAAAAATACGGCGATGACCTGACCCAGCGGGCTATGGATGGCAACCTTGACCCGGTAATTGGGCGCGACAG
CGAAATCCGGCGGGTCATTCAGGTGCTGAGTCGCCGCACCAAGAATAACCCGGTGTTGATTGGGGAGCCCGGCGTGGGCA
AGACCGCCGTGGTAGAAGGTCTAGCCCAACGTATCGTAGCGGGAGACGTGCCCGATTCCCTCAAGCACAAGCATCTCATT
GCTCTTGATTTGGCAGCGATGGTAGCGGGGGCGCAATATCGCGGTCAGTTCGAGGAACGGCTGAAGGCAGTGCTGGAAGA
AATCAAGAACGCTGAAGGTGAAGTGGTGACCTTCATTGACGAGCTTCACACCGTGGTCGGGGCGGGAGCCAGTGGCGGGT
CGATGGACGCCTCAAACATGCTGAAGCCCATGCTGGCGCGCGGAGAGCTGCGTCTAGTGGGGGCGACGACTTTGGACGAG
TACCGCGAGCACATTGAAAAGGATCCGGCTTTGGAACGGAGGTTCCAAACTGTATTCGTGGGGGAGCCCAGCGTGGAGGA
CACCGTCGCCATCTTGCGTGGCATAGCTCCGAAATATGAGGCTCATCACAAGGTGACTATTGCGGACGGGGCTTTAGTGG
CTGCCGCACAGCTTTCTAACCGTTATATTTCCGGTCGTCAGCTGCCAGATAAGGCTATCGATTTGATTGACGAAGCGGCG
TCTAGGCTGCGGATGGAGCTGGATTCCTCGCCGGAGGAGATTGACTCTCTGCGCCGAGAAGTGGAGCGGGTCAATATGGA
GTTGTCCTATTTGAACGCCTCCGACCCGAACCGGGAGGATCCGGCGACCGAGGGGCGCGTGGCGCAACTGCAAGCCAGTT
TGGATGAAAAGCAGGCGCAGCTAGATCGGCTAAACCTGCGGTGGGAAGCCGAAAAGGCGGGTCATAACCGAGTCGGCGAA
CTGCGCGTAAAGCTGGACGAACTGAATACGGCTCTGGAGCAAGCCATGCGCGAGGGGCGTTGGGAAGACGCAGGACGTCT
GCAAAATGGCGAAATTCCAGCGATTCAAGCCCAGATTACGGCAGCGGAAGGCGAGTCCTCGAACCAGTCGAGTGCGGCGG
GGCACGCTCCCACCGATTCCGCCATTTCTGGAACAAACAGTACAAAACCTGGTCATCCGGATGCTGGCGACTCGCAAGAC
GGTCCCATGATTGCTGAGAAAGTCGATGCTGCCGAGATTGCGGAAGTAGTCGCCTCTTGGACTGGAATCCCGGTGGGCAA
GCTGCTCCGGGGCGAATCTGAAAAGCTGCTGCACATGGAGGAATACCTGGGGCAACGCTTGATTGGGCAAAAGGATGCCG
TGCGCGCGGTGTCGAATGCGGTGCGTCGGGCGCGTGCAGGAGTCTCCGACCCGAATCGTCCGACAGGCTCTTTCCTGTTC
CTGGGACCGACTGGGGTAGGCAAGACCGAGCTGGCGAAGGCTCTGGCTGATTTCCTGTTTGATGATGAAAAGGCTTTGAC
CCGTATTGATATGAGTGAATATGGTGAGAAGCACTCGGTCGCGCGCCTGATTGGTGCCCCTCCGGGTTATGTCGGATACG
AAGAAGGCGGTCAGCTGACCGAAGCGGTGCGGCGGCGCCCTTATGGGGTGGTGTTGCTCGATGAGGTCGAAAAGGCGCAT
CCGGATGTTTACGACATTTTGCTGCAGGTGCTTGATGACGGTCGATTGACTGACGGTCAAGGGCGTACGGTTGATTTCCG
CAACGTCATTCTGGTGCTGACCTCGAACCTGGGGTCTCAGTTCCTGATTGATCGTGAGGCTGATCCGGCAGAGGCGCATC
GCCAAGTTTTGGATTTAGTGCGTGCCGCATTCAAACCGGAGTTCTTAAACCGTTTGGACGAAACCATCATGTTTGAGGCT
CTGTCTACCGAGGATTTGGAACAGATTGTCGACATCCAGATTCACCAGTTGACCACGCGCTTGGCGGAATCTCAGTTGAC
CCTCGAAGTCACCGAGTCGGCGCGATCTTGGCTGGCGGTCACGGGCTACGATCCGACTTACGGAGCGCGCCCGCTACGTC
GCTTGATTCAACGCGAGATTGGCGACCAGCTGGCAGAAAAGCTGCTGGCGGGGGACATTACGCCCGGTTCGACCGTGGTC
GTGGACATGCCCAATTTCAGTCCCACAGATTTAATAAAATCTGGAGACTTGAGTGAATTGACAGGACCAATGTCAGACCA
ATATCGACTGGAGCTAAGGGTAAAAAACGCCCAATAA

Protein sequence :
MISMDQKFTTKSQEALATALRTAGAAGNAMLEPIHLLSALLEDREGIAFEVLSSVADADAIGREVRRELASLPGATGESV
AEPQPSQAMLNVISAAGAKAQERGDDYVSTEHLLLGLAENEGRVGKILRDAGANDKKLRQAIKKIRGDAKVTSPNPEATF
KALEKYGDDLTQRAMDGNLDPVIGRDSEIRRVIQVLSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVAGDVPDSLKHKHLI
ALDLAAMVAGAQYRGQFEERLKAVLEEIKNAEGEVVTFIDELHTVVGAGASGGSMDASNMLKPMLARGELRLVGATTLDE
YREHIEKDPALERRFQTVFVGEPSVEDTVAILRGIAPKYEAHHKVTIADGALVAAAQLSNRYISGRQLPDKAIDLIDEAA
SRLRMELDSSPEEIDSLRREVERVNMELSYLNASDPNREDPATEGRVAQLQASLDEKQAQLDRLNLRWEAEKAGHNRVGE
LRVKLDELNTALEQAMREGRWEDAGRLQNGEIPAIQAQITAAEGESSNQSSAAGHAPTDSAISGTNSTKPGHPDAGDSQD
GPMIAEKVDAAEIAEVVASWTGIPVGKLLRGESEKLLHMEEYLGQRLIGQKDAVRAVSNAVRRARAGVSDPNRPTGSFLF
LGPTGVGKTELAKALADFLFDDEKALTRIDMSEYGEKHSVARLIGAPPGYVGYEEGGQLTEAVRRRPYGVVLLDEVEKAH
PDVYDILLQVLDDGRLTDGQGRTVDFRNVILVLTSNLGSQFLIDREADPAEAHRQVLDLVRAAFKPEFLNRLDETIMFEA
LSTEDLEQIVDIQIHQLTTRLAESQLTLEVTESARSWLAVTGYDPTYGARPLRRLIQREIGDQLAEKLLAGDITPGSTVV
VDMPNFSPTDLIKSGDLSELTGPMSDQYRLELRVKNAQ

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 3e-96 41
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 9e-107 41
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 7e-107 41

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clp YP_003718352.1 endopeptidase Clp VFG2076 Protein 1e-113 43
clp YP_003718352.1 endopeptidase Clp VFG2084 Protein 3e-103 41