Gene Information

Name : clpV1 (NIDE1985)
Accession : YP_003797634.1
Strain : Candidatus Nitrospira defluvii
Genome accession: NC_014355
Putative virulence/resistance : Virulence
Product : type VI secretion AtPase, ClpV1 family
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 1897676 - 1900405 bp
Length : 2730 bp
Strand : +
Note : Evidence 2a : Function of homologous gene experimentally demonstrated in an other organism; PubMedId : 10982797, 11580258, 12624113, 12805357, 14567920, 14738756, 17201069, 2066329, 2185473, 8376377, 9927482, 19284603; Product type e : enzyme

DNA sequence :
ATGAATATCAATTTGAAAGGACTCATCGGCAAGCTCGATGACACCTGTCGTCGCGCGTTGGAGGGCGCGGCAGGGTTGTG
TCTGAGCCGGACCCATTATGACGTGGATATCGAGCATCTCCTCAGTAAATTGCTCGAAATGTCGAATACGGATATCCACA
AGATTTGTCACCACTACGAGATCGATCATTCGCGGTTCGCTCGAGACCTCACACGATCACTCGACCGCTTCAAGACCGGC
AATGCGAGGACGCCGACCCTGGCGCCTCAGGTCCCGCGCCTCATCAATGAAGCCTGGTCATTGGCCTCCATTGAGTACGG
AGTGAATCGTGTGCGGTCCGGCCACCTAATGCTGGCCTTGCTCGTCGAAGCCGATTTTGCCCGCATGATGCGCGAGCTGT
CGCCCGAGACGCAGAAGATCTCAGCCGAAGATCTACACCGACGGTTGGGGGAGATCGTCTCGGGTTCAGCGGAGGATCTG
GAAGATGTTCAGACGGGAGGCGTCGAGCAGCCGACGACGCAGGGGGCTCCTCATGCTTCCAGGACCCCGGCCCTTGATCA
ATTCACCATCGATCTGACAGCGAGGGCCAAACAAGGGCAGATCGATCCGGTGCTGGGGCGGGATGCCGAGATCCGGCAGG
TCGTGGATATCTTGACGCGCCGCCGGCAGAACAATCCCATTCTGACCGGAGAGGCCGGAGTGGGAAAAACCGCCGTCGTG
GAAGGATTGGCGCTCAGAATCGTCACGGGCGATGTCCCCCCGCCGTTACAAAACGTCACCCTGCGGGTGCTCGACCTTGC
GTTGCTCCAGGCCGGAGCGGGAGTAAAGGGGGAATTCGAAAATCGCCTCAAGTCGGTCATTAACGAAGTCAAAGCCTCCC
CGAAACCTATTATTACCTTTATCGATGAGGCGCACACCTTGATCGGGGCGGGCGGAGCAGCGGGCCAGGGCGATGCCGCC
AACATTCTCAAGCCGGCGCTGGCTCGGGGAGAGTTGCGGACGATTGCCGCCACCACCTGGGCCGAATACAAGAAATATTT
CGAGAAAGATCCGGCACTCACCAGACGGTTTCAGGTGGTGAAGGTGGAAGAGCCGACCGAATCCGTCGCGATCAACATGA
TGCGAGGATTAGCCGGGACGCTGGAAGAACATCACAAGGTGCGGTTGTTGGATGAGGCGATCGAGGCTGCGGTGAGACTG
TCCCATCGATATATTTCCGGCCGGCAACTGCCGGACAAAGCCGTCAGCGTGCTGGACACCGCCTGTGCCCGAGTGGCCCT
GGGACAGGTCGCAGAGCCGCCGGCGCTGGAAGACGCGCGCCGGCGAATCTCCCTCATCAATACCGAGGTCGACATTCTGG
AGCGGGAGGCCATCACGGGTGGAAGTCATCAGGAGCGTCTGGCCGAATTGGGCAGGGAGAAAGTCGCAGAAGAACAGCGT
CTGGCCGAGCTCAGGACCAGGTGGGAGCAGGAAAAAAAGGTGGTTGGAGAGATCGCAGGGATCCGGAAAAAGTTGGAACA
GCATGCCGTGCCACCAAAGGCTCCGGACAAGCCAACCGACAAGGTTCCGGACAAGGCAACCAAAGCGCCTCCCGACAAGG
CGATTGAAACGCCGCCCGCCAAGCTGACCCCTCAGGAAATCGAGACGTTGCAAACAGACTTGGCGAGGCTGAATAAAGAA
CTGACCGTTCTCCAGGGAGAGACCCCGCTCCTGCAGCCCTGTGTGGACGGGCAGGCCATCGCTCAGGTGATCTCGGCCTG
GACCGGCATTCCTATCGGACGGATGGTCAAAGACGAGATCAATACCGTGTTGACGCTCAAGGAGCATTTAGAAAAACGTG
TGGTAGGACAATCGCATGCCCTGGATGCGCTGAGCCAACGCCTGCGAACGGCACGGGCAAAATTGGAGGATCCTCGCCGG
CCGATCGGGGTGTTCATGTTTGTCGGACCAAGCGGGGTGGGGAAGACGGAAACGGGTCTGGCGCTCGCCGAGTTGCTGTT
CGGCAGCGATCAAAACATCACCACCCTGAATATGTCCGAGTTCAAGGAAGAGCACAAAGTTTCGCTGTTGATGGGCTCAC
CTCCGGGCTATGTGGGCTATGGCGAGGGAGGGGTCTTGACGGAAGCGGTACGCCGAAAACCGTATAGCGTGATCCTGCTG
GACGAGATGGAAAAAGCCCATCCGGGTGTGCAGGACATTTTCTATCAGGTGTTCGACAAGGGCATGATGAAAGATGGCGA
GGGGCGGGATATCGATTTCAAAAACACCGTGATCATCATGACCTCCAACGCGGGGACGGATACGTTCATGAAGCTCTGTG
CCGATCCGGAGACCAGGCCTGATCCAGAGGCCCTAGCCGATGCGATTCGCCCTGATCTCTTGAAATACTTCAAGCCGGCC
TTTCTGGGACGACTCATCGTGGTGCCATACTATCCGATTTCTCCCGATATCATGCGGCGCATCATCGAATTGCAGCTCAG
CCGGGTCCGCAGCCGCATCAAGGAGAACCACCGTGCGGTGATGTCTTACGACGAAGCCCTGATCACGGCGATTGCGGACC
GCTGCACCGAAGTGGAGAGCGGCGCGCGGAACGTGGATCACATTCTGACGCGGACGTTGTTGCCGGAACTCTCGACGGAA
TTTCTTTCACGGATGGCAGCAGGGGAATTGGTCAACAAGGTCCATATCTCGGTCGAGCCTGGCGGGAACTTCCGTTACGA
GGTGGCGTGA

Protein sequence :
MNINLKGLIGKLDDTCRRALEGAAGLCLSRTHYDVDIEHLLSKLLEMSNTDIHKICHHYEIDHSRFARDLTRSLDRFKTG
NARTPTLAPQVPRLINEAWSLASIEYGVNRVRSGHLMLALLVEADFARMMRELSPETQKISAEDLHRRLGEIVSGSAEDL
EDVQTGGVEQPTTQGAPHASRTPALDQFTIDLTARAKQGQIDPVLGRDAEIRQVVDILTRRRQNNPILTGEAGVGKTAVV
EGLALRIVTGDVPPPLQNVTLRVLDLALLQAGAGVKGEFENRLKSVINEVKASPKPIITFIDEAHTLIGAGGAAGQGDAA
NILKPALARGELRTIAATTWAEYKKYFEKDPALTRRFQVVKVEEPTESVAINMMRGLAGTLEEHHKVRLLDEAIEAAVRL
SHRYISGRQLPDKAVSVLDTACARVALGQVAEPPALEDARRRISLINTEVDILEREAITGGSHQERLAELGREKVAEEQR
LAELRTRWEQEKKVVGEIAGIRKKLEQHAVPPKAPDKPTDKVPDKATKAPPDKAIETPPAKLTPQEIETLQTDLARLNKE
LTVLQGETPLLQPCVDGQAIAQVISAWTGIPIGRMVKDEINTVLTLKEHLEKRVVGQSHALDALSQRLRTARAKLEDPRR
PIGVFMFVGPSGVGKTETGLALAELLFGSDQNITTLNMSEFKEEHKVSLLMGSPPGYVGYGEGGVLTEAVRRKPYSVILL
DEMEKAHPGVQDIFYQVFDKGMMKDGEGRDIDFKNTVIIMTSNAGTDTFMKLCADPETRPDPEALADAIRPDLLKYFKPA
FLGRLIVVPYYPISPDIMRRIIELQLSRVRSRIKENHRAVMSYDEALITAIADRCTEVESGARNVDHILTRTLLPELSTE
FLSRMAAGELVNKVHISVEPGGNFRYEVA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 1e-167 44
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 8e-143 42
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 1e-142 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clpV1 YP_003797634.1 type VI secretion AtPase, ClpV1 family VFG2076 Protein 0.0 52
clpV1 YP_003797634.1 type VI secretion AtPase, ClpV1 family VFG2084 Protein 2e-155 44