Gene Information

Name : GM18_2921 (GM18_2921)
Accession : YP_004199641.1
Strain : Geobacter sp. M18
Genome accession: NC_014973
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 3483731 - 3486724 bp
Length : 2994 bp
Strand : -
Note : KEGG: dde:Dde_1859 DEAD/DEAH box helicase-like; PFAM: protein of unknown function DUF450; type III restriction protein res subunit; SMART: DEAD-like helicase

DNA sequence :
ATGAAACCGACCGACACCAGCGAAAAGGGCCTGGAATCCATCATCGTCGCCTCTCTCGTGGAGGAGGCCGGATATGTTCA
GGGCGACCCGCAGGACTATGACCGGGAACACGCCGTCGACCTGGCCAAACTGCTGCAGTTCCTCGCCGCCACCCAGCCCG
ACACCTATGAGGCTCTCGGCATCGATGACGAAGGCCCCAAGCGCACACAATTCCTGCACCGCCTGCAGGGCGAGATCGCC
AGGCGTGGCGTGGTGGACGTGCTGCGCGGCGGCATCAAGCACGGCCCGGCCCATGTGGACCTTTTCTACGGCACGCCGAC
GCCGGGCAACGTGAAGGCAGCCGAACGGTTCGCGGCCAACATCTTCAGCGTCACCCGCCAGCTCCGCTACAGCCGCGTTG
ATCCCGCGCTCTCCCTCGACATGGCCGTGTTCATCAACGGCCTGCCCATCGCCACCTTCGAACTCAAAAATAAGCTCACC
AAGCAGACGGTGCTCGATGCCGTGCAGCAGTACCAGCGCGACCGCGACCCGAAGGAGCTGCTGTTTCAGTTCGGCCGTTG
CGTCGTCCATTTTGCCGTGGACGATCACGAGGTGCGTTTCTGTACCCACCTCAAGGGCAAGGGCTCGTGGTTTCTGCCCT
TCGACAAAGGCTACAACGACGGCGCTGGCAATCCGCCGAACCCTCATGGGCTCGCCACCGATTACCTGTGGAAGGAGACC
CTCTCCAAAGATGGGTTGACAGATATCCTGGAAAACTACGCCCAGGTGGTGGAGGAAAAGGACGAGAAGACCGGCAAAAA
GAGGTACAAGCAGATTTTCCCTCGCTACCACCAGTTGAAGGTGGTGCGCATGCTGCTGGCCAATGCCGCTGAGAGTGGCA
TCGGCAGGCGCTACCTGATCCAGCACTCGGCGGGCAGCGGCAAAAGTAACTCCATCGCCTGGCTGGCGCATCAGCTCGTT
GGGCTGGAACACGAGAGCAAGGCGTTGTTCGATTCGGTCATTGTGGTCACCGACCGACGGGTGCTCGACAAGCAGATCCG
CGACACCATCAAGCAGTTCGCCCAGGTCTCCGCCACCGTCGGCCATGCCGAACACTCCGGCGACCTGCGCAAATTCCTCA
AGGCCGGGAAGAAAATCATCATCACCACCGTGCAGAAGTTCCCGTTCATACTCGATGAGATCGGCGACGAACACCGCCAG
AGCAAGTTCGCCATCATCATCGACGAGGCCCATTCCAGCCAGGGCGGCAAGACCACCGCCGCCATGAACCGTGTGCTGGA
AGAGACCGCGCCCTATGGCGGCTCTGATGACGAGGGGGAAGAGACAGTCGAGGACAAGATCAACAAGATCATGGAAGGCC
GGAAGATGGTGACCAACGCCAGCTACTTTGCCTTCACCGCGACTCCAAAAAATAAGACCCTGGAGATCTTCGGCGAGCCG
AACCCGCAGCCCGACGGCACCGTGAAGCACCACCCATTCCACAGCTACACCATGAAACAGGCCATCCAGGAGGGCTTCAT
CCTCGATGTGCTGAAGAACTACACCCCGGTGGAGAGTTATTACCGCCTGGCCAAGACAGTGGAGGACGATCCGCTCTTCG
ACGCCAACAAGGCCCAGAAAAAGCTGCGCCGCTATGTGGAGTCCCATGAGCACGCCATCCGCGAGAAGGCGGAGATCATG
GTGGACCACTTCCACGCCCAGGTGATCGGCCACCGCAAGATCGGCGGCCATGCCCGGGCCATGGTCATCACCAACGGTAT
CGAGCGGGCCATTCAGTATTTCCACGCCTTCAAGGACTACCTCAAGGAGCGCAAAAGCCCTTACGCACCCATCGTGGCCT
TTTCCGGGGAGCACGAGTATGGCGGCAAGAAGGTCACCGAAGCGACGCTCAACGGCTTCCCCAGCAGCCAGATCCCGGAC
AAGGTGCAACAGGGCCCGTACCGCTTCCTGATCGTCGCCGACAAGTTCCAGACCGGCTACGACGAACCGCTGCTGCACAC
CATGTACGTGGACAAGGCGCTTTCCGGTATCAAGGCGGTGCAGACCCTCTCGCGGCTCAATCGCGCCCACCCGCAGAAGC
ACGACACTTTTGTGCTCGATTTCTACAACGACTCGGAGACCATCCAGAAGTCGTTCGAGCCCTATTACCGCACCACCATC
CTCAGTGACGAGACCGACCCCAATAAGCTGCACGACCTGAAGTCGGATCTGGACGGCTACCAGGTCTATTCGCAGGCGCA
AATCGACGATCTGGTGGGGCTCTATCTGAATGGCGCAGACCGCGACAAGCTTGACCCGATCCTGGACGCCTGCGTGGCCA
CCTACAACGCCGATCTTGATGAGGACGGCCAGGTGGACTTCAAAGGCAAGGCCAAGGCCTTCGTCCGTACCTACGGCTTT
CTGTCCTCGATTCTGGCGTACTCGAATGCCGACTGGGAAAAGCTGTCGATTTTTCTGAATTTCCTGATCCCGAAACTCCC
AGCGCCCAAGGAAGAGGATCTCTCTCGGGGCATCCTGGAGGCCATCGACATGGACAGCTACCGTGTCGAGGTTAAAACCA
GCCTGAAGATCGGCCTTCCGGATCAGGATGCCGAAATCGGACCGGTGCCGACCAGCGGCGGTGGCCGCAAACCGGAACCG
GAGCTGGACCAACTGAGCAATATCATCAAGGCGTTTAACGACCAGTTCGGCAACATCGAGTGGAAAGACGGCGACAAGAT
CCGCAAGGTCATCGCCGAGGAGATCCCGGCCAAGGTCGCAGCGGACGCGGCCTATCAGAACGCCATGAAGAACAACGACA
AGAAGACCGCCCGGATCGAGCACGACGCAGCATTGCAGCGCGTCATGATCGACCTCTTGTCCGACCACACCGAGCTATTC
AAGCAGTTCAGCGACAACCCGTCTTTCAAGAAATGGCTGGGCGACACCATCTTTGGCGTGACCTATCAGCAACAGGCCGG
ACAATCGGCAACGGGGGCGAGCCATGAACGTTAA

Protein sequence :
MKPTDTSEKGLESIIVASLVEEAGYVQGDPQDYDREHAVDLAKLLQFLAATQPDTYEALGIDDEGPKRTQFLHRLQGEIA
RRGVVDVLRGGIKHGPAHVDLFYGTPTPGNVKAAERFAANIFSVTRQLRYSRVDPALSLDMAVFINGLPIATFELKNKLT
KQTVLDAVQQYQRDRDPKELLFQFGRCVVHFAVDDHEVRFCTHLKGKGSWFLPFDKGYNDGAGNPPNPHGLATDYLWKET
LSKDGLTDILENYAQVVEEKDEKTGKKRYKQIFPRYHQLKVVRMLLANAAESGIGRRYLIQHSAGSGKSNSIAWLAHQLV
GLEHESKALFDSVIVVTDRRVLDKQIRDTIKQFAQVSATVGHAEHSGDLRKFLKAGKKIIITTVQKFPFILDEIGDEHRQ
SKFAIIIDEAHSSQGGKTTAAMNRVLEETAPYGGSDDEGEETVEDKINKIMEGRKMVTNASYFAFTATPKNKTLEIFGEP
NPQPDGTVKHHPFHSYTMKQAIQEGFILDVLKNYTPVESYYRLAKTVEDDPLFDANKAQKKLRRYVESHEHAIREKAEIM
VDHFHAQVIGHRKIGGHARAMVITNGIERAIQYFHAFKDYLKERKSPYAPIVAFSGEHEYGGKKVTEATLNGFPSSQIPD
KVQQGPYRFLIVADKFQTGYDEPLLHTMYVDKALSGIKAVQTLSRLNRAHPQKHDTFVLDFYNDSETIQKSFEPYYRTTI
LSDETDPNKLHDLKSDLDGYQVYSQAQIDDLVGLYLNGADRDKLDPILDACVATYNADLDEDGQVDFKGKAKAFVRTYGF
LSSILAYSNADWEKLSIFLNFLIPKLPAPKEEDLSRGILEAIDMDSYRVEVKTSLKIGLPDQDAEIGPVPTSGGGRKPEP
ELDQLSNIIKAFNDQFGNIEWKDGDKIRKVIAEEIPAKVAADAAYQNAMKNNDKKTARIEHDAALQRVMIDLLSDHTELF
KQFSDNPSFKKWLGDTIFGVTYQQQAGQSATGASHER

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
VC1765 NP_231400.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 0.0 42
VC0395_A1363 YP_001217306.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 0.0 42
VPI2_0013c ACA01830.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 2e-180 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
GM18_2921 YP_004199641.1 hypothetical protein VFG1098 Protein 0.0 42