Gene Information

Name : Glov_2197 (Glov_2197)
Accession : YP_001952433.1
Strain : Geobacter lovleyi SZ
Genome accession: NC_010814
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 2342419 - 2345418 bp
Length : 3000 bp
Strand : -
Note : PFAM: type III restriction protein res subunit; protein of unknown function DUF450; SMART: DEAD-like helicases; KEGG: mma:MM_2976 type I restriction-modification system restriction subunit

DNA sequence :
ATGACCACTGATACCAGTGAGCGTGGCCTGGAGGATCTGATCTGCACCACCATGACCGGCCGAACCGCAGTTGTTGCGGC
TGGTGGTGTGCATGATCCGGTTGAGCCGTTTGGCGGTACCGGCTGGTTGTTGGGTGATGCCCGTGACTATGATCGGGAAT
ACTGCGTTGATGTGACACAGTTGCATGCCTTTGTAGAGGCAACCCAACCGGAGATCGCCACTGCTCTTGAACTGGAGAAG
GACACGCCGGTACGCCGCCGGTTTCTGGCCCGTGTAGAGTCTGAATGCGCCAGGCGTGGGGTGATTGACCTGCTGCGCAA
AGGGGTAAAGCACGAGAAGTACCATATCGACCTCTTCTATGGCAGCCCATCCCCTGGTAATGACAAGGCAGCCCAGTTGT
ACATTGCCAACCGTTTCAGCCTCACCCGACAACTCCGCTACAGTCGTGATGAAACCCGCCGGGCATTGGATCTTGCTCTG
TTCATCAACGGCCTGCCCATTGCCACCTTTGAACTGAAGAACAGCCTGACCAAGCAGACGGTGGAGGACGCCATTGAGCA
GTACCGCCGGGACCGCGATCCTCGGGAGCGGCTGTTCAGCTTCGGTCGCTGCATCGTCCATTTTGCGGTGGATGACCGTG
AGGTACAGATGTGTACCGAGCTTAAGGGAAAGGGTTCATGGTTCCTGCCGTTCAACCTGGGCTGGAATGACGGTGCCGGG
AATCCGCCCAATCCGCACGGCCTCAAGACCGACTACCTCTGGAAACGGGTTCTTACCCTTCAGGGCGTTACTGACATCAT
TGAGAACTATGCCCAGATTGTTGAAGAGATAAACCCCAAAACCCGTAAGAAATCCCGCAAACAGGTGTTTCCCCGGTATC
ACCAGTTGGATGTGGTGCGGCGGTTGCTGGCCGATGCGCTGAAGCAAGGCGTGGGGCAGCGTTATCTGATCCAACATTCG
GCTGGCAGCGGCAAGTCCAACTCCATCGCCTGGCTGGCCCATCAGTTGGTCGCTTTGCGCAAGGACGGGAAGGATCTCTT
TGACTCAATCATCGTCATTACTGACCGCCGCATTCTCGATGATCAGATCAAAAACACCATCAAGGGCTTCATGCAGGTTG
GCTCCACCGTGGGGCACGCAGAGCACTCCGGCGACCTGCGGAAGTTCATAGAATCCGGCAAGAAGATCATCATCAGCACG
GTGCAAAAGTTCCCCTATATCCTGAATGAGATCGGTGATGAGCACCGGGGGCGGAGCTTTGCCATCATCATCGATGAAGC
ACACTCCAGCCAGGGGGGGAAGGCTGCCGGGGCATTGAACGCAGCCCTGACCGACCCTGAAGACGAGATCAACGATGTAC
TGGAAAAACGGATGGCGGCCCGGAAGATGCTTACCAACGCCAGCTACTTCGCCTTCACCGCCACCCCCAAGAACAAGACG
CTGGAGATATTTGGTGAGCCGTATCCGGGCGAAGAGGGGAAGGTCAAGCATCGCCCATTCCACTGTTACACCATGAAACA
GGCGATTCAGGAGGGATTCATCCTGGATGTGCTCAAAAGCTATACCCCGGTTAACAGCTATTACAAGTTGATCAAGAAGG
TTAAAGACGACCCCGAGTTTGACAAGAACAAGGCCCAGAAAAAGCTGCGCCGGTATGTTGAAAGCCATGATCACGCCATT
CGGCTGAAAGCAGAGATCATGGTGGATCACTTCCGCGAGCAGGTTATTGCCAAGGGTAAGATCGGTGGGCAGGCCCGTGT
CATGGTGGTCTGCAACGGTATAGAGCGCGCTATCCAGTATTTCCATGCCATTAAGTCCTACCTGGAGGAGCAGAAAAGTC
CATACCAGGCCATTGTTGCGTTTTCCGGTGAGTATGATTTTCGAGGTCAGAAGGTTACTGAGGCATCCCTGAATGGCTTC
CCCAGTGGAGACATCGCAGACAAAATTCAGGAAGACCCGTACCGCATTCTGGTTTGCGCCGATAAGTTTCAAACCGGCTA
TGATGAGCCGCTGCTACATACCATGTATGTGGATAAGACCCTCTCCGGCATCAAGGCAGTGCAGACCCTGTCGCGGTTGA
ATCGTGCCCATCCTCGCAAACACGACGTGTTTGTGCTGGATTTTATGAATGACGCTGACACGATTCAGGCCGCCTTTGCC
GATTATTACCGCACTACGGTCTTGAGTGAAGAGACTGACCCTAACAAGCTTCATGACCTGAAGGCGGTACTGGACAACTA
TCACGTCTACCGTGCGGCAGTTGTGGATGAGCTGGTAGGGCTGTATCTTGGTGGCGCTCAACGTGACAAGTTAGACCCAC
TGTTGGACGCCTGTGTGGCGGTTTATATTGAGGAGTTGCATGAGGATGCCCAGATTGATTTCAAGGGCAAGGCCAAGGCG
TTTACACGCACCTATGAATTCCTGTCTTCTATTTTGCCTTATAGCGATGTTGATTGGGAAAAACTCTCCATCTTCCTGTC
CTTTTTGATACCAAAACTGCCTGCCCCCCAAGAGGAGGATCTCTCCAAGGGGATACTGGAATCCATTGATATGGATAGCT
ATCGTACCGAAAAACAGGCTGTTATGAACATCATCCTTGCAGATGAAGAAGCAGAAATCGATCCAATACCGGTAGGCGGT
GGTGGTGGAAAACCGCAGCCTGATATGGATAAGCTCACCAATATCCTTAAATCGTTCAACGAACAGTTCGGTACCCTGTT
TAGCGATGTTGATCGTGTTGCAAAGCGTATCCAGGATGATGTCGCTCCTAAAGTGGCAGCTGATCAGGCTTACCGGAACG
CCAAACAGAATACCCCGAATGCTGCCCGCCTGGAGCATGACAAGGCGTTGGCGCGGGTCATGCTGTCACTGCTCAAGGAT
GATACCGAGGCCTACAAGCAGTTTGTTGAGAATGAATCTTTTAAGCGGAGTGTCTCAGATATGGTGTTTGCCATGACGAA
TGTAGCTTGTGAACCACCGAAACCGGTTGCACCGGCATGA

Protein sequence :
MTTDTSERGLEDLICTTMTGRTAVVAAGGVHDPVEPFGGTGWLLGDARDYDREYCVDVTQLHAFVEATQPEIATALELEK
DTPVRRRFLARVESECARRGVIDLLRKGVKHEKYHIDLFYGSPSPGNDKAAQLYIANRFSLTRQLRYSRDETRRALDLAL
FINGLPIATFELKNSLTKQTVEDAIEQYRRDRDPRERLFSFGRCIVHFAVDDREVQMCTELKGKGSWFLPFNLGWNDGAG
NPPNPHGLKTDYLWKRVLTLQGVTDIIENYAQIVEEINPKTRKKSRKQVFPRYHQLDVVRRLLADALKQGVGQRYLIQHS
AGSGKSNSIAWLAHQLVALRKDGKDLFDSIIVITDRRILDDQIKNTIKGFMQVGSTVGHAEHSGDLRKFIESGKKIIIST
VQKFPYILNEIGDEHRGRSFAIIIDEAHSSQGGKAAGALNAALTDPEDEINDVLEKRMAARKMLTNASYFAFTATPKNKT
LEIFGEPYPGEEGKVKHRPFHCYTMKQAIQEGFILDVLKSYTPVNSYYKLIKKVKDDPEFDKNKAQKKLRRYVESHDHAI
RLKAEIMVDHFREQVIAKGKIGGQARVMVVCNGIERAIQYFHAIKSYLEEQKSPYQAIVAFSGEYDFRGQKVTEASLNGF
PSGDIADKIQEDPYRILVCADKFQTGYDEPLLHTMYVDKTLSGIKAVQTLSRLNRAHPRKHDVFVLDFMNDADTIQAAFA
DYYRTTVLSEETDPNKLHDLKAVLDNYHVYRAAVVDELVGLYLGGAQRDKLDPLLDACVAVYIEELHEDAQIDFKGKAKA
FTRTYEFLSSILPYSDVDWEKLSIFLSFLIPKLPAPQEEDLSKGILESIDMDSYRTEKQAVMNIILADEEAEIDPIPVGG
GGGKPQPDMDKLTNILKSFNEQFGTLFSDVDRVAKRIQDDVAPKVAADQAYRNAKQNTPNAARLEHDKALARVMLSLLKD
DTEAYKQFVENESFKRSVSDMVFAMTNVACEPPKPVAPA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
VC1765 NP_231400.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 4e-169 42
VC0395_A1363 YP_001217306.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 4e-169 42
VPI2_0013c ACA01830.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 1e-168 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Glov_2197 YP_001952433.1 hypothetical protein VFG1098 Protein 2e-169 42