Gene Information

Name : EGYY_28870 (EGYY_28870)
Accession : YP_004712259.1
Strain : Eggerthella sp. YY7918
Genome accession : NC_015738
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 3004740 - 3007463 bp
Length : 2724 bp
Strand : -
Note : -
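
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below is not part of the database record; it assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the final codon of the coding sequence is a stop codon. The helper names `feature_length` and `protein_length` are hypothetical and introduced only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Coordinates taken from the Position field of this record.
start, end = 3004740, 3007463

length_bp = feature_length(start, end)
print(length_bp)                  # 2724, matching the Length field
print(protein_length(length_bp))  # 907 residues expected in the translation
```

The strand ("-") does not affect the length; it only indicates that the listed DNA sequence is the reverse complement of the genome coordinates.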

DNA sequence :
ATGAACATGGGCAACTTGAACAAATTAGCCCTCACCGCGCAAGAAGCGCTTCAGCAGACGATTACCATCGCTTCTGAAAA
AGAGGCCTCCCAGGCAGAGCCTATCCATATGTTGAAGGCGTTGCTTGAATCGAAAGAAAACAACCTCTCGGCCATCATCA
AGCGCATCGGCGCCGATCCCTTCCAGCTACAGGTTAACGTCGACGCTGAAATCGATCGTATGCCGAAGGTGAGCAGCAGC
GGCGGCATGATGATGAGTGGCGTGCCGGGTCCTGCACTTATGAACGTGATCGACAACGCGGTAAAGATTGCCGAGAAGCT
GGGCGATAGCTACGCGACAAGCGAGCATTTGCTCATCGCGTTGTCGGAAGACAAGGGTGCTGCGGGCAAAATTTTGTCGG
TGGCAGGCGTGACGCGCAAGACCATCGAAGCGGCCTACGACGAGCTGCGCGGCGATACGCGCGTGACCGATCCGCAAAAT
AAGGCCCAGTTCGAAGCGCTGGAGCAGTACGGTCAGAACCTGACGCAGCAAGCGCGCGAGGGCAAGCTGGACCCCGTCAT
CGGCCGTGCGGAAGAAATTCGCCGCACCATCCAGGTGCTTTCGCGCCGCACAAAGAACAACCCGGTACTTATCGGCGAGC
CGGGCACCGGCAAGACCGCCATCGTGGAAGGGCTGGCACAGCGTATCGTAGCGGGCGACGTTCCCTCGTCGCTGAGAGAT
CGTGATATTATCGCGCTCGACTTGCCGTCGATGATTGCTGGTGCGAAGTACCGCGGCGAATTCGAGGACCGCTTGAAGGC
GGTGCTGCGCGAGGTGAAGCAGAGCGAGGGTCAGATCATCCTGTTCATCGACGAGTTGCACACCATCGTGGGTGCGGGAT
CCACCGGCGATAGCTCCATGGATGCGGGCAACATGCTCAAGCCGGCTCTTGCGCGCGGCGAACTGCACGCCATCGGCGCA
ACGACGCTCGACGAATATCGCAAGTATGTGGAAAAAGATGCCGCACTCGAGCGACGTTTCCAACCGGTGCTCGTTACCGA
GCCTACGGTGGAGGACACCATTGCCATCCTGCGCGGACTTAAAGAAAAATATGAGATTCACCACGGCGTGCGTATCACCG
ACGCTGCCATCGTGGCCGCCGCGGAATTATCGAACCGCTACATTTCCGATCGATTCCTGCCCGACAAGGCCATCGATCTC
ATGGACGAAGCGGCCAGCCGCCTGCGCATCGAAATCGATTCGATGCCCGAAGAGGTGGATGCTGCCGAGCGCAAGCTTAC
GCAGATGCAGATTGAGGAGCAGGCTCTCATGAAGGAGAGCGACGACGCCTCCAAGGAGCGCCTGGAAGCGCTGCGACGTG
ATATTGCTGCTGCGCGGGAAGATTTGGACAAGCGCAAGGCTGAGTGGCAAAACGAGAAAGACGTTATCGAAAGCGTGCAA
CTTCTGAAGGGCGAGCTGGAAGGCGCCCAGATGGAAGAGGAGCGCGCCACCCGCGAAGGCGATCTGTCGAAAGCCTCCGA
GCTGCGCTATGCGCGCATTCCCGAATTGCAGCGCCGCCTGCATGAAGCCGAAGAAACGCTCAATGTGAAGCAGCAGGACG
GTGCCATCCTCAAAGAGGAAGTGTCTGACGACGAAATTGCCGAGGTTGTGTCCACGTGGACCGGCATCCCGGTGCAAAAG
ATGATGCAGGGTGAAATGGCGAAGCTCATCGACTTGGAAGAAAAGCTGCATGAGCGCGTGGTGGGTCAGGATGAAGCCGT
GTCCGCCGTGGCGGGCGCCATTCGCCGCAATCGCGCAGGGCTTTCCGACCCTGATCGTCCTATCGGCTCGTTTTTATTCT
TGGGTCCGACAGGCGTAGGCAAAACCGAGCTTGCCAAAGCGCTTGCGGAGTACCTGTTCGATAGCGAAAAGTCGATGGTG
CGCATCGACATGTCCGAATACATGGAAAAGTTCAGCGTACAGCGCCTGATCGGCGCGCCTCCGGGATACGTCGGTTACGA
CGAGGGCGGTCAGTTGACCGAGGCCGTACGCCGCAAACCCTACAGTGTGATCCTGCTCGACGAGATTGAGAAAGCGCATC
CGGATGTGTTCAACATCTTGCTTCAGGTGCTTGACGACGGGCGTTTGACCGATGGTCAGGGTCGCGTGGTGTCGTTCAAA
AACGCCATCATCATCATGACGTCAAACGTGGGTTCGCAGTCCATTCGCGAATTCTCCAACCAGGGTGGCAGCGGATCGAT
GGGTCAGATGATGGAAGATATGATGTCGGGCGACATCGCATCAACCGCTAAGCGTCTGGCCGAGTTGCAAACGCAGATAA
ACGACGCGCTGCGGGCCACGTTCAGGCCGGAATTCCTGAACCGTATCGACGATATCATCACCTTCAACGCGCTTTCCATC
GAGGCTATGGAGCCCATCGTGGAATTGCAGCTGAACGATGTTCGCGATCGTTTGGCCGATCGTCGCATTACGCTTGATGT
GACGCCTGCTGCAATGGAGCATCTCTCCATTGACGGCTATGATCCGGTCTTTGGCGCCCGTCCGTTGCGTCGTCTCATTC
AGCGCGAAGTGGTTGACCGCATCGCCCAGAAGGTGGTCGAAGGTAAAATGCGTGATCGCAGCCACGTGCTGATCGACCTC
GATGCCGACGGCAATTACGAATGCAAGGTGGAAGAGCCGCTCGATTTTGACACGCTCACCCTCGATGCCGAGCCGGTGAA
GTAA

Protein sequence :
MNMGNLNKLALTAQEALQQTITIASEKEASQAEPIHMLKALLESKENNLSAIIKRIGADPFQLQVNVDAEIDRMPKVSSS
GGMMMSGVPGPALMNVIDNAVKIAEKLGDSYATSEHLLIALSEDKGAAGKILSVAGVTRKTIEAAYDELRGDTRVTDPQN
KAQFEALEQYGQNLTQQAREGKLDPVIGRAEEIRRTIQVLSRRTKNNPVLIGEPGTGKTAIVEGLAQRIVAGDVPSSLRD
RDIIALDLPSMIAGAKYRGEFEDRLKAVLREVKQSEGQIILFIDELHTIVGAGSTGDSSMDAGNMLKPALARGELHAIGA
TTLDEYRKYVEKDAALERRFQPVLVTEPTVEDTIAILRGLKEKYEIHHGVRITDAAIVAAAELSNRYISDRFLPDKAIDL
MDEAASRLRIEIDSMPEEVDAAERKLTQMQIEEQALMKESDDASKERLEALRRDIAAAREDLDKRKAEWQNEKDVIESVQ
LLKGELEGAQMEEERATREGDLSKASELRYARIPELQRRLHEAEETLNVKQQDGAILKEEVSDDEIAEVVSTWTGIPVQK
MMQGEMAKLIDLEEKLHERVVGQDEAVSAVAGAIRRNRAGLSDPDRPIGSFLFLGPTGVGKTELAKALAEYLFDSEKSMV
RIDMSEYMEKFSVQRLIGAPPGYVGYDEGGQLTEAVRRKPYSVILLDEIEKAHPDVFNILLQVLDDGRLTDGQGRVVSFK
NAIIIMTSNVGSQSIREFSNQGGSGSMGQMMEDMMSGDIASTAKRLAELQTQINDALRATFRPEFLNRIDDIITFNALSI
EAMEPIVELQLNDVRDRLADRRITLDVTPAAMEHLSIDGYDPVFGARPLRRLIQREVVDRIAQKVVEGKMRDRSHVLIDL
DADGNYECKVEEPLDFDTLTLDAEPVK

• Homologs from PAI DB

Gene      GenBank Accn   Product             Virulence or Resistance   PAI or REI   Alignment Type   E-value   Identity (%)
STY0294   NP_454876.1    ClpB-like protein   Not tested                SPI-6        Protein          3e-108    41

• Homologs from VFDB (virulence genes)

Gene         GenBank Accn     Product                ID of source DB   Alignment Type   E-value   Identity (%)
EGYY_28870   YP_004712259.1   hypothetical protein   VFG0079           Protein          2e-163    47
EGYY_28870   YP_004712259.1   hypothetical protein   VFG2084           Protein          4e-115    42
EGYY_28870   YP_004712259.1   hypothetical protein   VFG2076           Protein          9e-119    42