Gene Information

Name : Rahaq_4564 (Rahaq_4564)
Accession : YP_004215272.1
Strain :
Genome accession: NC_015062
Putative virulence/resistance : Virulence
Product : type VI secretion protein EvpB
Function : -
COG functional category : S : Function unknown
COG ID : COG3517
EC number : -
Position : 125071 - 126612 bp
Length : 1542 bp
Strand : +
Note : KEGG: spe:Spro_1784 hypothetical protein; TIGRFAM: type VI secretion protein, EvpB/VC_A0108 family; PFAM: protein of unknown function DUF877

DNA sequence :
ATGTCAGTCAACACTGAAAACTCTTCTTCCAGCCAGACAACGGTTCTGGATCGCCCTGACGCCGGTACGGTATACGCTTC
CCTGTTTGATAAAATCAACCTGACGCCTGTGTCCTCACTGGGTGATCTGAACGATTTTCAGGATACGTCTGCGCTGGCCG
ATGTCACCGCTGATCAGCGTGTCACTGCCGCTGTACAGGTCTTCCTTGAGCGCCTGAAGCTGTCCGGTCAGACGGTTGAA
CGTCTGGACAAAACCCTGCTCGATCATCATATCGCCGAACTCGACAGCCAGATCAGCCGTCAGTTGGATGTGGTCATGCA
TCACCCGGAATTTCAGCGTATTGAATCCGGCTGGCGTGGTCTGAAGTTCCTGGTTGATCGTACTGATTTCCGCCAGAACG
TAAAAATTGAACTGCTGGATGTGTCTAAAGACGACCTCCGTCAGGACTTCGAAGATTGTCCTGAGATCATCCAGAGCGGC
CTGTACCGCCATACCTACATTCAGGAATATGACACTCCGGGCGGCGAGCCAATCGGTTCTGTGATTTCCAACTACGAGTT
TGACGCGTCGGCGCAAGATGTCGCCCTGCTGCGTAACATTTCTAAAGTATCTGCCGCTGCGCATATGCCATTCGTCGGTG
CGGTCGGCCCTAAATTCTTCCTGAAAGATTCCATGGAAGAAGTGGCGGCGATTAAAGATATCGGCAACTACTTTGACCGT
GCGGAATACATCAAGTGGAAATCCTTCCGCGATTCTGACGATTCCCGCTATATCGGTCTGACCATGCCGCGCGTACTCGG
TCGCCTGCCGTATGGTCCGGATACCGTTCCTGTTCGCAGCTTCAACTACGTTGAAGAAGTGAAAGGTCCGGATCACGAGA
AATATCTGTGGACAAACGCGGCGTTCGCCTTTGCGGCCAACATGGTGAAAAGCTTCATCAAAAATGGCTGGTGTGTGCAA
ATCCGTGGCCCACAGGCGGGCGGTGCAGTGACTGATCTGCCGATCCATCTGTACGATCTCGGCACCGGCAATCAGGTGAA
AATCCCGTCAGAAGTGATGATCCCGGAAACCCGCGAATTTGAATTTGCCAATCTGGGCTTTATCCCGCTGTCTTACTACA
AAAACCGCGATTACTCCTGCTTCTTCTCCGCGAACTCTGCGCAGAAACCGGCGCTGTACGATACTGCCAATGCCACTGCA
AACAGCCGCATCAATGCACGCCTGCCGTACATCTTCCTGCTTTCACGCATCGCACACTACCTGAAGCTGATTCAGCGGGA
AAATATCGGCACCACCAAAGACCGTCGTCTGCTGGAACTGGAACTGAACACCTGGATCCGTACGCTGGTCACTGAAATGA
CTGACCCGGGCGACGACCTGCAAGCCTCTCACCCGCTGCGTGACGCCAAAGTGACCGTCGAAGATATCGAAGACAACCCG
GGCTTCTTCCGCGTGAAACTGTACGCCGTACCGCACTTCCAGGTGGAAGGGATGGACGTGAATCTGTCACTGGTTTCTCA
GATGCCTAAAGCGAAAGCGTAA

Protein sequence :
MSVNTENSSSSQTTVLDRPDAGTVYASLFDKINLTPVSSLGDLNDFQDTSALADVTADQRVTAAVQVFLERLKLSGQTVE
RLDKTLLDHHIAELDSQISRQLDVVMHHPEFQRIESGWRGLKFLVDRTDFRQNVKIELLDVSKDDLRQDFEDCPEIIQSG
LYRHTYIQEYDTPGGEPIGSVISNYEFDASAQDVALLRNISKVSAAAHMPFVGAVGPKFFLKDSMEEVAAIKDIGNYFDR
AEYIKWKSFRDSDDSRYIGLTMPRVLGRLPYGPDTVPVRSFNYVEEVKGPDHEKYLWTNAAFAFAANMVKSFIKNGWCVQ
IRGPQAGGAVTDLPIHLYDLGTGNQVKIPSEVMIPETREFEFANLGFIPLSYYKNRDYSCFFSANSAQKPALYDTANATA
NSRINARLPYIFLLSRIAHYLKLIQRENIGTTKDRRLLELELNTWIRTLVTEMTDPGDDLQASHPLRDAKVTVEDIEDNP
GFFRVKLYAVPHFQVEGMDVNLSLVSQMPKAKA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec18 YP_851426.1 hypothetical protein Not tested PAI II APEC-O1 Protein 3e-103 45
aec18 AAQ96712.1 Aec18 Not tested AGI-1 Protein 2e-103 45

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Rahaq_4564 YP_004215272.1 type VI secretion protein EvpB VFG2475 Protein 2e-114 48
Rahaq_4564 YP_004215272.1 type VI secretion protein EvpB VFG2093 Protein 2e-112 46
Rahaq_4564 YP_004215272.1 type VI secretion protein EvpB VFG2070 Protein 3e-90 41