Gene Information

Name : NRG857_20910 (NRG857_20910)
Accession : YP_006122524.1
Strain : Escherichia coli NRG 857C
Genome accession : NC_017634
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4456896 - 4459664 bp
Length : 2769 bp
Strand : +
Note : COG3451 Type IV secretory pathway, VirB4 components
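
The Position and Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention; the record itself does not state this) and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Coordinates copied from the Position field (assumed 1-based, inclusive).
start, end = 4456896, 4459664

length_bp = end - start + 1      # inclusive interval length in base pairs
codons = length_bp // 3          # total codons, including the assumed stop codon
protein_len = codons - 1         # residues once the stop codon is excluded

print(length_bp, length_bp % 3, protein_len)  # prints: 2769 0 922
```

Under those assumptions the result agrees with the stated length of 2769 bp and implies a 922-residue product.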

DNA sequence :
ATGTTCAGTTTATTCACCAAAAAACAGCCGCAGCCGGTGGCTGATGAACCGGCGTTGCCGTCACAGAAAAAGAGCGGCAG
GCTGACGCAGGACGGCGTGTGTTCGCTGTATCCCAATAACGCCTCGTTTATCGACTTCCTGCCGTGGGTGGAATACTTGC
CGGAAAGCCAGAGCCTGCTGCTCGATGACGGTGTTTCGGTGGGCGCCGTATTTGAAGTGGTGCCGGTGGGAACTGAAGGG
CGCACCGGCGAACGCCTGGAAGAGATCCGCGATATCGTTGAGGATGCGCTCCAGGACAGCTTTGACGAGCGGGACAGTCA
CCCGTGGGTTATCCAGTTCTTCTGCCAGGATGAAACCGATGTGACGGAGTACATCGATAAACTGCGCGGCTATGTTAAAC
CGTGGGCGCAGGGAACAGCATTCACCGACGCGTATATGAATGAGATGGAACGGCATATGCGCGGTATTGCGGTGCCGAAA
GGGCTGTTTATCGACAAAGCGGTCACCGGCGCACCGTGGCGCGGTCAGCAGCGTCGTACCCGGATGGTGATTTATCGCTA
TGTGAACCCGGGAGCGAACGAACCGTATTCACCAGAAGAGCAACTGAATCAGGTCTGTGAGCGATTATCGTCTGCGCTGG
CCGGGGCGGGAATAGTGACTTCCCGGCAGAACGGCGAGCAGATCCACGCCTGGCTGCTGCGCTGGTTTAACCCTGAGCCG
GACTGGGTGGATAAAGAGACGCTGTATCGCACGGCGGCGCATGTTGACACCCGCAACGGTGAGCTGCCGGTGCTTAATGA
TTTCAGTGAAACCCTGATGTTCACGCCGCCGCGCTCTGATGTTGAGAACGGCGTGTGGTGGTTCGATGAGCGGCCGCATA
AAGCGGTCAGCATCGATCGTATCCGCCGTGCGCCTTCGGTCGGCCATCTTACAGGCGAAACCCGCAAAGGGGAGAACATT
AACGCGCTGATGGATTTGCTGCCGGAAGGGACGGTGATTTCTCTGACGCTGGTCGTGCAGCCGCAGGATGTGCTGGAAGA
GCGCTTCAACCATCTGGCGAAAAATGCGATTGGCGAGAATACTGAGTCCGAACGCGTCCGGGCCGATGCCGATACGGCCA
AAAGTTATTTGGGCGAACGGCATAAACTGTACCGCGCCAGCATGACCTTCTTTCTCTCCGGGGCGTCGCTGAAAATCCTC
AACAGCCGCCAGCGCGACCTGACGGCCATTCTCCTGAACGCCGGGTTGCAGCCGGTGAAACCGGAATACGACGTCGCGCC
GCTGAATGGCTACCTCCGGGCGCTGCCGATGTGTTTTAACCCGACGCTGGATAAGAAGAACTGGTACACCCGCCTGACGT
TTGTTCAGCATATGGCAAATATGCTACCGGTGTTTGGCCGTGACACCGGCACCGGTCATCCGGGATTTACCTTCTTTAAC
CGGGGCGGTGCGCCACTGACGTTCGATCCACTTAACAGCGCCGACCGCTCCAAAAACGCGCACCTGGTGCTGTTTGGCCC
GACAGGATCGGGCAAGTCGGCAACGCTTAACAGCCTGTTTGCCCAGTTGATGGCGATTCACCGCCCACGTCTGTTTATTG
CCGAAGCGGGTAACTCGTTTGGCCTGTTTGCCGATTATTGCGCGGAGCTGGAGCTGTCGGTGAACAAGGTCAGTATCAAG
CCGGGCTGTGGTATTTCGCTGGCCCCGTTCGCCGATGCCTACAAGCTGATTGAAGCGCCAGTGAAAACTGTTGACGAAAG
CGAGCTGAGCGAACGTATCGAGGTGGAGGAAACGGACGACGGCGATGCGGAGGAAGAGCGCGATATTCTGGGCGAGCTGG
AGATTGTGGCGCGTCTGATGGTCACCGGGGGCGAAGCCAAGGAAGAGGCGCGGCTGGAGCGTGCCGATCGGGGCATGATG
CGCGAGGCGATTCTGGCCGGGGCGCGTAAGGCCTGGAATGAAGAGCGTCAGATGATCACCGGCGACCTTCAGGATGCGTT
TTACGTCTTCGCGCAGGATGAATCCCGCCCGGAAGTTCGCCGTGCCCGGGCGCAGAATATGGGTGAATCGCTGGGCGTAT
TTATGCAGGGCTTCCTGGGCGAGTTGTTTAACCGCCCGGGGCAGAACTGGCCGGAAGCAGATGTCACGCTGATTGATCTC
GGCACACTGGCGCGTGAAGGCTATGAAGCGCCGCTGGCCGTGGCCTATACCTCGCTGGTTAATACCATCAACAATATCGC
TGAACGTGACCAGTTTCTGGACCGGGAGATCGTCTTCCCAACCGATGAAGCACACATGATCACCACGAACCCGTTGCTGG
CCCCGTACGTGGTTAAGGTGGTGAAAATGTGGCGTAAACTTGGTGCGTGGTTGTGGCTGGCAACGCAGAACCTGGAAGAT
TTCCCGGCCACGGCCAAGAAGATGCTCAATATGATCGAATGGTGGCTGTGTCTGGTGATGCCGCCTGAAGAGGTGGAAGA
GATCGCCCGCTTTAAAAAGCTGACTGCCGAGCAAAAAGCGATGCTGCTGTCTGCGACTAAAACGCCGCGCTGCTACACCG
AAGGCGTGGTGCTGGCAACCCGTATTGAAGCGTTGTTCCGGGCTGTACCGCCAAGCCTGTATCTGGCGCTGGGTATGACG
GAGAAGGATGAAAAGGCGGCACGTCGGGCATTGATGGTTGAACATGGTATCAGCGAGCTGGACGCAGCGAAGCGCATTGC
GCACCAACTCGATATCGCCCGTGGGATCGCCACGAAGGAGGCGGCATGA

Protein sequence :
MFSLFTKKQPQPVADEPALPSQKKSGRLTQDGVCSLYPNNASFIDFLPWVEYLPESQSLLLDDGVSVGAVFEVVPVGTEG
RTGERLEEIRDIVEDALQDSFDERDSHPWVIQFFCQDETDVTEYIDKLRGYVKPWAQGTAFTDAYMNEMERHMRGIAVPK
GLFIDKAVTGAPWRGQQRRTRMVIYRYVNPGANEPYSPEEQLNQVCERLSSALAGAGIVTSRQNGEQIHAWLLRWFNPEP
DWVDKETLYRTAAHVDTRNGELPVLNDFSETLMFTPPRSDVENGVWWFDERPHKAVSIDRIRRAPSVGHLTGETRKGENI
NALMDLLPEGTVISLTLVVQPQDVLEERFNHLAKNAIGENTESERVRADADTAKSYLGERHKLYRASMTFFLSGASLKIL
NSRQRDLTAILLNAGLQPVKPEYDVAPLNGYLRALPMCFNPTLDKKNWYTRLTFVQHMANMLPVFGRDTGTGHPGFTFFN
RGGAPLTFDPLNSADRSKNAHLVLFGPTGSGKSATLNSLFAQLMAIHRPRLFIAEAGNSFGLFADYCAELELSVNKVSIK
PGCGISLAPFADAYKLIEAPVKTVDESELSERIEVEETDDGDAEEERDILGELEIVARLMVTGGEAKEEARLERADRGMM
REAILAGARKAWNEERQMITGDLQDAFYVFAQDESRPEVRRARAQNMGESLGVFMQGFLGELFNRPGQNWPEADVTLIDL
GTLAREGYEAPLAVAYTSLVNTINNIAERDQFLDREIVFPTDEAHMITTNPLLAPYVVKVVKMWRKLGAWLWLATQNLED
FPATAKKMLNMIEWWLCLVMPPEEVEEIARFKKLTAEQKAMLLSATKTPRCYTEGVVLATRIEALFRAVPPSLYLALGMT
EKDEKAARRALMVEHGISELDAAKRIAHQLDIARGIATKEAA
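
As a quick consistency check between the two sequences, the first 30 and last 12 nucleotides of the DNA sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch that assumes the standard codon assignments (shared by the bacterial genetic code); the helper `translate` and the table-building approach are illustrative, not part of the record.

```python
from itertools import product

# Standard codon table built in TCAG order (an assumption; '*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a DNA string in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

head = "ATGTTCAGTTTATTCACCAAAAAACAGCCG"  # first 30 nt of the DNA sequence
tail = "GAGGCGGCATGA"                    # last 12 nt of the DNA sequence

print(translate(head))  # prints: MFSLFTKKQP
print(translate(tail))  # prints: EAA*
```

Both fragments match the listed protein, which begins with MFSLFTKKQP and ends with EAA, followed by a TGA stop codon.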

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
YE3482 | YP_001007641.1 | hypothetical protein | Not tested | YAPI | Protein | 0.0 | 69
api39 | CAF28513.1 | hypothetical protein | Not tested | YAPI | Protein | 0.0 | 67
YpsIP31758_3705 | YP_001402659.1 | hypothetical protein | Not tested | YAPI | Protein | 0.0 | 67
STY4573 | NP_458658.1 | hypothetical protein | Not tested | SPI-7 | Protein | 0.0 | 64
PMI2576 | YP_002152295.1 | hypothetical protein | Not tested | Not named | Protein | 0.0 | 61
ORF SG57 | AAN62279.1 | hypothetical protein | Not tested | PAGI-3(SG) | Protein | 0.0 | 55
ORF C47 | AAN62141.1 | hypothetical protein | Not tested | PAGI-2(C) | Protein | 0.0 | 54
Pmu_03070 | YP_005176206.1 | conjugative transfer ATPase TraC-like, PFL family | Not tested | ICEPmu1 | Protein | 0.0 | 52
unnamed | ABR13362.1 | hypothetical protein | Not tested | PAGI-5 | Protein | 0.0 | 51
RL022 | AAP84149.1 | conserved hypothetical protein | Not tested | PAPI-1 | Protein | 0.0 | 51
EXA33 | ABD94642.1 | conserved hypothetical protein | Not tested | ExoU island A | Protein | 0.0 | 51
virB4 | CAI36125.1 | VirB4 component | Not tested | PPHGI-1 | Protein | 0.0 | 51