Gene Information

Name : ETEC_3951 (ETEC_3951)
Accession : YP_006117488.1
Strain : Escherichia coli ETEC H10407
Genome accession: NC_017633
Putative virulence/resistance : Virulence
Product : putative restriction methylase
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4259894 - 4260736 bp
Length : 843 bp
Strand : +
Note : Similar to N-terminus to codon 310 of Serratia marcescens. TraG1 UniProt:P95799 (EMBL:U60283 (563 aa) fasta scores: E()=0.00082, 21.382% id in 304 aa, and to entire protein of Escherichia coli. hypothetical protein. UniProt:Q8VQQ6 (EMBL:AF453442 (280 aa)

DNA sequence :
ATGTCTGCAATCACACTCTCCCGGCCGGAAGTGGTCAACGGGCATACGGACGTTATCTGCTCAACCTCAGTCAGCCACAT
TCTGGCTGTACGAAAGAGTACACTGCTGCAAATCGACACACTTATCCGGCAACTGGCTGAAATCTCATCAATGACAGAAA
GTATTGGCGGTAAAACCGCACTGGACTGGGCCATGAAACAGGATTTTCGCTGCGGCTGCTGGCTCATGGAGAAACCTGAA
ACGGCAATGAAAGCCATCACCCGGAACCTTGACCGTGAAATCTGGCGTGACCTGATGCAACGCTCCGGGATGCTTTCCTT
AATGGATGCACAGGCCCGTGATACATGGTACCAGTCACTGGAGTACGATAATTTTCCGGAAATCAGTGAAGCGAACATTC
TGAGCACATTTGAACAACTGCACCAGAACAAAGATGAAGTGTTTGAGCGAGGGGTGATCAACGTCTTCAGGGGACTGAGC
TGGAATTACAAAACCAATTGCCCCTGCAAGTTTGGCAGTAAAATTATCATCAACAACCTGGTAAGGTGGGACAGATGGGG
GTTTCACCTTATCACCGGGCAACAGGCCGATCGACTTGCCGATCTGGAAAGAATGCTGCACCTGTTCAGCGGCAAACCGA
TCCCCGACAACCGGGAAAACATCACCATTCATCTGGATGACCACATCCAGTCTGTTCAGGGTAAAGAGGACTATGAAGAT
GAGATGTTCAGCATCAGATACTTTAAGAAGGGCTCCGCACACATCACGTTCAGGAAGCCAGAACTGGTTGGCAGACTTAA
TGAGATTATTGCGAAACACTATCCGGGAGTGTTACCTTCATAA

Protein sequence :
MSAITLSRPEVVNGHTDVICSTSVSHILAVRKSTLLQIDTLIRQLAEISSMTESIGGKTALDWAMKQDFRCGCWLMEKPE
TAMKAITRNLDREIWRDLMQRSGMLSLMDAQARDTWYQSLEYDNFPEISEANILSTFEQLHQNKDEVFERGVINVFRGLS
WNYKTNCPCKFGSKIIINNLVRWDRWGFHLITGQQADRLADLERMLHLFSGKPIPDNRENITIHLDDHIQSVQGKEDYED
EMFSIRYFKKGSAHITFRKPELVGRLNEIIAKHYPGVLPS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed AAL57572.1 unknown Not tested LEE Protein 6e-126 95
unnamed CAI43906.1 hypothetical protein Not tested LEE Protein 1e-124 94
z1226 CAD33792.1 Z1226 protein Not tested PAI I 536 Protein 6e-124 93
Z1226 NP_286760.1 hypothetical protein Not tested TAI Protein 2e-124 93
Z1664 NP_287166.1 hypothetical protein Not tested TAI Protein 2e-124 93
ECO103_3595 YP_003223452.1 hypothetical protein Not tested LEE Protein 1e-121 91
unnamed CAE85207.1 hypothetical protein Not tested PAI V 536 Protein 1e-122 90
ECO111_3781 YP_003236116.1 hypothetical protein Not tested LEE Protein 3e-121 89
aec79 AAW51762.1 Aec79 Not tested AGI-3 Protein 1e-120 89
unnamed AAK00485.1 unknown Not tested SHI-1 Protein 4e-120 88
S3207 NP_838489.1 hypothetical protein Not tested SHI-1 Protein 5e-120 88
SF3002 NP_708776.1 hypothetical protein Not tested SHI-1 Protein 5e-120 88
ORF_49 AAZ04463.1 hypothetical protein Not tested PAI I APEC-O1 Protein 5e-120 88
APECO1_3484 YP_854327.1 hypothetical protein Not tested PAI I APEC-O1 Protein 7e-120 88
unnamed CAD66210.1 hypothetical protein Not tested PAI III 536 Protein 7e-110 87

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
ETEC_3951 YP_006117488.1 putative restriction methylase VFG1533 Protein 4e-124 93
ETEC_3951 YP_006117488.1 putative restriction methylase VFG0666 Protein 2e-120 88
ETEC_3951 YP_006117488.1 putative restriction methylase VFG1685 Protein 4e-110 87