Gene Information

Name : EcE24377A_4894 (EcE24377A_4894)
Accession : YP_001465822.1
Strain : Escherichia coli E24377A
Genome accession: NC_009801
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4882684 - 4883502 bp
Length : 819 bp
Strand : +
Note : identified by similarity to GB:ABB61180.1; match to protein family HMM PF06067

DNA sequence :
ATGCGACTAGCCAGTCGTTTTGGTTATGCGAACCAGATACGCCGTGACCGTCCGCTGACACACGAAGAACTGATGCACTA
TGTTCCCAGTATTTTCGGTGAAGACCGGCATACTTCCCGCAGTAAACGATATGCGTACATTCCCACCATCACCGTACTGG
AAAGCCTGCAGCAGGAAGGTTTTCAGCCATTCTTCGCCTGCCAGACCCGCGTGCGCGACCCGGGCCGCCGGGGATACACA
AAACACATGCTGCGTCTGCGGCGGGCCGGAGAGATAAACGGAGAACATGTCCCTGAAATTATTCTGCTCAACTCTCATGA
CGGTACCTCCAGCTACCAGATGCTGCCGGGTTACTTCAGATTCGTCTGCCAGAATGGCTGCGTCTGCGGTCAGTCTCTGG
GGGAAGTGCGTGTTCCACACCGGGGAAATGTGGTGGAGAAAGTTATCGAAGGGGCTTACGAAGTGGTGGGGGTTTTTGAC
CGGATTGAGGAAAAGCGTGATGCCATGCAGTCGCTGGTCCTGCCGCCACCGGCACGCCAGGCGCTGGCACAGGCGGCACT
GACTTACCGTTATGGTGACGAACATCAGCCAGTCACCACCGCTGACATTCTGACGCCACGACGCCGGGAGGATTACGGTA
AGGACCTGTGGAGTACTTATCAGACCATCCAGGAGAATATGCTGAAAGGCGGGATTTCCGGCCGCAGTGCAAAAGGAAAA
CGTATCCACACCCGTGCCATTCACAACATCGACACCGATATTAAGCTCAACCGCGCATTGTGGGTAATGGCAGAAACGCT
GCTGGAGAGCCTGCGCTGA

Protein sequence :
MRLASRFGYANQIRRDRPLTHEELMHYVPSIFGEDRHTSRSKRYAYIPTITVLESLQQEGFQPFFACQTRVRDPGRRGYT
KHMLRLRRAGEINGEHVPEIILLNSHDGTSSYQMLPGYFRFVCQNGCVCGQSLGEVRVPHRGNVVEKVIEGAYEVVGVFD
RIEEKRDAMQSLVLPPPARQALAQAALTYRYGDEHQPVTTADILTPRRREDYGKDLWSTYQTIQENMLKGGISGRSAKGK
RIHTRAIHNIDTDIKLNRALWVMAETLLESLR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
ECO103_3587 YP_003223444.1 hypothetical protein Not tested LEE Protein 2e-119 98
S3199 NP_838482.1 hypothetical protein Not tested SHI-1 Protein 1e-117 97
SF2994 NP_708768.1 hypothetical protein Not tested SHI-1 Protein 1e-117 97
yafZ ADD91704.1 YafZ Not tested PAI-I AL862 Protein 2e-118 96
unnamed CAI43843.1 hypothetical protein Not tested LEE Protein 2e-118 96
aec71 AAW51754.1 Aec71 Not tested AGI-3 Protein 2e-118 96
unnamed CAD66201.1 hypothetical protein Not tested PAI III 536 Protein 4e-116 95
c5156 NP_757004.1 hypothetical protein Not tested PAI II CFT073 Protein 1e-113 93
yafZ YP_854321.1 hypothetical protein Not tested PAI I APEC-O1 Protein 1e-113 93
ORF_47 AAZ04456.1 conserved hypothetical protein Not tested PAI I APEC-O1 Protein 1e-113 93
Z1215 NP_286750.1 hypothetical protein Not tested TAI Protein 5e-112 92
Z1655 NP_287158.1 hypothetical protein Not tested TAI Protein 5e-112 92
unnamed CAD42096.1 hypothetical protein Not tested PAI II 536 Protein 1e-108 90
z1215 CAD33785.1 Z1215 protein Not tested PAI I 536 Protein 1e-109 90
yafZ CAE85199.1 YafZ protein Not tested PAI V 536 Protein 5e-111 90
ORF SG86 AAN62307.1 conserved hypothetical protein Not tested PAGI-3(SG) Protein 1e-72 56
ORF C80 AAN62174.1 conserved hypothetical protein; associated with oriT region on plasmids Not tested PAGI-2(C) Protein 2e-72 53

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
EcE24377A_4894 YP_001465822.1 hypothetical protein VFG0658 Protein 6e-118 97
EcE24377A_4894 YP_001465822.1 hypothetical protein VFG1676 Protein 2e-116 95
EcE24377A_4894 YP_001465822.1 hypothetical protein VFG1526 Protein 7e-110 90
EcE24377A_4894 YP_001465822.1 hypothetical protein VFG1615 Protein 6e-109 90