Gene Information

Name : c4578 (c4578)
Accession : NP_756438.1
Strain : Escherichia coli CFT073
Genome accession: NC_004431
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : R : General function prediction only
COG ID : COG2404
EC number : -
Position : 4340940 - 4341782 bp
Length : 843 bp
Strand : +
Note : Escherichia coli O157:H7 ortholog: z1664

DNA sequence :
ATGTCAGACATCACAATCTCCCGTCCGGAAGTGGTCACCGGGCATACGGACGTTATCTGCTCAACCAGTATCCGCCACAT
TCTGGCTGTACGAAAGAGTACACTGCTGCAAATCGACACACTTATCCGGCAACTGGCTGAAATATCAGCAATGACAGAAA
GTATTGGCGGTAAAGCCACCCCGGGCTGGGCCATGAAACAGGATTTTCGCTGCGGCTGCTGGCTGATGGAGAAACCGGAA
ACCGCAATGAAAGCCATCACACGCAATCTCGATCGCGAAATCTGGCGTGACCTGATGCAACGTTCAGGGATGCTTTCCTT
AATGGATGCACAGGCCCGTGATACATGGTACCGGTCACTGGAGTACGATAATTTTCCGGAAATCAGTGAAGCGAACATTC
TGATCACATTTGAACAACTACACCAGAATAAGGATGAGGTGTTTGAGCGAGGGGTGATCAACGTCTTCAGGGGACTGAGC
TGGAATTACAAAACCAATTGCCCCTGCAAATTTGGCAGTAAAATTATCGTCAACAACCTGGTGAGGTGGGACCGGTGGGG
GTTTCATCTTATCACCGGGCAACAGACTGACCGACTCGTCGACCTGGAAAGAATGCTACATCTGTTCAGCGGCAAACCGA
TCCCCGACAACCGGGAAAACATCACTATTCGTCTGGATGATCACATCCGGTCTGTTCAGGGTAAAGAATGCTATGAAGAT
GAGATGTTCAGCATCAGATACTTTAAGAAAGGCTCCGCGCACATCAGGTTCAGGAAACCAGAACTGGTTGACAGGCTGAA
TGATATTATTGCGAAACACTATCCGGGAGTGTTACCTTCATAA

Protein sequence :
MSDITISRPEVVTGHTDVICSTSIRHILAVRKSTLLQIDTLIRQLAEISAMTESIGGKATPGWAMKQDFRCGCWLMEKPE
TAMKAITRNLDREIWRDLMQRSGMLSLMDAQARDTWYRSLEYDNFPEISEANILITFEQLHQNKDEVFERGVINVFRGLS
WNYKTNCPCKFGSKIIVNNLVRWDRWGFHLITGQQTDRLVDLERMLHLFSGKPIPDNRENITIRLDDHIRSVQGKECYED
EMFSIRYFKKGSAHIRFRKPELVDRLNDIIAKHYPGVLPS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed CAE85207.1 hypothetical protein Not tested PAI V 536 Protein 1e-124 94
Z1664 NP_287166.1 hypothetical protein Not tested TAI Protein 2e-125 94
unnamed AAL57572.1 unknown Not tested LEE Protein 2e-124 94
Z1226 NP_286760.1 hypothetical protein Not tested TAI Protein 2e-125 94
unnamed CAI43906.1 hypothetical protein Not tested LEE Protein 5e-123 94
ECO111_3781 YP_003236116.1 hypothetical protein Not tested LEE Protein 3e-123 93
z1226 CAD33792.1 Z1226 protein Not tested PAI I 536 Protein 1e-124 93
ECO103_3595 YP_003223452.1 hypothetical protein Not tested LEE Protein 4e-120 90
unnamed AAK00485.1 unknown Not tested SHI-1 Protein 1e-119 89
S3207 NP_838489.1 hypothetical protein Not tested SHI-1 Protein 1e-119 89
SF3002 NP_708776.1 hypothetical protein Not tested SHI-1 Protein 1e-119 89
aec79 AAW51762.1 Aec79 Not tested AGI-3 Protein 2e-122 89
ORF_49 AAZ04463.1 hypothetical protein Not tested PAI I APEC-O1 Protein 2e-120 88
APECO1_3484 YP_854327.1 hypothetical protein Not tested PAI I APEC-O1 Protein 3e-120 88
unnamed CAD66210.1 hypothetical protein Not tested PAI III 536 Protein 1e-109 87

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
c4578 NP_756438.1 hypothetical protein VFG1533 Protein 7e-125 93
c4578 NP_756438.1 hypothetical protein VFG0666 Protein 6e-120 89
c4578 NP_756438.1 hypothetical protein VFG1685 Protein 7e-110 87