Gene Information

Name : EcHS_A0942 (EcHS_A0942)
Accession : YP_001457679.1
Strain : Escherichia coli HS
Genome accession: NC_009800
Putative virulence/resistance : Unknown
Product : major tail sheath protein
Function : -
COG functional category : R : General function prediction only
COG ID : COG3497
EC number : -
Position : 941052 - 942224 bp
Length : 1173 bp
Strand : +
Note : identified by similarity to SP:P22501; match to protein family HMM PF04984
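
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the final codon is a stop codon. The function and variable names are illustrative and are not part of the record.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1

# Coordinates taken from the Position field of this record.
start, end = 941052, 942224

length_bp = feature_length(start, end)   # should equal the Length field
codons = length_bp // 3                  # number of codons, including the stop codon
residues = codons - 1                    # protein length, assuming one terminal stop codon

print(length_bp, codons, residues)       # 1173 391 390
```

Under these assumptions the coordinates reproduce the stated length of 1173 bp, which corresponds to 391 codons and a 390-residue protein.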

DNA sequence :
ATGGCTCAGGATTACCACCACGGAGTGCGCGTTGTTGAAGTCAACGAAGGCACTCGATCCATTACTACGGTGAGCACCGC
CATCGTGGGCATGGTCTGCACGGGCGATGATGCCGATGCAAAAATGTTCCCTCTTAATAAACCCGTGCTGATCACTGATG
TGCTTACTGCCAGCGGTAAAGCGGGTGAGTCAGGTACGCTGGCCCGTTCGCTGGATGCCATCGCTGACCAGGCAAAACCC
GTGACCGTTGTTGTGCGTGTGCCGCAGGGTGAAACGGAAGAAGAAACCACGACCAATATCATCGGAGCAGTGACCGCTGA
AGGTAAAAAAACAGGCATGAAAGCCCTGTTATCTGCCCAGTCACAGCTCGGCGTTAAACCGCGCATTCTCGGCGTGCCAG
GCCACGACACCAAGGCGGTAGCTACTGAGTTGCTGAGCGTGGCGCAAAGCCTGCGTGGATTTGCTTACCTGTCAGCGTAT
GGCTGCAAGACGGTACAGGAGGCGATCACTTACCGTGAAAACTTCAGCCAGCGCGAAGGGATGCTGATCTGGCCTGACTT
TACTGGCTGGGACACGGTGCTGAATGCCGAAGCAACGGCATATGCCACCGCCCGTGCGCTTGGTCTGCGCGCCAAAATTG
ACGAGCAGACCGGATGGCACAAAAGCCTGTCCAACGTGGGCGTGAACGGTGTCACCGGAATTTCTGCTGATGTGTTCTGG
GATCTGCAGGACCCGGCAACCGATGCAGGTCTGCTGAACCAGAACGACGTCACCACGCTTGTGCGTAAAGACGGTTTCCG
CTTCTGGGGTTCCCGCTGCCTGAGTGATGACCCGCTCTTTGCCTTCGAAAACTACACCCGCACGGCGCAGGTGCTGATGG
ACACGATGGCAGAAGCACACATGTGGGCGGTGGATAAACCGCTTAACCCGTCGCTGGCCCGCGACATTATCGAGGGGATC
CGCGCCAAAATGCGCAGCCTGGTCAGTCAGGGCTATCTCATTGGTGGTGATTGCTGGCTGGATGAGTCGGTGAACGACAA
AGACACGCTGAAAGCCGGAAAACTCACCATCGACTACGACTACACGCCAGTGCCGCCACTTGAAAACCTGATGCTGCGTC
AGCGCATCACCGATCAGTACCTGGTGAATTTCTCTAGCCAGGTCAGCGCGTAA

Protein sequence :
MAQDYHHGVRVVEVNEGTRSITTVSTAIVGMVCTGDDADAKMFPLNKPVLITDVLTASGKAGESGTLARSLDAIADQAKP
VTVVVRVPQGETEEETTTNIIGAVTAEGKKTGMKALLSAQSQLGVKPRILGVPGHDTKAVATELLSVAQSLRGFAYLSAY
GCKTVQEAITYRENFSQREGMLIWPDFTGWDTVLNAEATAYATARALGLRAKIDEQTGWHKSLSNVGVNGVTGISADVFW
DLQDPATDAGLLNQNDVTTLVRKDGFRFWGSRCLSDDPLFAFENYTRTAQVLMDTMAEAHMWAVDKPLNPSLARDIIEGI
RAKMRSLVSQGYLIGGDCWLDESVNDKDTLKAGKLTIDYDYTPVPPLENLMLRQRITDQYLVNFSSQVSA
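
The protein sequence can be checked against the DNA sequence by translating the open reading frame. The following is a minimal sketch, assuming the standard codon assignments (which NCBI translation table 11 for bacteria shares with table 1) and a sequence read in frame from its first base. The `translate` helper is hypothetical and is shown on the first and last codons of this record only.

```python
# Build the standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()        # drop line breaks and whitespace
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")   # 'X' marks an unrecognized codon
        if aa == "*":                              # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last four codons of the DNA sequence above.
print(translate("ATGGCTCAGGATTACCACCACGGA"))  # MAQDYHHG
print(translate("GTCAGCGCGTAA"))              # VSA
```

Both fragments agree with the start (MAQDYHHG) and end (VSA) of the listed protein sequence.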

Homologs from PAI DB

Gene     GenBank Accn  Product                             Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity (%)
STY4607  NP_458690.1   probable major tail sheath protein  Not tested               SPI-7       Protein         4e-158  91
t4301    NP_807898.1   major tail sheath protein           Not tested               SPI-7       Protein         4e-158  91
unnamed  ABR13484.1    putative tail sheath protein        Not tested               PAGI-6      Protein         3e-118  68