Gene Information

Name : EcHS_A0235 (EcHS_A0235)
Accession : YP_001457009.1
Strain : Escherichia coli HS
Genome accession: NC_009800
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3519
EC number : -
Position : 254623 - 256473 bp
Length : 1851 bp
Strand : -
Note : NULL; identified by similarity to GB:CAG76341.1; match to protein family HMM PF05947; match to protein family HMM TIGR03359

DNA sequence :
ATGGAATTTGAAGAACGCTATTTCCGGGAAGAACTCGATTACCTGCGCCAGCTTAGCAAGCTGCTGGCAACGGAAAAACC
CCATCTGGCCCGCTTCCTGGCCGAAAAAGATGCGGATCCGGATATTGAACGCCTGCTGGAAGGGGTGGCTTTTCTTACCG
GCAATCTCCGCCAGAAAATTGAGGATGAATTCCCAGAACTGACGCACGGGCTTATTAAGATGCTATGGCCTAATTACCTG
CGTCCGGTTCCGGCAATGACCCTTATTGAATATACGCCGGATATGGATAAGTCTTCTGTACCGGTGTTAATCCCCCGTAA
TGAGCAGTTTACAACCAACGCCGGGGAAATCAGAGTTGATGAAGTGCTGCCCTCTGATGCTAAAAAGGAGGAGCCGCCTC
CCTGTACCTTCACTCTCTGCCGGGATATCTGGCTGCTGCCCGTTCGCCTGGAGCAGATTGAAAACCGCAGTACGACCCGT
AATGGTGTTATCAACATCACCTTTTCGGTCGCACCGGGAACGGACTTCCGCACGCTGGATCTGAACAAACTTCGCTTCTG
GCTCGGCAATGACGACAACTATACCCGTGACCAGCTTTATTTATGGTTCTGCGAATACTTGCAGGGTGCCGACCTGACTG
TGGGTGAACAGCATATTCGCCTGCCTGAGTTTATGCTAAAAGCTGTCGGTTTTGAGCCGCAGGATGCCATGCTGCCCTGG
CCGAAAAACGTCCACAGCGGCTACCGGATCCTTCAGGAGTATTTCTGTTACCCCGATGCGTTTCTCTTTTTTGATCTTTG
TGGTTGTCCGGCTTTGCCTGACGGATTGCAGGCGGAATTCTTTACCCTGCAACTGCGTTTTTCGCGCCCTTTGCCCGTGG
ACATCCGGCTGCGCCGCGATTCCCTGCGCCTGTATTGCGCACCTGCCATTAATTTATTTATCCACCATGCAGAAGCCATC
ACGCTGGACAACCGGCGGGCAGACTATCCGCTGGTTCCCAGCCGCCATTACCCACAACATTACGATGTATTTTCCGTTAA
CAGTGTGGTGAGCCAGGTCCAGGATATGTTCAGGAAAAAAGATCTGGGGCGTCCTGTTTCGACGCAGGCCGCGCGCCAGT
GGCCAGCCTTTGAAAGTTTCAGCCATCAGATGGAATACAGCCGGAAGCGGGAAGTGGTGTACTGGCATCACCGGACCAAA
ACATCCCTGTTCCATCGCGGCTTTGATCATACCCTTGCCTTTATACATGCTGATGGCAGTTATCCGTCAGACGAATCTCT
GCTCAGTAATGAAGTGGTTTCGGTATCGCTGACCTGTACCAACCGTGAGCTTCCGTCACAAATTCGTTCCGGCGATATCA
CCGGCACAACCGGTAAAAATGCAGCTGTTGCTTCATTTCGCAACATTACCCGCCCGACGCAACCACTCTGGCCGGTCATT
GATGGCAGCCTGCACTGGTCCCTACTCTCCGCCATGAACCTGAATTATCTGTCATTACTGGATACGGACGCGCTGAAGCA
GGTCATCGCCAACTTTGATCGCCACGCAATCCATCATCCGCAGACGGCACGGCTGTCACAACAAAAGCTGGATGCCATTG
AGCGTCTGGAGACCCGCCCCGTTGATCGCCTGTTTACGGGTATTCCCGTCCGGGGACTGGCCTCCACGCTGTATCTGCAC
CCGGAGCCGTTTGTCTGTGAAGGGGAAATGTATCTGCTCGGTACGGTGCTTTCGCATTTTCTGTCGCTGTACGCCAGCGT
TAACTCATTCCACATGCTGACCGTTGTGAACACAGAAAGCCAGGAGACATGGAAATGGACGGAAAGAATCGGGCAGCATC
CTCTTATCTGA

Protein sequence :
MEFEERYFREELDYLRQLSKLLATEKPHLARFLAEKDADPDIERLLEGVAFLTGNLRQKIEDEFPELTHGLIKMLWPNYL
RPVPAMTLIEYTPDMDKSSVPVLIPRNEQFTTNAGEIRVDEVLPSDAKKEEPPPCTFTLCRDIWLLPVRLEQIENRSTTR
NGVINITFSVAPGTDFRTLDLNKLRFWLGNDDNYTRDQLYLWFCEYLQGADLTVGEQHIRLPEFMLKAVGFEPQDAMLPW
PKNVHSGYRILQEYFCYPDAFLFFDLCGCPALPDGLQAEFFTLQLRFSRPLPVDIRLRRDSLRLYCAPAINLFIHHAEAI
TLDNRRADYPLVPSRHYPQHYDVFSVNSVVSQVQDMFRKKDLGRPVSTQAARQWPAFESFSHQMEYSRKREVVYWHHRTK
TSLFHRGFDHTLAFIHADGSYPSDESLLSNEVVSVSLTCTNRELPSQIRSGDITGTTGKNAAVASFRNITRPTQPLWPVI
DGSLHWSLLSAMNLNYLSLLDTDALKQVIANFDRHAIHHPQTARLSQQKLDAIERLETRPVDRLFTGIPVRGLASTLYLH
PEPFVCEGEMYLLGTVLSHFLSLYASVNSFHMLTVVNTESQETWKWTERIGQHPLI

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
APECO1_1761 YP_851424.1 hypothetical protein Not tested PAI II APEC-O1 Protein 0.0 60

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
EcHS_A0235 YP_001457009.1 hypothetical protein VFG2078 Protein 1e-118 43