Gene Information

Name : EcHS_A0226 (EcHS_A0226)
Accession : YP_001457000.1
Strain : Escherichia coli HS
Genome accession: NC_009800
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3523
EC number : -
Position : 241171 - 244695 bp
Length : 3525 bp
Strand : -
Note : identified by match to protein family HMM PF06744; match to protein family HMM PF06761; match to protein family HMM TIGR03348

DNA sequence :
GTGTTCAGATTACCAACACCCCGACTACTCAGCGGACTCAAATCAGCCCTGCGACCGGCGATGCCCCGGTTTAAAGTCTC
TGCTTTCTGGCTGCTGATACTGGCGTGGATTTTTCTGCTGGTGTGGATCTGGTGGAAAGGCCCGACGTGGACGCTGTATG
AAGAACAGTGGCTCAAACCACTGGCGAACCGCTGGCTGGCAACGGCGGCGTGGGGGATTATTGCCCTGATGTGGCTTACC
GTCCGGGTGATGAAGCGCCTGCAACAGCTTGAAAAAATGCAAAAGCAACAGCGCGAGGAAGCCGTTGATCCGCTCAGCGT
GGAACTGAACGCCCAGCAGCGTTATCTTGACCGCTGGCTGCTGCGCCTGCAACGCCATCTCGACAACCGCCGTTTCCTGT
GGCAGTTGCCGTGGTATATGGTCATCGGCCCGGCGGGCAGTGGCAAAACGACACTGCTGCGCGAGGGGTTTCCGTCCGAC
ATTATTTATGCCCCGGAGGGCGCACGCGGCGCAGAACAACGCCTGTACCTCACGCCCCATGTCGGTAAACAGGCGGTGAT
CTTTGATATCGACGGCACACTCTGCGCTCCCGCTGATGCGGATATCCTGCATCGCCGTCTGTGGGAACATGCCCTCGGCT
GGCTGAAAGAAAAGCGCGCGCGCCAGCCGCTGAATGGGATTATTCTGACACTCGATTTACCCGATCTGCTTACCGCAGAC
AAACGCCGCCGCGAGCATCTGTTACAGGCGCTGCGCAGCCGCTTGCAGGATATACGCCAGCATCTTCACTGCCAGTTACC
GGTTTACGTGGTACTTACCCGGCTTGATTTATTGCAGGGTTTTGCCGCCCTGTTTCAGTCCCTGAACAGACAGGATCGCG
ATGCGATTCTGGGCGTCACGTTCACCCGCCGTGCCCATGAAAATGATGACTGGCGAACAGAGTTGAATGCTTTCTGGCAG
ACATGGGTGGATCGAATGAATCTGGCGTTGCCGGATCTGATGGTCGCTCAGACTCACACCCGCGCGTCTTTATTCAGTTT
TTCCCGCCAGATGCAGGGAAGCCGTGAACCGCTGGTGTCACTGCTTGAGGGTCTGCTTGATGGCGAAAATATGAACGTGA
TGCTGCGTGGTGTCTATCTCACCTCTTCGCTTCAGCGTGGACAGATGGATGATATATTCACCCAGTCTGCCGCCCGCCAG
TACCGGCTGGGCAATAACCCACTGGCGTCCTGGCCCCTGGTGGACACCGCGCCTTATTTCACCCGCAGCCTGTTCCCGCA
GGCATTACTCGCAGAGCCTAATCTGGCAACAGAGAGCCGCGCCTGGCTGATACGTTCCCGTCGCCGCCTGACGGTTTTCT
CTGCCACAGGCGGCGTGGCAGCACTGCTGCTCATCACCGGCTGGCATCACTATTACAACGGTAACTATCAGTCTGGCATC
ACCGTGCTTAAGCAGGCCAAAGCCTTTATGGACGTGCCGCCTCCGCAGGGGGAAGATGACTTTGGCAACCTGCAACTGCC
GCTCCTGAACCCGGTACGCGATGCCACACTGGCCTATGGCGACTGGGGCGACCGCAGCCGTCTGGCCGATATGGGACTGT
ACCAGGGACGACGTATCGGGCCTTATGTGGAACAGACCTATCTGCAACTGCTGGAGCAACGTTACCTGCCCTCGCTGTTT
AACGGGCTGGTCAAAGCGCTGAACGCCGCGCCGCCGGAGAGTGAAGAAAAACTCGCGGTGCTGCGCGTGATGCGAATGCT
GGAGGACAAAAGCGGACGTAACAATCAGGTGGTGAAGCAGTATATGGCAAAACGCTGGAGCGAAAAGTTCCACGGTCAGC
GCGATATCCAGGCACAACTGATGTCCCATCTTGACTACGCGCTGGCTCATACTGACTGGCACGCAGAGCGTCAGGCGGGC
GACGGTGACGCCATCAGCCGCTGGACGCCATATGACAAGCCCGTGGTATCAGCACAGAAAGAACTGAGCAAACTGCCTGT
CTACCAGCGGGTTTACCAGAGCCTGAAAACGCGGGCGCTGGGCGTTCTTCCTGCCGACCTCAATCTGCGTGACCAGGTAG
GGCCAACCTTTGACCAGGTGTTTACATCTGCCGATGACAACAAACTGGTTGTTCCACAGTTTCTTACCCGTTACGGCCTG
CAAAGCTATTTTGTAAAACAGCGCGATGAACTGGTTGAACTGACGGCGATGGATTCCTGGGTACTTAACCTTACCCGCAG
CGTGAAATACAGTGACGCCGACCGCGCGGAAATCCAGCGCCAGTTGACCGAGCAGTATATCAGCGACTACACCGCCACCT
GGCGGGCCGGGATGGACAATCTGAATATCCGCAATTTTGAGTCCATCGGACAACTGACCGGGGCGCTGGAGCAGGTTATC
AGCGGCGACCAGCCTTTGCAGCGGGCGCTGACCGTGCTGCGTGACAACACACAGCCAGGCGTCTTTTCTGAAAAACTCTC
TGCCAAAGAACGGGAGGAAGCCCTGGCAGAGCCGGATTACCAGTTACTCACCCGCCTCGGGCATGAATTCGCCCCGGAAA
ACAGTACCCTGGCAGTACAGAAAGACAAAGAAAGCACGATGCAGGCCGTGTATCAGCAACTCACCGAGTTGCACCGCTAC
CTGCTGGCAATCCAGAACGCGCCTGTACCAGGGAAATCGGCGCTGAAAGCCGTGCAGTTACGGCTTGATCAGAACAGCAG
CGATCCGATATTCGCCACCCGCCAGATGGCAAAAACGCTGCCTGCTCCGCTCAACCGCTGGGTTGGCAGACTGACTGACC
AGGCCTGGCATGTGGTGATGGTGGAGGCTGTTCATTATATGGAAGTGGACTGGCGCGACAGCGTGGTGAAACCGTTTAAC
GAGCAACTGGCAAATAACTATCCGTTTAATCCGCGTTCTGCACAGGATGCCTCACTGGATGCCTTCGAACGCTTCTTTAA
ACCGGATGGCATACTGGATACCTTCTACCAGCAGAACCTGAAGCTGTTTATCGATAATGACCTGAGTCTGGAGGATGGCG
ATAACAACGTCATTATTCGCGAAGATATTATTGCGCAACTGGAAACTGCGCAGAAAATCCGTGACATCTTCTTCAGCAAA
CAGAACGGTCTGGGAACATCCTTTGCCGTGGAAACGGTATCGCTTTCAGGCAATAAACGCCGCAGTGTACTGAACCTTGA
CGGTCAGTTAGTCGATTACAGCCAGGGCCGTAACTATACCGCCCATCTGGTCTGGCCTAACAACATGCGCGAAGGCAACG
AAAGTAAGCTGACGCTCATCGGCACCAGCGGCAACGCGCCGCGCAGTATCAGCTTCAGCGGGCCGTGGGCGCAGTTCCGC
CTGTTCGGGGCCGGACAACTGACCGGAGTACAGGATGGCAACTTTACCGTGCGCTTTAGCGTGGACGGTGGCGCGATGAC
CTACCGTGTGCATACCGACACGGAAGATAACCCGTTCAGCGGTGGGTTGTTCAGCCAGTTTGGTCTGTCAGACACACTGT
ACTGA

Protein sequence :
MFRLPTPRLLSGLKSALRPAMPRFKVSAFWLLILAWIFLLVWIWWKGPTWTLYEEQWLKPLANRWLATAAWGIIALMWLT
VRVMKRLQQLEKMQKQQREEAVDPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSD
IIYAPEGARGAEQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTAD
KRRREHLLQALRSRLQDIRQHLHCQLPVYVVLTRLDLLQGFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQ
TWVDRMNLALPDLMVAQTHTRASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQ
YRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWLIRSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGI
TVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQGRRIGPYVEQTYLQLLEQRYLPSLF
NGLVKALNAAPPESEEKLAVLRVMRMLEDKSGRNNQVVKQYMAKRWSEKFHGQRDIQAQLMSHLDYALAHTDWHAERQAG
DGDAISRWTPYDKPVVSAQKELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGL
QSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESIGQLTGALEQVI
SGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRY
LLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLTDQAWHVVMVEAVHYMEVDWRDSVVKPFN
EQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSK
QNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFR
LFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec30 YP_851415.1 hypothetical protein Not tested PAI II APEC-O1 Protein 0.0 79
aec30 AAQ96724.1 Aec30 Not tested AGI-1 Protein 0.0 78
pmt1 AAN64194.1 Pmt1 Not tested macrophage toxin pathogenicity island Protein 0.0 57