Name : EcHS_A0226 (EcHS_A0226) Accession : YP_001457000.1 Strain : Escherichia coli HS Genome accession: NC_009800 Putative virulence/resistance : Unknown Product : hypothetical protein Function : - COG functional category : S : Function unknown COG ID : COG3523 EC number : - Position : 241171 - 244695 bp Length : 3525 bp Strand : - Note : identified by match to protein family HMM PF06744; match to protein family HMM PF06761; match to protein family HMM TIGR03348 DNA sequence : GTGTTCAGATTACCAACACCCCGACTACTCAGCGGACTCAAATCAGCCCTGCGACCGGCGATGCCCCGGTTTAAAGTCTC TGCTTTCTGGCTGCTGATACTGGCGTGGATTTTTCTGCTGGTGTGGATCTGGTGGAAAGGCCCGACGTGGACGCTGTATG AAGAACAGTGGCTCAAACCACTGGCGAACCGCTGGCTGGCAACGGCGGCGTGGGGGATTATTGCCCTGATGTGGCTTACC GTCCGGGTGATGAAGCGCCTGCAACAGCTTGAAAAAATGCAAAAGCAACAGCGCGAGGAAGCCGTTGATCCGCTCAGCGT GGAACTGAACGCCCAGCAGCGTTATCTTGACCGCTGGCTGCTGCGCCTGCAACGCCATCTCGACAACCGCCGTTTCCTGT GGCAGTTGCCGTGGTATATGGTCATCGGCCCGGCGGGCAGTGGCAAAACGACACTGCTGCGCGAGGGGTTTCCGTCCGAC ATTATTTATGCCCCGGAGGGCGCACGCGGCGCAGAACAACGCCTGTACCTCACGCCCCATGTCGGTAAACAGGCGGTGAT CTTTGATATCGACGGCACACTCTGCGCTCCCGCTGATGCGGATATCCTGCATCGCCGTCTGTGGGAACATGCCCTCGGCT GGCTGAAAGAAAAGCGCGCGCGCCAGCCGCTGAATGGGATTATTCTGACACTCGATTTACCCGATCTGCTTACCGCAGAC AAACGCCGCCGCGAGCATCTGTTACAGGCGCTGCGCAGCCGCTTGCAGGATATACGCCAGCATCTTCACTGCCAGTTACC GGTTTACGTGGTACTTACCCGGCTTGATTTATTGCAGGGTTTTGCCGCCCTGTTTCAGTCCCTGAACAGACAGGATCGCG ATGCGATTCTGGGCGTCACGTTCACCCGCCGTGCCCATGAAAATGATGACTGGCGAACAGAGTTGAATGCTTTCTGGCAG ACATGGGTGGATCGAATGAATCTGGCGTTGCCGGATCTGATGGTCGCTCAGACTCACACCCGCGCGTCTTTATTCAGTTT TTCCCGCCAGATGCAGGGAAGCCGTGAACCGCTGGTGTCACTGCTTGAGGGTCTGCTTGATGGCGAAAATATGAACGTGA TGCTGCGTGGTGTCTATCTCACCTCTTCGCTTCAGCGTGGACAGATGGATGATATATTCACCCAGTCTGCCGCCCGCCAG TACCGGCTGGGCAATAACCCACTGGCGTCCTGGCCCCTGGTGGACACCGCGCCTTATTTCACCCGCAGCCTGTTCCCGCA GGCATTACTCGCAGAGCCTAATCTGGCAACAGAGAGCCGCGCCTGGCTGATACGTTCCCGTCGCCGCCTGACGGTTTTCT CTGCCACAGGCGGCGTGGCAGCACTGCTGCTCATCACCGGCTGGCATCACTATTACAACGGTAACTATCAGTCTGGCATC ACCGTGCTTAAGCAGGCCAAAGCCTTTATGGACGTGCCGCCTCCGCAGGGGGAAGATGACTTTGGCAACCTGCAACTGCC GCTCCTGAACCCGGTACGCGATGCCACACTGGCCTATGGCGACTGGGGCGACCGCAGCCGTCTGGCCGATATGGGACTGT ACCAGGGACGACGTATCGGGCCTTATGTGGAACAGACCTATCTGCAACTGCTGGAGCAACGTTACCTGCCCTCGCTGTTT AACGGGCTGGTCAAAGCGCTGAACGCCGCGCCGCCGGAGAGTGAAGAAAAACTCGCGGTGCTGCGCGTGATGCGAATGCT GGAGGACAAAAGCGGACGTAACAATCAGGTGGTGAAGCAGTATATGGCAAAACGCTGGAGCGAAAAGTTCCACGGTCAGC GCGATATCCAGGCACAACTGATGTCCCATCTTGACTACGCGCTGGCTCATACTGACTGGCACGCAGAGCGTCAGGCGGGC GACGGTGACGCCATCAGCCGCTGGACGCCATATGACAAGCCCGTGGTATCAGCACAGAAAGAACTGAGCAAACTGCCTGT CTACCAGCGGGTTTACCAGAGCCTGAAAACGCGGGCGCTGGGCGTTCTTCCTGCCGACCTCAATCTGCGTGACCAGGTAG GGCCAACCTTTGACCAGGTGTTTACATCTGCCGATGACAACAAACTGGTTGTTCCACAGTTTCTTACCCGTTACGGCCTG CAAAGCTATTTTGTAAAACAGCGCGATGAACTGGTTGAACTGACGGCGATGGATTCCTGGGTACTTAACCTTACCCGCAG CGTGAAATACAGTGACGCCGACCGCGCGGAAATCCAGCGCCAGTTGACCGAGCAGTATATCAGCGACTACACCGCCACCT GGCGGGCCGGGATGGACAATCTGAATATCCGCAATTTTGAGTCCATCGGACAACTGACCGGGGCGCTGGAGCAGGTTATC AGCGGCGACCAGCCTTTGCAGCGGGCGCTGACCGTGCTGCGTGACAACACACAGCCAGGCGTCTTTTCTGAAAAACTCTC TGCCAAAGAACGGGAGGAAGCCCTGGCAGAGCCGGATTACCAGTTACTCACCCGCCTCGGGCATGAATTCGCCCCGGAAA ACAGTACCCTGGCAGTACAGAAAGACAAAGAAAGCACGATGCAGGCCGTGTATCAGCAACTCACCGAGTTGCACCGCTAC CTGCTGGCAATCCAGAACGCGCCTGTACCAGGGAAATCGGCGCTGAAAGCCGTGCAGTTACGGCTTGATCAGAACAGCAG CGATCCGATATTCGCCACCCGCCAGATGGCAAAAACGCTGCCTGCTCCGCTCAACCGCTGGGTTGGCAGACTGACTGACC AGGCCTGGCATGTGGTGATGGTGGAGGCTGTTCATTATATGGAAGTGGACTGGCGCGACAGCGTGGTGAAACCGTTTAAC GAGCAACTGGCAAATAACTATCCGTTTAATCCGCGTTCTGCACAGGATGCCTCACTGGATGCCTTCGAACGCTTCTTTAA ACCGGATGGCATACTGGATACCTTCTACCAGCAGAACCTGAAGCTGTTTATCGATAATGACCTGAGTCTGGAGGATGGCG ATAACAACGTCATTATTCGCGAAGATATTATTGCGCAACTGGAAACTGCGCAGAAAATCCGTGACATCTTCTTCAGCAAA CAGAACGGTCTGGGAACATCCTTTGCCGTGGAAACGGTATCGCTTTCAGGCAATAAACGCCGCAGTGTACTGAACCTTGA CGGTCAGTTAGTCGATTACAGCCAGGGCCGTAACTATACCGCCCATCTGGTCTGGCCTAACAACATGCGCGAAGGCAACG AAAGTAAGCTGACGCTCATCGGCACCAGCGGCAACGCGCCGCGCAGTATCAGCTTCAGCGGGCCGTGGGCGCAGTTCCGC CTGTTCGGGGCCGGACAACTGACCGGAGTACAGGATGGCAACTTTACCGTGCGCTTTAGCGTGGACGGTGGCGCGATGAC CTACCGTGTGCATACCGACACGGAAGATAACCCGTTCAGCGGTGGGTTGTTCAGCCAGTTTGGTCTGTCAGACACACTGT ACTGA Protein sequence : MFRLPTPRLLSGLKSALRPAMPRFKVSAFWLLILAWIFLLVWIWWKGPTWTLYEEQWLKPLANRWLATAAWGIIALMWLT VRVMKRLQQLEKMQKQQREEAVDPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSD IIYAPEGARGAEQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTAD KRRREHLLQALRSRLQDIRQHLHCQLPVYVVLTRLDLLQGFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQ TWVDRMNLALPDLMVAQTHTRASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQ YRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWLIRSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGI TVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQGRRIGPYVEQTYLQLLEQRYLPSLF NGLVKALNAAPPESEEKLAVLRVMRMLEDKSGRNNQVVKQYMAKRWSEKFHGQRDIQAQLMSHLDYALAHTDWHAERQAG DGDAISRWTPYDKPVVSAQKELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGL QSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESIGQLTGALEQVI SGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRY LLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLTDQAWHVVMVEAVHYMEVDWRDSVVKPFN EQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSK QNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFR LFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY |
Gene | GenBank Accn | Product | Virulance or Resistance | PAI or REI | Alignment Type | E-val | Identity |
aec30 | YP_851415.1 | hypothetical protein | Not tested | PAI II APEC-O1 | Protein | 0.0 | 79 |
aec30 | AAQ96724.1 | Aec30 | Not tested | AGI-1 | Protein | 0.0 | 78 |
pmt1 | AAN64194.1 | Pmt1 | Not tested | macrophage toxin pathogenicity island | Protein | 0.0 | 57 |