Name : ECH74115_B0104 (ECH74115_B0104) Accession : YP_002268480.1 Strain : Genome accession: NC_011350 Putative virulence/resistance : Virulence Product : immunoglobulin A1 protease domain protein Function : - COG functional category : S : Function unknown COG ID : COG4625 EC number : - Position : 64310 - 68212 bp Length : 3903 bp Strand : + Note : identified by match to protein family HMM PF02395; match to protein family HMM PF03797; match to protein family HMM TIGR01414 DNA sequence : ATGAATAAAATATACTCTCTTAAATACAGCCATATTACAGGAGGGTTAATCGCTGTTTCTGAATTATCCGGCAGAGTATC ATCAAGAGCAACTGGTAAGAAAAAACACAAACGCATACTTGCATTATGTTTTTTAGGCTTATTACAATCCTCATATTCTT TTGCGTCACAGATGGATATTTCAAATTTCTACATCCGTGACTATATGGATTTTGCACAGAACAAGGGCATATTTCAGGCT GGCGCAACAAATATTGAAATAGTGAAGAAAGATGGCTCCACCCTGAAACTACCGGAAGTACCATTTCCTGACTTCTCACC GGTTGCAAACAAAGGGTCAACCACATCTATTGGTGGTGCATACAGTATCACAGCCACACACAATACGAAAAACCACCACT CAGTTGCGACGCAAAACTGGGGGAACAGCACGTACAAACAAACTGACTGGAATACTTCACATCCTGATTTTGCAGTATCC CGACTTGACAAGTTTGTTGTTGAGACCCGAGGTGCGACTGAAGGCGCAGATATTTCGTTATCAAAACAGCAGGCACTTGA ACGTTACGGGGTTAATTATAAAGGAGAAAAGAAACTTATCGCATTCAGAGCCGGCTCTGGTGTGGTATCCGTTAAAAAAA ATGGACGCATAACTCCATTTAATGAGGTTTCTTATAAGCCAGAAATGTTAAATGGCTCTTTCGTTCACATTGATGACTGG AGTGGATGGCTGATATTAACCAACAACCAGTTTGATGAGTTTAATAACATTGCCTCTCAGGGTGACAGCGGTTCAGCACT GTTCGTCTATGATAACCAAAAGAAAAAGTGGGTTGTCGCTGGAACTGTCTGGGGGATTTATAATTACGCCAATGGCAAAA ACCACGCAGCATACAGTAAATGGAACCAGACAACCATTGACAACCTGAAGAACAAGTATTCTTACAACGTGGATATGTCA GGGGCTCAGGTTGCAACCATTGAAAATGGAAAACTGACAGGCACTGGCTCAGACACCACCGATATAAAAAATAAGGACTT AATATTTACTGGCGGTGGAGATATCCTCCTGAAATCCTCTTTTGATAATGGTGCTGGCGGTCTTGTCTTTAATGATAAAA AGACCTATCGAGTAAACGGGGATGATTTCACCTTTAAAGGTGCCGGTGTTGATACAAGAAACGGCAGCACCGTTGAGTGG AATATCCGGTATGATAATAAAGACAACCTTCACAAAATTGGTGATGGCACATTAGATGTCCGAAAAACCCAGAACACCAA CCTGAAAACAGGTGAGGGTCTTGTCATTCTTGGAGCTGAAAAAACATTCAATAATATCTACATAACCAGTGGTGATGGAA CTGTCCGACTGAATGCAGAAAATGCACTGTCTGGCGGTGAATACAACGGTATTTTCTTTGCGAAAAATGGCGGAACTCTT 
GACCTGAACGGATATAATCAGTCTTTCAATAAAATTGCTGCAACTGATTCAGGTGCTGTAATAACCAATACGTCAACCAA AAAATCCATTTTATCCCTGAATAATACTGCTGACTATATCTATCACGGCAACATAAACGGGAATCTGGACGTACTTCAGC ATCATGAGACGAAAAAAGAGAACCGTCGTCTTATTCTTGATGGGGGCGTGGACACAACAAATGATATAAGCCTGCGTAAT ACACAACTGTCCATGCAGGGACATGCCACTGAACATGCCATTTATCGGGATGGAGCTTTCTCTTGTTCACTACCAGCTCC TATGCGCTTTTTGTGTGGCAGTGATTATGTTGCAGGAATGCAAAATACAGAAGCTGATGCTGTAAAACAAAACGGAAATG CCTATAAAACCAACAATGCTGTCTCTGATTTATCGCAGCCAGACTGGGAAACCGGAACATTCAGATTTGGAACGCTACAT CTTGAAAATTCCGATTTTTCTGTTGGTCGTAATGCAAATGTAATCGGGGACATTCAGGCCAGTAAATCAAACATTACTAT TGGTGACACTACAGCATATATTGATTTGCATGCTGGTAAAAATATTACCGGTGATGGTTTTGGCTTCCGCCAGAATATTG TGCGTGGAAACTCACAAGGAGAAACGCTGTTTACAGGAGGGATCACAGCAGAAGACAGCACTATCGTTATTAAAGATAAA GCAAAAGCATTATTTTCAAATTATGTATACCTGCTGAACACAAAAGCAACCATAGAGAACGGTGCTGATGTGACAACTCA AAGTGGTATGTTCTCCACGAGCGATATCAGCATCTCTGGTAATCTGTCCATGACAGGCAATCCCGACAAAGACAATAAAT TCGAGCCCTCAATATATCTGAATGATGCTTCTTATCTACTGACTGACGACTCCGCCAGACTCGTTGCCAAAAATAAAGCA TCTGTGGTGGGAGATATACACTCCACTAAAAGTGCATCCATCATGTTTGGTCATGATGAAAGCGACCTCTCGCAGTTGTC TGACAGAACCTCAAAAGGGCTTGCACTTGGTCTTTTAGGTGGCTTTGATGTCTCATATCGCGGTTCAGTCAATGCCCCGT CAGCATCTGCCACTATGAACAACACCTGGTGGCAACTAACCGGAGATTCTGCGCTGAAAACACTGAAAAGTACAAACAGC ATGGTCTATTTCACTGACAGCGCAAACAATAAGAAATTCCATACGCTGACGGTCGATGAGCTGGCAACCAGCAACAGCGC CTATGCGATGCGTACAAACCTTTCTGAATCAGACAAACTGGAGGTCAAAAAACACTTGTCTGGTGAGAACAATATTTTAC TCGTTGATTTCCTTCAGAAACCAACGCCTGAAAAACAACTGAATATTGAACTGGTAAGCGCGCCAAAAGACACCAATGAA AATGTCTTTAAAGCCAGTAAACAAACCATTGGTTTCAGTGATGTAACGCCGGTCATTACAACCAGGGAAACCGATGACAA AATAACATGGTCACTGACAGGCTATAACACGGTAGCAAACAAGGAAGCAACCCGGAATGCCGCCGCCCTGTTCTCTGTTG ACTATAAAGCGTTTCTGAACGAGGTCAACAACCTGAACAAACGTATGGGTGACCTGCGTGATATCAACGGCGAAGCCGGT GCATGGGCACGCATCATGAGCGGTACCGGCTCTGCCAGTGGTGGTTTCAGTGACAACTACACGCACGTTCAGGTCGGGGT CGACAAAAAACACGAGCTGGACGGACTGGATTTGTTTACCGGTTTCACTGTCACACACACTGACAGCAGTGCCTCCGCCG ATGTTTTCAGTGGTAAAACGAAGTCTGTGGGGGCTGGCCTGTATGCTTCCGCCATGTTTGATTCCGGTGCCTATATCGAC 
CTGATTGGCAAGTATGTTCACCATGATAATGAGTACACTGCAACCTTTGCCGGACTCGGAACCCGTGATTACAGCACGCA TTCATGGTATGCCGGTGCAGAAGCGGGCTACCGCTATCATGTCACTGAGGATGCCTGGATTGAGCCACAGGCTGAGCTGG TTTACGGTTCTGTATCCGGTAAACAGTTTGCATGGAAGGACCAGGGAATGCATCTGTCCATGAAGGACAAGGACTACAAT CCGCTGATTGGCCGAACGGGTGTGGATGTGGGTAAATCCTTCTCTGGTAAGGACTGGAAAGTGACAGCCCGTGCCGGTCT GGGCTACCAGTTCGACCTGCTGGCTAACGGCGAAACCGTATTGCGGGATGCATCTGGTGAAAAACGCATCAAAGGTGAAA AGGACAGCCGTATGCTGATGTCCGTTGGCCTGAATGCAGAAATCAGGGATAACGTCCGCTTTGGACTGGAGTTTGAGAAA TCCGCCTTTGGTAAGTACAACGTTGATAATGCTGTCAACGCTAATTTCCGTTACTCGTTCTGA Protein sequence : MNKIYSLKYSHITGGLIAVSELSGRVSSRATGKKKHKRILALCFLGLLQSSYSFASQMDISNFYIRDYMDFAQNKGIFQA GATNIEIVKKDGSTLKLPEVPFPDFSPVANKGSTTSIGGAYSITATHNTKNHHSVATQNWGNSTYKQTDWNTSHPDFAVS RLDKFVVETRGATEGADISLSKQQALERYGVNYKGEKKLIAFRAGSGVVSVKKNGRITPFNEVSYKPEMLNGSFVHIDDW SGWLILTNNQFDEFNNIASQGDSGSALFVYDNQKKKWVVAGTVWGIYNYANGKNHAAYSKWNQTTIDNLKNKYSYNVDMS GAQVATIENGKLTGTGSDTTDIKNKDLIFTGGGDILLKSSFDNGAGGLVFNDKKTYRVNGDDFTFKGAGVDTRNGSTVEW NIRYDNKDNLHKIGDGTLDVRKTQNTNLKTGEGLVILGAEKTFNNIYITSGDGTVRLNAENALSGGEYNGIFFAKNGGTL DLNGYNQSFNKIAATDSGAVITNTSTKKSILSLNNTADYIYHGNINGNLDVLQHHETKKENRRLILDGGVDTTNDISLRN TQLSMQGHATEHAIYRDGAFSCSLPAPMRFLCGSDYVAGMQNTEADAVKQNGNAYKTNNAVSDLSQPDWETGTFRFGTLH LENSDFSVGRNANVIGDIQASKSNITIGDTTAYIDLHAGKNITGDGFGFRQNIVRGNSQGETLFTGGITAEDSTIVIKDK AKALFSNYVYLLNTKATIENGADVTTQSGMFSTSDISISGNLSMTGNPDKDNKFEPSIYLNDASYLLTDDSARLVAKNKA SVVGDIHSTKSASIMFGHDESDLSQLSDRTSKGLALGLLGGFDVSYRGSVNAPSASATMNNTWWQLTGDSALKTLKSTNS MVYFTDSANNKKFHTLTVDELATSNSAYAMRTNLSESDKLEVKKHLSGENNILLVDFLQKPTPEKQLNIELVSAPKDTNE NVFKASKQTIGFSDVTPVITTRETDDKITWSLTGYNTVANKEATRNAAALFSVDYKAFLNEVNNLNKRMGDLRDINGEAG AWARIMSGTGSASGGFSDNYTHVQVGVDKKHELDGLDLFTGFTVTHTDSSASADVFSGKTKSVGAGLYASAMFDSGAYID LIGKYVHHDNEYTATFAGLGTRDYSTHSWYAGAEAGYRYHVTEDAWIEPQAELVYGSVSGKQFAWKDQGMHLSMKDKDYN PLIGRTGVDVGKSFSGKDWKVTARAGLGYQFDLLANGETVLRDASGEKRIKGEKDSRMLMSVGLNAEIRDNVRFGLEFEK SAFGKYNVDNAVNANFRYSF |
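The Position and Length fields of this record can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon; neither convention is stated in the record itself. The function and constant names are illustrative only.

```python
# Values copied from the record's Position and Length fields.
START_BP = 64310
END_BP = 68212
REPORTED_LENGTH_BP = 3903


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_residues(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = span_length(START_BP, END_BP)
residues = expected_residues(length_bp)

print(length_bp == REPORTED_LENGTH_BP)  # does the span match the Length field?
print(residues)                         # residue count to compare with the protein sequence
```

Under these assumptions the span matches the reported 3903 bp, and the resulting residue count can be compared with the length of the Protein sequence field.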
Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity |
sigA | NP_838462.1 | serine protease | Virulence | SHI-1 | Protein | 0.0 | 56 |
sigA | NP_708742.1 | serine protease | Virulence | SHI-1 | Protein | 0.0 | 56 |
sigA | AAF67320.1 | exported serine protease SigA | Virulence | SHI-1 | Protein | 0.0 | 56 |
sat | YP_002414040.1 | Serine protease | Not tested | Not named | Protein | 0.0 | 56 |
espC | AAG37043.1 | enterotoxin EspC | Virulence | espC PAI | Protein | 0.0 | 53 |
unnamed | CAC39286.1 | hypothetical protein | Not tested | LPA | Protein | 0.0 | 48 |
pic | NP_838464.1 | serine protease precursor | Virulence | SHI-1 | Protein | 0.0 | 43 |
Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity |
ECH74115_B0104 | YP_002268480.1 | immunoglobulin A1 protease domain protein | VFG0844 | Protein | 0.0 | 100 |
ECH74115_B0104 | YP_002268480.1 | immunoglobulin A1 protease domain protein | VFG0862 | Protein | 0.0 | 58 |
ECH74115_B0104 | YP_002268480.1 | immunoglobulin A1 protease domain protein | VFG0630 | Protein | 0.0 | 56 |
ECH74115_B0104 | YP_002268480.1 | immunoglobulin A1 protease domain protein | VFG0902 | Protein | 0.0 | 56 |
ECH74115_B0104 | YP_002268480.1 | immunoglobulin A1 protease domain protein | VFG0772 | Protein | 0.0 | 53 |