Name : ECOPMV1_03270 (ECOPMV1_03270) Accession : YP_008573044.1 Strain : Escherichia coli PMV-1 Genome accession: NC_022370 Putative virulence/resistance : Virulence Product : Procyclic acidic repetitive protein (PARP) Function : - COG functional category : - COG ID : - EC number : - Position : 3360404 - 3364966 bp Length : 4563 bp Strand : - Note : - DNA sequence : ATGAATAAGAAATTTAAATATAAGAAATCGCTTTTAGCGGCTATTTTAAGCGCAACCCTGTTAGCCGGTTGTGATGGTGG TGGTTCAGGATCGTCCTCCGATACGCCGTCTGTAGATTCTGGATCAGGGACTTTGCCGGAAGTGAAACCCGATCCAACAC CAACCCCGGAGCCGACACCTGAGCCGACGCCGGACCCAGAACCTACGCCGGATCCAACACCTGATCCTGAGCCGACACCA GAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACCCTGGGCGGAAGCCAGCGGGTAACTGGTGCTACCTG TAATGGTGAATCCAGCGATGGCTTTACCTTTACGCCAGGCAATACCGTGAGTTGTGTGGTGGGCAGTACGACCATTGCAA CATTCAACACCCAGTCAGAAGCTGCGCGTAGCCTGCGTGCGGTTGACAAAGTGTCGTTTAGCCTGGAGGACGCGCAGGAG CTGGCGAATTCTGAAAATAAGAAAACCAACGCCATCTCTCTGGTGACGTCCAGCGACAGTTGCCCCGCAGATGCAGAACA GCTTTGTCTTACTTTCTCGTCAGTGGTTGATCGCGCGCGATTTGAAAAACTGTATAAGCAAATTGATCTGGCAACAGACA ATTTCAGCAAGCTGGTCAATGAAGAGGTGGAAAACAATGCTGCGACTGATAAAGCGCCGTCCACCCATACCTCAACGGTA GTGCCAGTCACGACAGAGGGAACAAAACCGGATCTGAACGCGTCCTTCGTGTCGGCTAACGCGGAACAGTTTTATCAGTA TCAACCCACTGAAATCATTCTTTCCGAAGGCCAACTGGTGGATAGCCTGGGGAACGGTGTTGCTGGCGTTGACTACTACA CCAATTCAGGCCGTGGCGTAACTGACGAAAACGGTAAATTTTCCTTTAGCTGGGGCGAAACCATCTCCTTTGGTATCGAT ACCTTTGAACTGGGCTCAGTACGTGGCAATAAGTCGACCATTGCGCTGACTGAATTGGGTGATGAAGTTCGCGGGGCAAA TATCGATCAGCTCATTCATCGTTATTCGACGACTGGTCAAAATAATACTCGTGTTGTTCCGGACGATGTACGCAAGGTCT TTGCCGAATATCCCAACGTGATCAACGAGATAATCAATCTTTCGTTATCCAACGGTGCGACGCTGGATGAAGGCGATCAA AACGTTGTGCTGCCTAACGAATTTATCGAGCAGTTTAAGACGGGTCAGGCCAAAGAGATCGATACCGCGATTTGTGCGAA AACCGACGGTTGTAACGAGGCTCGCTGGTTCTCGCTGACAACGCGCAATGTTAATGACGGCCAGATTCAGGGCGTTATTA ACAAGCTGTGGGGCGTGGATACGAACTATCAGTCTGTCAGCAAGTTCCACGTCTTCCATGACTCTACCAACTTCTATGGC AGCACCGGTAACGCGCGCGGTCAGGCGGTGGTAAATATCTCCAACTCGGCATTCCCGATTCTGATGGCGCGTAATGATAA AAACTACTGGCTGGCGTTTGGCGAAAAACGCGCCTGGGATAAAAATGAGCTGGCGTACATTACGGAAGCGCCTTCCATTG TGCAGCCAGAGAACGTTACGCGCGATACTGCGACTTTCAACCTGCCGTTTATTTCGCTGGGGCAAGTCGGTGAAGGCAAA CTGATGGTTATCGGTAACCCGCACTACAACAGCATCCTGCGTTGCCCGAACGGTTACAGTTGGGGCGGTGGTGTTAATAG TAAAGGTGAGTGTACGCTCAGCGGTGATTCTGATGACATGAAGCACTTTATGCAGAACGTACTGCGCTACTTGTCAAATG ACATCTGGCAGCCAAATACCAAGAGCATCATGACTGTCGGCACCAACCTGGAGAACGTTTATTTCAAAAAAGCGGGCCAG GTATTGGGAAATAGTGCACCATTTGCTTTCCATGAGGATTTCACTGGTATCACGGTTAAACAGTTGACCAGCTATGGCGA TCTGAATCCGGAAGAGATTCCGTTGCTGATCCTCAACGGCTTTGAATATGTGACTCAGTGGTCTGGCGATCCCTATGCTG TGCCTCTGCGTGCAGATACCAGCAAACCGAAGCTGACTCAGCAGGATGTGACCGATCTGATCGCTTATCTGAACAAAGGT GGCTCGGTGCTGATCATGGAAAACGTGATGAGCAATCTTAAGGAAGAGAGCGCGTCCAGTTTTGTGCGTCTGCTGGATGC CGCGGGTCTGTCAATGGCTCTGAACAAATCGGTGGTGAACAACGATCCGCAAGGGTATCCGGATCGCGTTCGTCAGCGTC GCGCGACTGGCATTTGGGTTTATGAACGTTATCCTGCTGCAGACGGCGCGCAACCGCCGTACACCATCGACCCAAATACA GGGGAAGTGACCTGGAAATACCAGCAAGACAACAAGCCTGATGACAAGCCGAAACTGGAAGTTGCGAGCTGGCAGGAGGA AGTTGAGGGCAAACAGGTAACGCGTTATGCCTTTATTGATGAAGCGGAATACACAACAGAAGAATCTCTGGAAGCGGCAA AGGCAAAAATCTTTGAGAAGTTTCCTGGGTTACAGGAGTGTAAGGACTCGACTTACCATTACGAGATTAACTGTTTGGAG CGCCGCCCAGGCACGGATGTTCCGGTAACAGGTGGCATGTATGTTCCGCGCTATACGCAACTGAATCTTGACGCCGACAC CGCGAAAGCGATGGTGCAGGCGGCGGATTTAGGCACCAACATTCAGCGCCTGTATCAGCATGAGCTTTATTTCCGTACCA AAGGCAGTAAAGGTGAGCGTCTGAACAGTGTTGATCTGGAACGTCTGTACCAGAACATGTCGGTCTGGCTGTGGAACGAT ACGAAATATCGTTACGAAGAGGGCAAGGAAGATGAGCTGGGCTTTAAAACGTTCACCGAGTTCCTGAACTGCTACGCCAA TGATGCCTATGCAGGCGGCACCAAGTGCTCCGCAGATCTGAAAAAATCGCTGGTCGATAACAACATGATCTACGGTGACG GTAGCAGCAAAGCGGGCATGATGAACCCAAGCTATCCGCTCAACTATATGGAAAAACCGCTGACGCGTCTGATGCTGGGC CGTTCCTGGTGGGATCTGAACATTAAGGTTGATGTGGAGAAGTACCCAGGATCCGTATCGGCAAAGGGTGAGAGCGTTAC GGAAAACATCAGCCTGTACTCGAATCCGACCAAATGGTTTGCGGGTAACATGCAGTCAACCGGCCTGTGGGCACCGGCCC AGCAGGACGTCACCATTAAGTCTTCGGCGTCAGTCCCAGTGACTGTTACCGTGGCGCTGGCTGACGACCTGACTGGACGT GAGAAGCATGAAGTTGCGCTGAACCGTCCGCCAAGAGTGACTAAAACGTATACTCTGGAGGCTAACGGTGAAGTGACCTT CAAGGTGCCTTATGGTGGTCTGATTTATATCAAGGGCGACAGTAAGGATGATGTTTCTGCTAACTTCACCTTTACCGGTG TAGTAAAAGCGCCGTTCTATAAAGACGGCGAATGGAAAAACGATCTGGACTCACCGGCGCCGCTGGGCGAGCTGGAGTCT GCGTCGTTCGTCTATACCACGCCGAAGAAGAACCTTGAGGCCAGCAATTTCACTGGTGGTGTAGCAGAATTCGCTAAAGA TCTGGATACCTTTGCCAGCTCGATGAATGACTTCTACGGTCGTAATGATGAAGACGGTAAGCACCGGATGTTTACCTATA AAAACTTGACGGGGCACAAGCATCGTTTCACCAACGATGTGCAGATCTCCATCGGTGATGCGCACTCGGGTTATCCGGTA ATGAACAGCAGCTTCTCGACGAACAGCACCACGCTGCCGACGACGCCGCTGAACGACTGGCTGATTTGGCACGAAGTCGG TCATAACGCTGCAGAAACACCGCTGAACGTACCGGGTGCAACTGAAGTGGCGAACAACGTGCTGGCGCTGTACATGCAGG ATCGCTATCTCGGTAAGATGAACCGTGTCGCTGACGACATTACCGTCGCGCCGGAATATCTGGACGAGAGCAACGGTCAG GCCTGGGCGCGCGGCGGTGCGGGTGACCGTCTGCTGATGTACGCACAGTTGAAGGAGTGGGCAGAGGAAAACTTTGATAT CAAACAGTGGTATCCAGATGGTGAGCTGCCTAAGTTCTACAGCGATCGTAAAGGGATGAAGGGCTGGAACCTGTTCCAGT TGATGCACCGTAAAGCGCGCGGCGATGATGTTGGTAACAGCACCTTTGGTGGCAAGAATTACTGTGCTGAATCCAATGGT AACGCTGCCGACACGCTGATGCTGTGTGCATCCTGGGTCGCTCAGGCGGATCTTTCGGAATTCTTTAAGAAATGGAATCC GGGTGCAAGTGCTTACCAGTTGCCGGGAGCAACGGAGATGAGTTTCCAGGGCGGTGTGAGCTCTTCGGCTTACAGCACGC TGGCGTCACTCAAGCTGCCGAAACCGGAAAAAGGGCCGGAAACCATTAACAAGGTTACCGAGCATAAGATGTCTGCCGAG TAA Protein sequence : MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPSVDSGSGTLPEVKPDPTPTPEPTPEPTPDPEPTPDPTPDPEPTP EPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFTPGNTVSCVVGSTTIATFNTQSEAARSLRAVDKVSFSLEDAQE LANSENKKTNAISLVTSSDSCPADAEQLCLTFSSVVDRARFEKLYKQIDLATDNFSKLVNEEVENNAATDKAPSTHTSTV VPVTTEGTKPDLNASFVSANAEQFYQYQPTEIILSEGQLVDSLGNGVAGVDYYTNSGRGVTDENGKFSFSWGETISFGID TFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLDEGDQ NVVLPNEFIEQFKTGQAKEIDTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYQSVSKFHVFHDSTNFYG STGNARGQAVVNISNSAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSIVQPENVTRDTATFNLPFISLGQVGEGK LMVIGNPHYNSILRCPNGYSWGGGVNSKGECTLSGDSDDMKHFMQNVLRYLSNDIWQPNTKSIMTVGTNLENVYFKKAGQ VLGNSAPFAFHEDFTGITVKQLTSYGDLNPEEIPLLILNGFEYVTQWSGDPYAVPLRADTSKPKLTQQDVTDLIAYLNKG GSVLIMENVMSNLKEESASSFVRLLDAAGLSMALNKSVVNNDPQGYPDRVRQRRATGIWVYERYPAADGAQPPYTIDPNT GEVTWKYQQDNKPDDKPKLEVASWQEEVEGKQVTRYAFIDEAEYTTEESLEAAKAKIFEKFPGLQECKDSTYHYEINCLE RRPGTDVPVTGGMYVPRYTQLNLDADTAKAMVQAADLGTNIQRLYQHELYFRTKGSKGERLNSVDLERLYQNMSVWLWND TKYRYEEGKEDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLG RSWWDLNIKVDVEKYPGSVSAKGESVTENISLYSNPTKWFAGNMQSTGLWAPAQQDVTIKSSASVPVTVTVALADDLTGR EKHEVALNRPPRVTKTYTLEANGEVTFKVPYGGLIYIKGDSKDDVSANFTFTGVVKAPFYKDGEWKNDLDSPAPLGELES ASFVYTTPKKNLEASNFTGGVAEFAKDLDTFASSMNDFYGRNDEDGKHRMFTYKNLTGHKHRFTNDVQISIGDAHSGYPV MNSSFSTNSTTLPTTPLNDWLIWHEVGHNAAETPLNVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLDESNGQ AWARGGAGDRLLMYAQLKEWAEENFDIKQWYPDGELPKFYSDRKGMKGWNLFQLMHRKARGDDVGNSTFGGKNYCAESNG NAADTLMLCASWVAQADLSEFFKKWNPGASAYQLPGATEMSFQGGVSSSAYSTLASLKLPKPEKGPETINKVTEHKMSAE |
Gene | GenBank Accn | Product | Virulance or Resistance | PAI or REI | Alignment Type | E-val | Identity |
unnamed | CAE85238.1 | hypothetical protein | Not tested | PAI V 536 | Protein | 0.0 | 86 |
VC0395_A0370 | YP_001216326.1 | lipoprotein | Not tested | VPI-1 | Protein | 0.0 | 49 |
VC0845 | NP_230493.1 | hypothetical protein | Not tested | VPI-1 | Protein | 0.0 | 49 |
acfD | AAK20802.1 | accessory colonization factor AcfD | Virulence | VPI | Protein | 0.0 | 49 |
acfD | ACK75667.1 | accessory colonization factor AcfD | Virulence | VPI | Protein | 0.0 | 49 |
acfD | ACK75652.1 | accessory colonization factor AcfD | Virulence | VPI | Protein | 0.0 | 48 |
acfD | ACK75646.1 | accessory colonization factor AcfD | Virulence | VPI | Protein | 0.0 | 48 |
acfD | ACK75664.1 | accessory colonization factor AcfD | Virulence | VPI | Protein | 0.0 | 48 |
acfD | ACK75670.1 | accessory colonization factor AcfD | Virulence | VPI | Protein | 0.0 | 48 |
acfD | ACK75655.1 | accessory colonization factor AcfD | Virulence | VPI | Protein | 0.0 | 48 |
acfD | ACK75649.1 | accessory colonization factor AcfD | Virulence | VPI | Protein | 0.0 | 48 |
acfD | ACK75661.1 | accessory colonization factor AcfD | Virulence | VPI | Protein | 0.0 | 48 |
acfD | ACK75658.1 | accessory colonization factor AcfD | Virulence | VPI | Protein | 0.0 | 48 |
Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity |
ECOPMV1_03270 | YP_008573044.1 | Procyclic acidic repetitive protein (PARP) | VFG0106 | Protein | 0.0 | 49 |