Name : c3655 (c3655) Accession : NP_755530.1 Strain : Escherichia coli CFT073 Genome accession: NC_004431 Putative virulence/resistance : Virulence Product : antigen 43 Function : - COG functional category : M : Cell wall/membrane/envelope biogenesis COG ID : COG3468 EC number : - Position : 3491808 - 3494936 bp Length : 3129 bp Strand : + Note : AG43; Sap; Escherichia coli K-12 ortholog: b2000; Escherichia coli O157:H7 ortholog: z1651 DNA sequence : ATGCTGATGAAACGACATCTGAATACCTGCTACAGGCTGGTATGGAATCACATTACGGGCGCTTTCGTGGTTGCCTCCGA ACTGGCCCGCGCACGGGGTAAACGTGGCGGTGTGGCGGTTGCACTGTCTCTTGCCGCGGTCACGCCACTCCCGGTGCTGT CTGCTGACATCGTTGTGCACCCGGGTGAAACAGTGAATGGCGGAACACTGGTAAACCATGACAACCAGTTTGTATCCGGA ACAGCTAATGGCGTGACTGTCAGTACCGGGCTTGAGCTGGGGCCGGACAGTGACGAAAACACCGGCGGGCAATGGATAAA AGCGGGTGGCACAGGCAGAAACACCACTGTCACCGCAAATGGTCGTCAGATTGTGCAGGCAGGAGGAACTGCCAGTGATA CGGTTATTCGTGATGGCGGAGGGCAGAGCCTTAACGGACTGGCGGTGAACACCACGCTGGATAACAGAGGTGAGCAGTGG GTACACGGGGGAGGGAAAGCTGCCGGTACAATTATTAACCAGGATGGTTACCAGACCATAAAACATGGCGGACTGGCAAC CGGAACCATCGTCAACACCGGTGCAGAAGGTGGTCCGGAGTCTGAAAATGTGTCCAGCGGTCAGATGGTCGGAGGGACGG CTGAATCCACCACCATCAATAAAAATGGCCGGCAGGTTATCTGGTCTTCGGGGATGGCACGGGACACCCTCATTTACGCC GGTGGTGACCAGACGGTACACGGAGAGGCACATAACACCCGACTGGAGGGGGGTAACCAGTATGTACACAACGGTGGCAC GGCAACAGAGACGCTGATAAACCGTGATGGCTGGCAGGTGATTAAGGAAGGAGGAACTGCCGCGCATACCACCATCAACC AGAAAGGAAAGCTGCAGGTGAATGCCGGCGGTAAAGCGTCTGATGTCACCCAGAACACGGGCGGTGCACTGGTTACCAGT ACAGCTGCAACCGTCACCGGCACAAACCGCCTGGGAGCATTCTCCGTTGTGGCGGGTAAGGCTGATAATGTCGTACTGGA AAATGGCGGACGTCTGGATGTGCTGAGCGGACACACAGCCACTAATACCCGTGTGGATGATGGCGGAACGCTGGATATCC GCAACGGTGGTGCCGCCACCACCGTATCCATGGGGAATGGCGGTGTACTGCTGGCCGATTCCGGTGCCGCTGTCAGTGGT ACCCGGAGCGACGGAAAGGCATTCAGTATCGGAGGCGGTCAGGCGGATGCCCTGATGCTGGAAAAAGGCAGTTCATTCAC GCTGAACGCCGGTGATACGGCCACGGATACCACGGTAAATGGCGGACTGTTCACCGCCAGGGGCGGCACACTGGCGGGCA CCACCACGCTGAATAACGGCGCCATACTTACCCTTTCCGGGAAGACGGTGAACAACGATACCCTGACCATCCGTGAAGGC GATGCACTCCTGCAGGGAGGCTCTCTCACCGGTAACGGCAGCGTGGAAAAATCAGGAAGTGGCACACTCACTGTCAGCAA CACCACACTCACCCAGAAAGCCGTCAACCTGAATGAAGGCACGCTGACGCTGAACGACAGTACCGTCACCACGGATGTCA TTGCTCAGCGCGGTACAGCCCTGAAGCTGACCGGCAGCACTGTGCTGAACGGTGCCATTGACCCCACGAATGTCACTCTC GCCTCCGATGCCACCTGGAATATCCCCGATAACGCCACGGTGCAGTCGGTGGTGGATGACCTCAGCCATGCCGGACAGAT TCATTTCACCTCCTCACGCACAGGGACATTTGTGCCGGCGACTCTGAAAGTGAAAAACCTGAACGGGCAGAATGGCACCA TCAGCCTGCGTGTACGCCCGGATATGGCACAGAACAATGCTGACAGACTGGTCATTGACGGCGGCAGGGCAACCGGAAAA ACCATCCTGAACCTGGTGAACGCCGGCAACAGTGCGTCGGGGCTGGCGACCAGCGGTAAGGGTATTCAGGTGGTGGAAGC CATTAACGGTGCCACCACGGAGGAAGGGGCCTTTGTCCAGGGGAACAGGCTGCAGGCCGGGGCCTTTAACTACTCCCTCA ACCGGGACAGTGATGAGAGCTGGTATCTGCGCAGTGAAAATGCTTATCGTGCAGAAGTCCCCCTGTATGCCTCCATGCTG ACACAGGCAATGGACTATGACCGGATTCTGGCAGGCTCACGCAGCCATCAGACCGGTGTAAACGGTGAAAATAACAGCGT CCGTCTCAGCATTCAGGGCGGCCATCTCGGTCACGATAACAACGGCGGTATTGCCCGTGGGGCCACGCCGGAAAGCAGCG GCAGCTATGGCTTCGTCCGCCTGGAGGGTGACCTGCTGAGAACAGATGTTGCCGGTATGTCTGTGACCGCAGGGATATAT GGTGCTGCAGGCCATTCTTCCGTTGATGTTAAGGATGATGACGGCTCCCGTGCCGGCACGGTCCGGGATGATGCCGGCAG CCTGGGCGGATATATGAACCTGACACACACCTCCTCCGGCCTGTGGGCTGACATAGTGGCACAGGGAACCCGCCACAGTA TGAAAGCGTCATCGGGCAATAACGACTTCCGCGCACGGGGCCGGGGCTGGCTGGGCTCACTGGAAACCGGTCTGCCCTTC AGTATCACTGACAACCTGATGCTGGAGCCACGGCTGCAGTACACCTGGCAGGGGCTTTCGCTGGATGACGGTAAGGACAA CGCCGGTTATGTGAAGTTCGGGCATGGCAGTGCACAACATGTGCGTGCCGGTTTCCGTCTGGGCAGCCACAACGATATGA CCTTTGGTGAAGGCACCTCATCCCGTGCCCCCCTGCGTGACAGTGCAAAACACAGTGTGCGTGAACTGCCGGTGAACTGG TGGGTACAGCCTTCCGTTATCCGCACCTTCAGCTCCCGGGGTGATATGCGTGTGGGGACCTCCACTGCAGGCAGCGGGAT GACGTTCTCTCCCTCACAGAATGGCACATCACTGGACCTGCAGGCCGGACTGGAAGCCCGTGTCCGGGAAAATATCACCC TGGGCGTTCAGGCCGGTTATGCCCACAGCGTCAGCGGCAGCAGCGCTGAAGGGTATAACGGTCAGGCCACACTGAATGTG ACCTTCTGA Protein sequence : MLMKRHLNTCYRLVWNHITGAFVVASELARARGKRGGVAVALSLAAVTPLPVLSADIVVHPGETVNGGTLVNHDNQFVSG TANGVTVSTGLELGPDSDENTGGQWIKAGGTGRNTTVTANGRQIVQAGGTASDTVIRDGGGQSLNGLAVNTTLDNRGEQW VHGGGKAAGTIINQDGYQTIKHGGLATGTIVNTGAEGGPESENVSSGQMVGGTAESTTINKNGRQVIWSSGMARDTLIYA GGDQTVHGEAHNTRLEGGNQYVHNGGTATETLINRDGWQVIKEGGTAAHTTINQKGKLQVNAGGKASDVTQNTGGALVTS TAATVTGTNRLGAFSVVAGKADNVVLENGGRLDVLSGHTATNTRVDDGGTLDIRNGGAATTVSMGNGGVLLADSGAAVSG TRSDGKAFSIGGGQADALMLEKGSSFTLNAGDTATDTTVNGGLFTARGGTLAGTTTLNNGAILTLSGKTVNNDTLTIREG DALLQGGSLTGNGSVEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTL ASDATWNIPDNATVQSVVDDLSHAGQIHFTSSRTGTFVPATLKVKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGK TILNLVNAGNSASGLATSGKGIQVVEAINGATTEEGAFVQGNRLQAGAFNYSLNRDSDESWYLRSENAYRAEVPLYASML TQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTDVAGMSVTAGIY GAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYMNLTHTSSGLWADIVAQGTRHSMKASSGNNDFRARGRGWLGSLETGLPF SITDNLMLEPRLQYTWQGLSLDDGKDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRAPLRDSAKHSVRELPVNW WVQPSVIRTFSSRGDMRVGTSTAGSGMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNV TF |
Gene | GenBank Accn | Product | Virulance or Resistance | PAI or REI | Alignment Type | E-val | Identity |
SF2991 | NP_708765.1 | outer membrane fluffing protein | Not tested | SHI-1 | Protein | 0.0 | 96 |
unnamed | CAD66200.1 | hypothetical protein | Virulence | PAI III 536 | Protein | 0.0 | 96 |
sap | AAK00474.1 | Sap | Not tested | SHI-1 | Protein | 0.0 | 96 |
aidA | ADD91708.1 | AidA | Not tested | PAI-I AL862 | Protein | 0.0 | 88 |
flu | CAI43838.1 | antigen 43 | Not tested | LEE | Protein | 0.0 | 87 |
Z1211 | NP_286746.1 | adhesin | Not tested | TAI | Protein | 0.0 | 58 |
Z1651 | NP_287154.1 | adhesin | Not tested | TAI | Protein | 0.0 | 58 |
unnamed | AAL08472.1 | putative autotransporter | Not tested | SRL | Protein | 0.0 | 57 |
unnamed | CAE85197.1 | antigen43 protein orthologue | Not tested | PAI V 536 | Protein | 0.0 | 57 |
Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity |
c3655 | NP_755530.1 | antigen 43 | VFG1675 | Protein | 0.0 | 96 |
c3655 | NP_755530.1 | antigen 43 | VFG0655 | Protein | 0.0 | 96 |
c3655 | NP_755530.1 | antigen 43 | VFG1063 | Protein | 0.0 | 57 |