Gene Information

Name : ECO26_5486 (ECO26_5486)
Accession : YP_003232352.1
Strain : Escherichia coli 11368
Genome accession: NC_013361
Putative virulence/resistance : Virulence
Product : adhesin
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3468
EC number : -
Position : 5571377 - 5574496 bp
Length : 3120 bp
Strand : +
Note : Integrative element ECO26_IE09

DNA sequence :
ATGAAACGACATCTGAACACCAGCTACAGGCTGGTATGGAATCACATGACGGGCGCTTTCGTGGTTGCCTCCGAACTGGC
CCGCGCACGGGGTAAACGTGGCGGTGTGGCGGTTGCACTGTCTCTTGCCGCAGTCACGTCACTCCCGGTGCTGGCTGCTG
ACATCGTTGTGCACCCGGGAGAAACCGTGAACGGCGGAACACTGGCAAATCATGACAACCAGATTGTCTTCGGTACGACC
AACGGAATGACCATCAGTACCGGGCTGGAGTATGGGCCGGATAACGAGGCCAATACCGGCGGGCAATGGGTACAGGATGG
CGGAACAGCCAACAAAACGACTGTCACCAGTGGTGGTCTTCAGAGAGTGAACCCCGGTGGAAGTGTCTCAGACACGGTTA
TCAGTGCCGGAGGCGGACAGAGCCTTCAGGGACGGGCAGTGAACACCACGCTGAATGGTGGCGAACAGTGGATGCATGAG
GGGGCGATAGCCACAGGAACCGTCATTAATGATAAGGGCTGGCAGGTCGTCAAGCCCGGTACAGTGGCAACGGATACCGT
TGTTAATACCGGGGCGGAAGGGGGACCGGATGCAGAAAACGGTGATACCGGGCAGTTTGTTCGCGGGGATGCCGTACGCA
CAACCATCAATAAAAACGGTCGCCAGATTGTGAGAACTGAAGGAACGGCAAATACCACTGTGGTTTATGCCGGCGGCGAC
CAGACTGTACATGGTCACGCACTGGATACCACGCTGAATGGGGGATACCAGTATGTGCACAACGGCGGTACAGCGTCTGA
CACTGTTGTGAACAGTGACGGCTGGCAGATTGTCAAAAACGGGGGTGTGGCCGGGAATACCACCGTTAATCAGAAGGGCA
GACTGCAGGTGGACGCCGGTGGTACAGCCACGAATGTCACCCTGAAGCAGGGCGGCGCACTGGTTACCAGTACGGCTGCA
ACCGTTACCGGCATAAACCGCCTGGGAGCATTCTCTGTTGTGGAGGGTAAAGCTGATAATGTCGTACTGGAAAATGGCGG
ACGCCTGGATGTGCTGACCGGACACACAGCCACTAATACCCGCGTGGATGATGGCGGAACGCTGGATGTCCGCAACGGTG
GCACCGCCACCACCGTATCCATGGGAAATGGCGGTGTACTGCTGGCCGATTCCGGTGCCGCTGTCAGTGGTACCCGGAGC
GACGGAAAGGCATTCAGTATCGGAGGCGGTCAGGCGGATGCCCTGATGCTGGAAAAAGGCAGTTCATTCACGCTGAACGC
CGGTGATACGGCCACGGATACCACGGTAAATGGCGGACTGTTCACCGCCAGGGGCGGCACACTGGCGGGCACCACCACGC
TGAATAACGGCGCCATACTTACCCTTTCCGGGAAGACGGTGAACAACGATACCCTGACCATCCGTGAAGGCGATGCACTC
CTGCAGGGAGGCTCTCTCACCGGTAACGGCAGCGTGGAAAAATCAGGAAGTGGCACACTCACTGTCAGCAACACCACACT
CACCCAGAAAGCCGTCAACCTGAATGAAGGCACGCTGACGCTGAACGACAGTACCGTCACCACGGATGTCATTGCTCAGC
GCGGTACAGCCCTGAAGCTGACCGGCAGCACTGTGCTGAACGGTGCCATTGACCCCACGAATGTCACTCTCGCCTCCGGT
GCCACCTGGAATATCCCCGATAACGCCACGGTGCAGTCGGTGGTGGATGACCTCAGCCATGCCGGACAGATTCATTTCAC
CTCCACCCGCACAGGGAAGTTCGTACCGGCAACCCTGAAAGTGAAAAACCTGAACGGACAGAATGGCACCATCAGCCTGC
GTGTACGCCCGGATATGGCACAGAACAATGCTGACAGACTGGTCATTGACGGCGGCAGGGCAACCGGAAAAACCATCCTG
AACCTGGTGAACGCCGGCAACAGTGCGTCGGGGCTGGCGACCAGCGGTAAGGGTATTCAGGTGGTTGAAGCCATTAACGG
TGCCACCACGGAGGAAGGGGCCTTTGTCCAGGGGAACAGGCTGCAGGCCGGTGCCTTTAACTACTCCCTCAACCGGGACA
GTGATGAGAGCTGGTATCTGCGCAGTGAAAATGCTTATCGTGCAGAAGTCCCCCTGTATGCCTCCATGCTGACACAGGCA
ATGGACTATGACCGGATTGTGGCAGGCTCCCGCAGCCATCAGACCGGTGTAAATGGTGAAAACAACAGCGTCCGTCTCAG
CATTCAGGGCGGTCATCTCGGTCACGATAACAATGGCGGTATTGCCCGTGGGGCCACGCCGGAAAGCAGCGGCAGCTATG
GATTCGTCCGTCTGGAGGGTGACCTGATGAGAACAGAGGTTGCCGGTATGTCTGTGACCGCGGGGGTATATGGTGCTGCT
GGCCATTCTTCCGTTGATGTTAAGGATGATGACGGCTCCCGTGCCGGCACGGTCCGGGATGATGCCGGCAGCCTGGGCGG
ATACCTGAATCTGATACACAACGCCTCCGGACTGTGGGCTGACATTGTGGCCCTGGGAACCCGCCACAGCATGAAAGCGT
CAACGGACAATAACGACTTCCGCGCCCGGGGTTGGGGCTGGCTGGGTTCACTGGAAACCGGTCTGCCCTTCAGTATCACT
GACAACCTGATGCTGGAGCCACAACTGCAGTATACCTGGCAGGGACTTTCCCTGGATGACGGTAAGGACAACGCCGGTTA
TGTGAAGTTCGGGCATGGCAGTGCACAACATGTGCGTGCCGGTTTCCGTCTGGGCAGCCACAACGATATGACCTTTGGCG
AAGGCACCTCATCCCGTGCCCCCCTGCGTGACAGTGCAAAACACAGTGTGAGTGAATTACCGGTGAACTGGTGGGTACAG
CCTTCTGTTATCCGCACCTTCAGCTCCCGGGGAGATATGCGTGTGGGGACTTCCACTGCAGGCAGCGGGATGACGTTCTC
TCCCTCACAGAATGGCACATCACTGGACCTGCAGGCCGGACTGGAAGCCCGTGTCCGGGAAAATATCACCCTGGGCGTTC
AGGCCGGTTATGCCCACAGCGTCAGCGGCAGCAGCGCTGAAGGGTATAACGGTCAGGCCACACTGAATGTGACCTTCTGA

Protein sequence :
MKRHLNTSYRLVWNHMTGAFVVASELARARGKRGGVAVALSLAAVTSLPVLAADIVVHPGETVNGGTLANHDNQIVFGTT
NGMTISTGLEYGPDNEANTGGQWVQDGGTANKTTVTSGGLQRVNPGGSVSDTVISAGGGQSLQGRAVNTTLNGGEQWMHE
GAIATGTVINDKGWQVVKPGTVATDTVVNTGAEGGPDAENGDTGQFVRGDAVRTTINKNGRQIVRTEGTANTTVVYAGGD
QTVHGHALDTTLNGGYQYVHNGGTASDTVVNSDGWQIVKNGGVAGNTTVNQKGRLQVDAGGTATNVTLKQGGALVTSTAA
TVTGINRLGAFSVVEGKADNVVLENGGRLDVLTGHTATNTRVDDGGTLDVRNGGTATTVSMGNGGVLLADSGAAVSGTRS
DGKAFSIGGGQADALMLEKGSSFTLNAGDTATDTTVNGGLFTARGGTLAGTTTLNNGAILTLSGKTVNNDTLTIREGDAL
LQGGSLTGNGSVEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTLASG
ATWNIPDNATVQSVVDDLSHAGQIHFTSTRTGKFVPATLKVKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTIL
NLVNAGNSASGLATSGKGIQVVEAINGATTEEGAFVQGNRLQAGAFNYSLNRDSDESWYLRSENAYRAEVPLYASMLTQA
MDYDRIVAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLMRTEVAGMSVTAGVYGAA
GHSSVDVKDDDGSRAGTVRDDAGSLGGYLNLIHNASGLWADIVALGTRHSMKASTDNNDFRARGWGWLGSLETGLPFSIT
DNLMLEPQLQYTWQGLSLDDGKDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRAPLRDSAKHSVSELPVNWWVQ
PSVIRTFSSRGDMRVGTSTAGSGMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aidA ADD91708.1 AidA Not tested PAI-I AL862 Protein 0.0 96
flu CAI43838.1 antigen 43 Not tested LEE Protein 0.0 96
unnamed CAD66200.1 hypothetical protein Virulence PAI III 536 Protein 0.0 89
sap AAK00474.1 Sap Not tested SHI-1 Protein 0.0 88
SF2991 NP_708765.1 outer membrane fluffing protein Not tested SHI-1 Protein 0.0 88
Z1211 NP_286746.1 adhesin Not tested TAI Protein 0.0 58
Z1651 NP_287154.1 adhesin Not tested TAI Protein 0.0 58
unnamed AAL08472.1 putative autotransporter Not tested SRL Protein 0.0 57
unnamed CAE85197.1 antigen43 protein orthologue Not tested PAI V 536 Protein 0.0 57

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
ECO26_5486 YP_003232352.1 adhesin VFG1675 Protein 0.0 89
ECO26_5486 YP_003232352.1 adhesin VFG0655 Protein 0.0 88
ECO26_5486 YP_003232352.1 adhesin VFG1063 Protein 0.0 57