Gene Information

Name : flu (ECUMN_4881)
Accession : YP_002415461.1
Strain : Escherichia coli UMN026
Genome accession : NC_011751
Putative virulence/resistance : Virulence
Product : antigen 43 (Ag43) phase-variable biofilm formation autotransporter; CP4-44 prophage
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3468
EC number : -
Position : 5045922 - 5049041 bp
Length : 3120 bp
Strand : +
Note : Evidence 1b : Function experimentally demonstrated in the studied species; PubMedId : 20392470, 97257509, 99212225, 2661530, 9298646; Product type h : extrachromosomal origin
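
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the stated length but is not spelled out in the record. The helper names `feature_length` and `expected_protein_length` are hypothetical and chosen only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(5045922, 5049041)
print(length_bp)                           # 3120, matching the Length field
print(expected_protein_length(length_bp))  # 1039
```

A CDS of 3120 bp holds 1040 codons; the last one is the stop codon, so the translated product should have 1039 residues.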

DNA sequence :
ATGAAACGACATCTGAACACCAGCTACAGGCTGGTATGGAATCACATTACGGGCACCCTGGTGGTGGCCTCCGAACTGGC
CCGCTCACGGGGAAAACGCGCTGGTGTGGCGGTTGCGCTGTCTCTTGCTGCTGTCACATCAGTCCCGGCACTGGCTGCTG
ACAAGGTTGTACAGGCGGGAGAAACCGTGAACGACGGAACACTGACAAATCATGACAACCAGATTGTCCTCGGTACGGCC
AACGGAATGACCATCAGTACCGGTCTGGAGTATGGGCCGGATAACGAGGCCAATACCGGCGGACAATGGATACAAAATGG
CGGTATCGCCAACAACACTACTGTCACCGGTGGTGGTCTTCAGAGAGTGAATGCCGGAGGAAGCGTTTCAGACACGGTTA
TCAGTGCCGGAGGCGGACAGAGCCTTCAGGGGCAGGCAGTGAACACCACTCTGAACGGCGGTGAGCAGTGGGTACATGAA
GGCGGGATTGCAACGGGTACCGTCATTAATGAGAAGGGCTGGCAGGCCATCAAATCCGGCGCAGTGGCAACCGACACGGT
TGTGAATACCGGCGCGGAAGGGGGACCGGATGCGGAAAATGGTGATACCGGGCAGACCGTCTACGGAGATGCCGTACGCA
CCACCATCAATAAAAATGGTCGTCAGATTGTGGCTGCTGAAGGAACGGCAAATACCACTGTGGTTTATGCCGGCGGCGAC
CAGACTGTACATGGTCACGCACTGGATACCACGCTGAATGGGGGGTACCAGTATGTGCACAACGGCGGTACGGCGTCTGG
CACTGTTGTAAACAGTGACGGCTGGCAGATTGTCAAAAACGGGGGTGTGGCCGGGAATACCACCGTTAATCAGAAGGGCA
GACTGCAGGTGGACGCCGGTGGTACAGCCACGAATGTCACCCTGAAGCAGGGCGGCGCACTGGTTACCAGTACGGCTGCA
ACCGTTACCGGCATAAACCGCCTGGGAGCATTCTCTGTTGTGGAGGGTAAAGCTGATAATGTCGTACTGGAAAATGGCGG
ACGCCTGGATGTGCTGACCGGACACACAGCCACTAATACCCGCGTGGATGATGGCGGAACGCTGGATGTCCGCAACGGTG
GCACCGCCACCACCGTATCCATGGGAAATGGCGGTGTACTGCTGGCCGATTCCGGTGCCGCTGTCAGTGGTACCCGGAGC
GACGGAAAGGCATTCAGTATCGGAGGCGGTCAGGCGGATGCCCTGATGCTGGAAAAAGGCAGTTCATTCACGCTGAACGC
CGGTGATACGGCCACGGATACCACGGTAAATGGCGGACTGTTCACCGCCAGGGGCGGCACACTGGCGGGCACCACCACGC
TGAATAACGGCGCCATACTTACCCTTTCCGGGAAGACGGTGAACAACGATACCCTGACCATCCGTGAAGGCGATGCACTC
CTGCAGGGAGGCTCTCTCACCGGTAACGGCAGCGTGGAAAAATCAGGAAGTGGCACACTCACTGTCAGCAACACCACACT
CACCCAGAAAGCCGTCAACCTGAATGAAGGCACGCTGACGCTGAACGACAGTACCGTCACCACGGATGTCATTGCTCAGC
GCGGTACAGCCCTGAAGCTGACCGGCAGCACTGTGCTGAACGGTGCCATTGACCCCACGAATGTCACTCTCGCCTCTGGT
GCCACCTGGAATATTCCCGATAACGCCACGGTGCAGTCGGTGGTGGATGACCTCAGCCATGCCGGACAGATTCATTTCAC
CTCCACCCGCACAGGGAAGTTCGTACCGGCAACCCTGAAAGTGAAAAACCTGAACGGACAGAATGGCACCATCAGCCTGC
GTGTACGCCCGGATATGGCACAGAACAATGCTGACAGACTGGTCATTGACGGTGGCAGGGCAACCGGAAAAACCATCCTG
AACCTGGTGAACGCCGGCAACAGTGCGTCGGGGCTGGCGACCAGCGGTAAGGGTATTCAGGTGGTGGAAGCCATTAACGG
TGCCACCACGGAGGAAGGGGCCTTTATCCAGGGGAATAAGCTGCAGGCCGGTGCCTTTAACTACTCCCTCAACCGGGACA
GTGATGAGAGCTGGTATCTGCGCAGTGAAAATGCTTATCGTGCTGAAGTCCCCCTGTATACATCCATGCTGACACAGGCA
ATGGACTATGACCGGATTCTGGCAGGCTCCCGCAGCCATCAGACCGGTGTAAACGGTGAAAATAACAGCGTCCGTCTCAG
CATTCAGGGCGGTCATCTCGGTCACGATAACAACGGCGGTATTGCCCGTGGAGCCACGCCGGAAAGCAGCGGCAGCTATG
GCTTCGTCCGTCTGGAGGGTGACCTGCTCAGAACAGAGGTTGCCGGTATGTCTCTGACGACAGGGGTGTATGGTGCCGCA
GGCCATTCTTCCGTTGATGTTAAGGATGATGACGGTTCCCGCGCCGGCACGGTCCGGGATGATGCCGGCAGCCTGGGCGG
ATACCTGAATCTGACACACACGTCCTCCGGCCTGTGGGCTGACATTGTGGCACAGGGAACCCGCCACAGCATGAAAGCGT
CATCGGACAATAACGACTTCCGCGCCCGGGGCTGGGGCTGGCTGGGCTCACTGGAAACCGGTCTGCCCTTCAGTATCACT
GACAACCTGATGCTGGAGCCACAACTGCAGTATACCTGGCAGGGACTTTCCCTGGATGACGGCCAGGATAACGCCGGTTA
TGTGAAGTTCGGGCATGGCAGTGCACAACATGTGCGTGCCGGTTTCCGTCTGGGCAGCCACAACGATATGAGCTTTGGTG
AAGGCACCTCATCCCGTGACACCCTGCGCGACAGTGCAAAACACCGTGTGCGTGAACTGCCGGTGAACTGGTGGGTACAG
CCTTCTGTTATCCGCACTTTCAGCTCCCGGGGTGACATGAGCATGGGTACAGCCGCAGCCGGCAGTAACATGACGTTCTC
ACCGTCCCGGAATGGCACGTCACTGGACCTGCAGGCCGGACTGGAAGCCCGTGTCCGGGAAAATATCACCCTGGGCGTTC
AGGCCGGTTATGCCCACAGTGTCAGCGGCAGCAGCGCTGAAGGTTATAACGGTCAGGCCACGCTGAATGTGACTTTCTGA

Protein sequence :
MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVNDGTLTNHDNQIVLGTA
NGMTISTGLEYGPDNEANTGGQWIQNGGIANNTTVTGGGLQRVNAGGSVSDTVISAGGGQSLQGQAVNTTLNGGEQWVHE
GGIATGTVINEKGWQAIKSGAVATDTVVNTGAEGGPDAENGDTGQTVYGDAVRTTINKNGRQIVAAEGTANTTVVYAGGD
QTVHGHALDTTLNGGYQYVHNGGTASGTVVNSDGWQIVKNGGVAGNTTVNQKGRLQVDAGGTATNVTLKQGGALVTSTAA
TVTGINRLGAFSVVEGKADNVVLENGGRLDVLTGHTATNTRVDDGGTLDVRNGGTATTVSMGNGGVLLADSGAAVSGTRS
DGKAFSIGGGQADALMLEKGSSFTLNAGDTATDTTVNGGLFTARGGTLAGTTTLNNGAILTLSGKTVNNDTLTIREGDAL
LQGGSLTGNGSVEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTLASG
ATWNIPDNATVQSVVDDLSHAGQIHFTSTRTGKFVPATLKVKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTIL
NLVNAGNSASGLATSGKGIQVVEAINGATTEEGAFIQGNKLQAGAFNYSLNRDSDESWYLRSENAYRAEVPLYTSMLTQA
MDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAA
GHSSVDVKDDDGSRAGTVRDDAGSLGGYLNLTHTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSIT
DNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMSFGEGTSSRDTLRDSAKHRVRELPVNWWVQ
PSVIRTFSSRGDMSMGTAAAGSNMTFSPSRNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
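
The protein sequence above is the conceptual translation of the DNA sequence. As a minimal sketch of that relationship, the following code translates a coding sequence with the standard genetic code, written without external libraries. The function name `translate` and the use of `X` for unrecognized codons are illustrative assumptions, not part of the record. For sense codons, the bacterial code (NCBI translation table 11) assigns the same amino acids as the standard table used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1; whitespace and line breaks are ignored."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the DNA sequence in this record.
print(translate("ATGAAACGACATCTGAACACC"))  # MKRHLNT
```

Passing the full DNA sequence, with its line breaks, to `translate` should reproduce the protein sequence listed in the record, up to the terminal TGA stop codon.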

• Homologs from PAI DB

Gene     GenBank Accn  Product                          Virulence or Resistance  PAI or REI   Alignment Type  E-val  Identity (%)
aidA     ADD91708.1    AidA                             Not tested               PAI-I AL862  Protein         0.0    99
flu      CAI43838.1    antigen 43                       Not tested               LEE          Protein         0.0    99
sap      AAK00474.1    Sap                              Not tested               SHI-1        Protein         0.0    89
SF2991   NP_708765.1   outer membrane fluffing protein  Not tested               SHI-1        Protein         0.0    89
unnamed  CAD66200.1    hypothetical protein             Virulence                PAI III 536  Protein         0.0    89
unnamed  AAL08472.1    putative autotransporter         Not tested               SRL          Protein         0.0    59
unnamed  CAE85197.1    antigen43 protein orthologue     Not tested               PAI V 536    Protein         0.0    59
Z1211    NP_286746.1   adhesin                          Not tested               TAI          Protein         0.0    59
Z1651    NP_287154.1   adhesin                          Not tested               TAI          Protein         0.0    59

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn    Product                                                                              ID of source DB  Alignment Type  E-val  Identity (%)
flu   YP_002415461.1  antigen 43 (Ag43) phase-variable biofilm formation autotransporter; CP4-44 prophage  VFG0655          Protein         0.0    89
flu   YP_002415461.1  antigen 43 (Ag43) phase-variable biofilm formation autotransporter; CP4-44 prophage  VFG1675          Protein         0.0    89
flu   YP_002415461.1  antigen 43 (Ag43) phase-variable biofilm formation autotransporter; CP4-44 prophage  VFG1063          Protein         0.0    59