Gene Information

Name : flu1 (EC042_2242)
Accession : YP_006096552.1
Strain : Escherichia coli 042
Genome accession: NC_017626
Putative virulence/resistance : Virulence
Product : antigen 43 precursor (fluffing protein) (autotransporter)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2335419 - 2338538 bp
Length : 3120 bp
Strand : +
Note : highly similar (87.199% id) to EC042_4511
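
The Length field can be checked against the Position field with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the ORF ends in a single stop codon; the helper name `feature_length` is hypothetical and not part of any database tooling.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
start, end = 2335419, 2338538

length = feature_length(start, end)
codons = length // 3           # number of complete codons in the ORF
residues = codons - 1          # assumes exactly one terminal stop codon

print(length, codons, residues)  # 3120 1040 1039
```

Under these assumptions the coordinates agree with the stated length of 3120 bp.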

DNA sequence :
ATGAAACGACATCTGAATACCTGCTACAGGCTGGTATGGAATCACATTACGGGCGCTTTCGTGGTTGCCTCCGAACTGGC
CCGCGCACGGGGTAAACGTGGCGGTGTGGCGGTTGCACTGTCTCTTGCCGCAGTCACGTCACTCCCGGTGCTGGCTGCTG
ACATCGTTGTGCACCCGGGAGAAACCGTGAACGGCGGAACACTGGCAAATCATGACAACCAGATTGTCTTCGGTACGACC
AACGGAATGACCATCAGTACCGGGCTGGAGTATGGGCCGGATAACGAGGCCAATACCGGCGGGCAATGGGTACAGGATGG
CGGAACAGCCAACAAAACGACTGTCACCAGTGGTGGTCTTCAGAGAGTGAACCCCGGTGGAAGTGTCTCAGACACGGTTA
TCAGTGCCGGAGGCGGACAGAGCCTTCAGGGACGGGCAGTGAACACCACGCTGAATGGTGGCGAACAGTGGATGCATGAG
GGGGCGATAGCCACAGGAACCGTCATTAATGATAAGGGCTGGCAGGTCGTCAAGCCCGGTACAGTGGCAACGGATACCGT
TGTTAATACCGGGGCGGAAGGGGGACCGGATGCAGAAAACGGTGATACCGGGCAGTTTGTTCGCGGGGATGCCGTACGCA
CAACCATCAATAAAAACGGTCGCCAGATTGTGAGAGCTGAAGGAACGGCAAATACCACTGTGGTTTATGCCGGCGGCGAC
CAGACTGTACATGGTCACGCACTGGATACCACGCTGAATGGGGGATACCAGTATGTGCACAACGGCGGTACAGCGTCTGA
CACTGTTGTGAACAGTGACGGCTGGCAGATTGTCAAAAACGGGGGTGTGGCCGGGAATACCACCGTTAATCAGAAGGGCA
GACTGCAGGTGGACGCCGGTGGTACAGCCACGAATGTCACCCTGAAGCAGGGCGGCGCACTGGTTACCAGTACGGCTGCA
ACCGTTACCGGCATAAACCGCCTGGGAGCATTCTCTGTTGTGGAGGGTAAAGCTGATAATGTCGTACTGGAAAATGGCGG
ACGCCTGGATGTGCTGACCGGACACACAGCCACTAATACCCGCGTGGATGATGGCGGAACGCTGGATGTCCGCAACGGTG
GCACCGCCACCACCGTATCCATGGGAAATGGCGGTGTACTGCTGGCCGATTCCGGTGCCGCTGTCAGTGGTACCCGGAGC
GACGGAAAGGCATTCAGTATCGGAGGCGGTCAGGCGGATGCCCTGATGCTGGAAAAAGGCAGTTCATTCACGCTGAACGC
CGGTGATACGGCCACGGATACCACGGTAAATGGCGGACTGTTCACCGCCAGGGGCGGCACACTGGCGGGCACCACCACGC
TGAATAACGGCGCCATACTTACCCTTTCCGGGAAGACGGTGAACAACGATACCCTGACCATCCGTGAAGGCGATGCACTC
CTGCAGGGAGGCTCTCTCACCGGTAACGGCAGCGTGGAAAAATCAGGAAGTGGCACACTCACTGTCAGCAACACCACACT
CACCCAGAAAGCCGTCAACCTGAATGAAGGCACGCTGACGCTGAACGACAGTACCGTCACCACGGATGTCATTGCTCAGC
GCGGTACAGCCCTGAAGCTGACCGGCAGCACTGTGCTGAACGGTGCCATTGACCCCACGAATGTCACTCTCGCCTCCGGT
GCCACCTGGAATATCCCCGATAACGCCACGGTGCAGTCGGTGGTGGATGACCTCAGCCATGCCGGACAGATTCATTTCAC
CTCCACCCGCACAGGGAAGTTCGTACCGGCAACCCTGAAAGTGAAAAACCTGAACGGACAGAATGGCACCATCAGCCTGC
GTGTACGCCCGGATATGGCACAGAACAATGCTGACAGACTGGTCATTGACGGCGGCAGGGCAACCGGAAAAACCATCCTG
AACCTGGTGAACGCCGGCAACAGTGCGTCGGGGCTGGCGACCAGCGGTAAGGGTATTCAGGTGGTTGAAGCCATTAACGG
TGCCACCACGGAGGAAGGGGCCTTTGTCCAGGGGAACAGGCTGCAGGCCGGTGCCTTTAACTACTCCCTCAACCGGGACA
GTGATGAGAGCTGGTATCTGCGCAGTGAAAATGCTTATCGTGCAGAAGTCCCCCTGTATGCCTCCATGCTGACACAGGCA
ATGGACTATGACCGGATTCTGGCAGGCTCCCGCAGCCATCAGACCGGTGTAAGCGGTGAAAATAACAGCGTCCGTCTCAG
CATTCAGGGCGGTCATCTCGGTCACGATAACAACGGCGGTATTGCCCGTGGGGCCACGCCGGAAAGCAGCGGCAGCTATG
GCTTCGTCCGTCTGGAGGGTGACCTGCTCAGAACAGAGGTTGCCGGTATGTCTCTGACGACAGGGGTGTATGGTGCTGCA
GGCCATTCTTCCGTTGATGTTAAGGATGATGACGGCTCCCGCGCCGGCACGGTCCGGGATGATGCCGGCAGCCTGGGCGG
ATACCTGAATCTGACACACACGTCCTCCGGCCTGTGGGCTGACATTGTGGCACAGGGAACCCGCCACAGCATGAAAGCGT
CATCGGATAATAACGACTTCCGCGCCCGGGGCTGGGGCTGGCTGGGCTCACTGGAAACCGGTCTGCCCTTCAGTATCACT
GACAACCTGATGCTGGAGCCACAACTGCACTATACCTGGCAGGGACTTTCCCTGGATGACGGCCAGGATAACGCCGGTTA
TGTGAAGTTCGGGCATGGCAGTGCACAACATGTGCGTGCCGGTTTCCGTCTGGGCAGCCACAACGATATGACCTTTGGTG
AAGGCACCTCATCCCGTGACACCCTGCGCGACAGTACAAAACACGGTGTGAGTGAACTGCCTGTGAACTGGTGGGTACAG
CCTTCTGTTATCCGCACCTTCAGCTCCCGGGGAGACATGAGCATGGGTACAGCCGCAGCCGGCAGTAACATGACGTTCTC
ACCGTCCCAGAATGGCACATCACTGGACCTGCAGGCCGGACTGGAAGCCCGTGTCCGGGAAAATATCACCCTGGGCGTTC
AGGCCGGTTATGCCCACAGCGTCAGCGGCAGCAGCGCTGAAGGGTATAACGGTCAGGCCACACTGAATGTGACCTTCTGA

Protein sequence :
MKRHLNTCYRLVWNHITGAFVVASELARARGKRGGVAVALSLAAVTSLPVLAADIVVHPGETVNGGTLANHDNQIVFGTT
NGMTISTGLEYGPDNEANTGGQWVQDGGTANKTTVTSGGLQRVNPGGSVSDTVISAGGGQSLQGRAVNTTLNGGEQWMHE
GAIATGTVINDKGWQVVKPGTVATDTVVNTGAEGGPDAENGDTGQFVRGDAVRTTINKNGRQIVRAEGTANTTVVYAGGD
QTVHGHALDTTLNGGYQYVHNGGTASDTVVNSDGWQIVKNGGVAGNTTVNQKGRLQVDAGGTATNVTLKQGGALVTSTAA
TVTGINRLGAFSVVEGKADNVVLENGGRLDVLTGHTATNTRVDDGGTLDVRNGGTATTVSMGNGGVLLADSGAAVSGTRS
DGKAFSIGGGQADALMLEKGSSFTLNAGDTATDTTVNGGLFTARGGTLAGTTTLNNGAILTLSGKTVNNDTLTIREGDAL
LQGGSLTGNGSVEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTLASG
ATWNIPDNATVQSVVDDLSHAGQIHFTSTRTGKFVPATLKVKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTIL
NLVNAGNSASGLATSGKGIQVVEAINGATTEEGAFVQGNRLQAGAFNYSLNRDSDESWYLRSENAYRAEVPLYASMLTQA
MDYDRILAGSRSHQTGVSGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAA
GHSSVDVKDDDGSRAGTVRDDAGSLGGYLNLTHTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSIT
DNLMLEPQLHYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSTKHGVSELPVNWWVQ
PSVIRTFSSRGDMSMGTAAAGSNMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
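
The protein sequence above corresponds to a translation of the DNA sequence. A minimal sketch of that step, assuming the standard genetic code laid out in TCAG order (the encoding used for NCBI translation table 1; bacterial table 11 assigns the same amino acids) and translation in frame from the first base; the `translate` helper is illustrative only and is not the tool used to produce this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than A, C, G, T raise KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon: end of the ORF
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the DNA sequence in this record.
print(translate("ATGAAACGACATCTGAATACCTGCTACAGG"))  # MKRHLNTCYR
print(translate("CTGAATGTGACCTTCTGA"))              # LNVTF
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence.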

• Homologs from PAI DB

Gene     GenBank Accn  Product                          Virulence or Resistance  PAI or REI   Alignment Type  E-val  Identity (%)
aidA     ADD91708.1    AidA                             Not tested               PAI-I AL862  Protein         0.0    96
flu      CAI43838.1    antigen 43                       Not tested               LEE          Protein         0.0    96
unnamed  CAD66200.1    hypothetical protein             Virulence                PAI III 536  Protein         0.0    90
sap      AAK00474.1    Sap                              Not tested               SHI-1        Protein         0.0    89
SF2991   NP_708765.1   outer membrane fluffing protein  Not tested               SHI-1        Protein         0.0    89
unnamed  AAL08472.1    putative autotransporter         Not tested               SRL          Protein         0.0    59
unnamed  CAE85197.1    antigen43 protein orthologue     Not tested               PAI V 536    Protein         0.0    59
Z1211    NP_286746.1   adhesin                          Not tested               TAI          Protein         0.0    59
Z1651    NP_287154.1   adhesin                          Not tested               TAI          Protein         0.0    59

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn    Product                                                    ID of source DB  Alignment Type  E-val  Identity (%)
flu1  YP_006096552.1  antigen 43 precursor (fluffing protein) (autotransporter)  VFG1675          Protein         0.0    90
flu1  YP_006096552.1  antigen 43 precursor (fluffing protein) (autotransporter)  VFG0655          Protein         0.0    89
flu1  YP_006096552.1  antigen 43 precursor (fluffing protein) (autotransporter)  VFG1063          Protein         0.0    59