Gene Information

Name : flu (ECDH10B_2146)
Accession : YP_001730949.1
Strain : Escherichia coli K-12
Genome accession: NC_010473
Putative virulence/resistance : Virulence
Product : CP4-44 prophage; antigen 43 (Ag43) phase-variable biofilm formation autotransporter
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3468
EC number : -
Position : 2160571 - 2163690 bp
Length : 3120 bp
Strand : +
Note : -

DNA sequence :
ATGAAACGACATCTGAATACCTGCTACAGGCTGGTATGGAATCACATGACGGGCGCTTTCGTGGTTGCCTCCGAACTGGC
CCGCGCACGGGGTAAACGTGGCGGTGTGGCGGTTGCACTGTCTCTTGCCGCAGTCACGTCACTCCCGGTGCTGGCTGCTG
ACATCGTTGTGCACCCGGGAGAAACCGTGAACGGCGGAACACTGGCAAATCATGACAACCAGATTGTCTTCGGTACGACC
AACGGAATGACCATCAGTACCGGGCTGGAGTATGGGCCGGATAACGAGGCCAATACCGGCGGGCAATGGGTACAGGATGG
CGGAACAGCCAACAAAACGACTGTCACCAGTGGTGGTCTTCAGAGAGTGAACCCCGGTGGAAGTGTCTCAGACACGGTTA
TCAGTGCCGGAGGCGGACAGAGCCTTCAGGGACGGGCTGTGAACACCACGCTGAATGGTGGCGAACAGTGGATGCATGAG
GGGGCGATAGCCACAGGAACCGTCATTAATGATAAGGGCTGGCAGGTCGTCAAGCCCGGTACAGTGGCAACGGATACCGT
TGTTAATACCGGGGCGGAAGGGGGACCGGATGCAGAAAACGGTGATACCGGGCAGTTTGTTCGCGGGGATGCCGTACGCA
CAACCATCAATAAAAACGGTCGCCAGATTGTGAGAGCTGAAGGAACGGCAAATACCACTGTGGTTTATGCCGGCGGCGAC
CAGACTGTACATGGTCACGCACTGGATACCACGCTGAATGGGGGATACCAGTATGTGCACAACGGCGGTACAGCGTCTGA
CACTGTTGTGAACAGTGACGGCTGGCAGATTGTCAAAAACGGGGGTGTGGCCGGGAATACCACCGTTAATCAGAAGGGCA
GACTGCAGGTGGACGCCGGTGGTACAGCCACGAATGTCACCCTGAAGCAGGGCGGCGCACTGGTTACCAGTACGGCTGCA
ACCGTTACCGGCATAAACCGCCTGGGAGCATTCTCTGTTGTGGAGGGTAAAGCTGATAATGTCGTACTGGAAAATGGCGG
ACGCCTGGATGTGCTGACCGGACACACAGCCACTAATACCCGCGTGGATGATGGCGGAACGCTGGATGTCCGCAACGGTG
GCACCGCCACCACCGTATCCATGGGAAATGGCGGTGTACTGCTGGCCGATTCCGGTGCCGCTGTCAGTGGTACCCGGAGC
GACGGAAAGGCATTCAGTATCGGAGGCGGTCAGGCGGATGCCCTGATGCTGGAAAAAGGCAGTTCATTCACGCTGAACGC
CGGTGATACGGCCACGGATACCACGGTAAATGGCGGACTGTTCACCGCCAGGGGCGGCACACTGGCGGGCACCACCACGC
TGAATAACGGCGCCATACTTACCCTTTCCGGGAAGACGGTGAACAACGATACCCTGACCATCCGTGAAGGCGATGCACTC
CTGCAGGGAGGCTCTCTCACCGGTAACGGCAGCGTGGAAAAATCAGGAAGTGGCACACTCACTGTCAGCAACACCACACT
CACCCAGAAAGCCGTCAACCTGAATGAAGGCACGCTGACGCTGAACGACAGTACCGTCACCACGGATGTCATTGCTCAGC
GCGGTACAGCCCTGAAGCTGACCGGCAGCACTGTGCTGAACGGTGCCATTGACCCCACGAATGTCACTCTCGCCTCCGGT
GCCACCTGGAATATCCCCGATAACGCCACGGTGCAGTCGGTGGTGGATGACCTCAGCCATGCCGGACAGATTCATTTCAC
CTCCACCCGCACAGGGAAGTTCGTACCGGCAACCCTGAAAGTGAAAAACCTGAACGGACAGAATGGCACCATCAGCCTGC
GTGTACGCCCGGATATGGCACAGAACAATGCTGACAGACTGGTCATTGACGGCGGCAGGGCAACCGGAAAAACCATCCTG
AACCTGGTGAACGCCGGCAACAGTGCGTCGGGGCTGGCGACCAGCGGTAAGGGTATTCAGGTGGTGGAAGCCATTAACGG
TGCCACCACGGAGGAAGGGGCCTTTGTCCAGGGGAACAGGCTGCAGGCCGGTGCCTTTAACTACTCCCTCAACCGGGACA
GTGATGAGAGCTGGTATCTGCGCAGTGAAAATGCTTATCGTGCAGAAGTCCCCCTGTATGCCTCCATGCTGACACAGGCA
ATGGACTATGACCGGATTGTGGCAGGCTCCCGCAGCCATCAGACCGGTGTAAATGGTGAAAACAACAGCGTCCGTCTCAG
CATTCAGGGCGGTCATCTCGGTCACGATAACAATGGCGGTATTGCCCGTGGGGCCACGCCGGAAAGCAGCGGCAGCTATG
GATTCGTCCGTCTGGAGGGTGACCTGATGAGAACAGAGGTTGCCGGTATGTCTGTGACCGCGGGGGTATATGGTGCTGCT
GGCCATTCTTCCGTTGATGTTAAGGATGATGACGGCTCCCGTGCCGGCACGGTCCGGGATGATGCCGGCAGCCTGGGCGG
ATACCTGAATCTGGTACACACGTCCTCCGGCCTGTGGGCTGACATTGTGGCACAGGGAACCCGCCACAGCATGAAAGCGT
CATCGGACAATAACGACTTCCGCGCCCGGGGCTGGGGCTGGCTGGGCTCACTGGAAACCGGTCTGCCCTTCAGTATCACT
GACAACCTGATGCTGGAGCCACAACTGCAGTATACCTGGCAGGGACTTTCCCTGGATGACGGTAAGGACAACGCCGGTTA
TGTGAAGTTCGGGCATGGCAGTGCACAACATGTGCGTGCCGGTTTCCGTCTGGGCAGCCACAACGATATGACCTTTGGCG
AAGGCACCTCATCCCGTGCCCCCCTGCGTGACAGTGCAAAACACAGTGTGAGTGAATTACCGGTGAACTGGTGGGTACAG
CCTTCTGTTATCCGCACCTTCAGCTCCCGGGGAGATATGCGTGTGGGGACTTCCACTGCAGGCAGCGGGATGACGTTCTC
TCCCTCACAGAATGGCACATCACTGGACCTGCAGGCCGGACTGGAAGCCCGTGTCCGGGAAAATATCACCCTGGGCGTTC
AGGCCGGTTATGCCCACAGCGTCAGCGGCAGCAGCGCTGAAGGGTATAACGGTCAGGCCACACTGAATGTGACCTTCTGA

Protein sequence :
MKRHLNTCYRLVWNHMTGAFVVASELARARGKRGGVAVALSLAAVTSLPVLAADIVVHPGETVNGGTLANHDNQIVFGTT
NGMTISTGLEYGPDNEANTGGQWVQDGGTANKTTVTSGGLQRVNPGGSVSDTVISAGGGQSLQGRAVNTTLNGGEQWMHE
GAIATGTVINDKGWQVVKPGTVATDTVVNTGAEGGPDAENGDTGQFVRGDAVRTTINKNGRQIVRAEGTANTTVVYAGGD
QTVHGHALDTTLNGGYQYVHNGGTASDTVVNSDGWQIVKNGGVAGNTTVNQKGRLQVDAGGTATNVTLKQGGALVTSTAA
TVTGINRLGAFSVVEGKADNVVLENGGRLDVLTGHTATNTRVDDGGTLDVRNGGTATTVSMGNGGVLLADSGAAVSGTRS
DGKAFSIGGGQADALMLEKGSSFTLNAGDTATDTTVNGGLFTARGGTLAGTTTLNNGAILTLSGKTVNNDTLTIREGDAL
LQGGSLTGNGSVEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTLASG
ATWNIPDNATVQSVVDDLSHAGQIHFTSTRTGKFVPATLKVKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTIL
NLVNAGNSASGLATSGKGIQVVEAINGATTEEGAFVQGNRLQAGAFNYSLNRDSDESWYLRSENAYRAEVPLYASMLTQA
MDYDRIVAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLMRTEVAGMSVTAGVYGAA
GHSSVDVKDDDGSRAGTVRDDAGSLGGYLNLVHTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSIT
DNLMLEPQLQYTWQGLSLDDGKDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRAPLRDSAKHSVSELPVNWWVQ
PSVIRTFSSRGDMRVGTSTAGSGMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aidA ADD91708.1 AidA Not tested PAI-I AL862 Protein 0.0 96
flu CAI43838.1 antigen 43 Not tested LEE Protein 0.0 96
unnamed CAD66200.1 hypothetical protein Virulence PAI III 536 Protein 0.0 89
sap AAK00474.1 Sap Not tested SHI-1 Protein 0.0 88
SF2991 NP_708765.1 outer membrane fluffing protein Not tested SHI-1 Protein 0.0 88
unnamed AAL08472.1 putative autotransporter Not tested SRL Protein 0.0 58
unnamed CAE85197.1 antigen43 protein orthologue Not tested PAI V 536 Protein 0.0 58
Z1211 NP_286746.1 adhesin Not tested TAI Protein 0.0 58
Z1651 NP_287154.1 adhesin Not tested TAI Protein 0.0 58

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
flu YP_001730949.1 CP4-44 prophage; antigen 43 (Ag43) phase-variable biofilm formation autotransporter VFG1675 Protein 0.0 89
flu YP_001730949.1 CP4-44 prophage; antigen 43 (Ag43) phase-variable biofilm formation autotransporter VFG0655 Protein 0.0 88
flu YP_001730949.1 CP4-44 prophage; antigen 43 (Ag43) phase-variable biofilm formation autotransporter VFG1063 Protein 0.0 58