Gene Information

Name : flu (ECDH1ME8569_1935)
Accession : YP_006129336.1
Strain : Escherichia coli DH1
Genome accession: NC_017638
Putative virulence/resistance : Virulence
Product : CP4-44 prophage; antigen 43 (Ag43) phase-variable biofilm formation autotransporter
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2055186 - 2058461 bp
Length : 3276 bp
Strand : +
Note : -
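
The Position and Length fields above agree with each other if the coordinates are read as 1-based and inclusive. That reading is an assumption here, not something the record states. A minimal sketch of the arithmetic follows; the helper name feature_length is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates copied from the Position field of this record.
start, end = 2055186, 2058461

length_bp = feature_length(start, end)

# A coding sequence of this length holds length_bp // 3 codons.
# Assuming the last codon is a stop codon, the protein is one residue shorter.
codons = length_bp // 3
protein_aa = codons - 1

print(length_bp, codons, protein_aa)
```

Under these assumptions the computed length matches the Length field (3276 bp), and the expected protein length is 1091 residues.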

DNA sequence :
ATGCACAGATACGTACAGAAAGACATTCAGGGAACAACAGAACCACAATTCAGAAACTCCCACAGCCGGACCTCCGGCAC
TGTAACCCTTTACCTGCCGGTATCCACGTTTGTGGGTACCGGCTTTTTTATTCACCCTCAATCTAAGGAAAAGCTGATGA
AACGACATCTGAATACCTGCTACAGGCTGGTATGGAATCACATGACGGGCGCTTTCGTGGTTGCCTCCGAACTGGCCCGC
GCACGGGGTAAACGTGGCGGTGTGGCGGTTGCACTGTCTCTTGCCGCAGTCACGTCACTCCCGGTGCTGGCTGCTGACAT
CGTTGTGCACCCGGGAGAAACCGTGAACGGCGGAACACTGGCAAATCATGACAACCAGATTGTCTTCGGTACGACCAACG
GAATGACCATCAGTACCGGGCTGGAGTATGGGCCGGATAACGAGGCCAATACCGGCGGGCAATGGGTACAGGATGGCGGA
ACAGCCAACAAAACGACTGTCACCAGTGGTGGTCTTCAGAGAGTGAACCCCGGTGGAAGTGTCTCAGACACGGTTATCAG
TGCCGGAGGCGGACAGAGCCTTCAGGGACGGGCTGTGAACACCACGCTGAATGGTGGCGAACAGTGGATGCATGAGGGGG
CGATAGCCACAGGAACCGTCATTAATGATAAGGGCTGGCAGGTCGTCAAGCCCGGTACAGTGGCAACGGATACCGTTGTT
AATACCGGGGCGGAAGGGGGACCGGATGCAGAAAACGGTGATACCGGGCAGTTTGTTCGCGGGGATGCCGTACGCACAAC
CATCAATAAAAACGGTCGCCAGATTGTGAGAGCTGAAGGAACGGCAAATACCACTGTGGTTTATGCCGGCGGCGACCAGA
CTGTACATGGTCACGCACTGGATACCACGCTGAATGGGGGATACCAGTATGTGCACAACGGCGGTACAGCGTCTGACACT
GTTGTGAACAGTGACGGCTGGCAGATTGTCAAAAACGGGGGTGTGGCCGGGAATACCACCGTTAATCAGAAGGGCAGACT
GCAGGTGGACGCCGGTGGTACAGCCACGAATGTCACCCTGAAGCAGGGCGGCGCACTGGTTACCAGTACGGCTGCAACCG
TTACCGGCATAAACCGCCTGGGAGCATTCTCTGTTGTGGAGGGTAAAGCTGATAATGTCGTACTGGAAAATGGCGGACGC
CTGGATGTGCTGACCGGACACACAGCCACTAATACCCGCGTGGATGATGGCGGAACGCTGGATGTCCGCAACGGTGGCAC
CGCCACCACCGTATCCATGGGAAATGGCGGTGTACTGCTGGCCGATTCCGGTGCCGCTGTCAGTGGTACCCGGAGCGACG
GAAAGGCATTCAGTATCGGAGGCGGTCAGGCGGATGCCCTGATGCTGGAAAAAGGCAGTTCATTCACGCTGAACGCCGGT
GATACGGCCACGGATACCACGGTAAATGGCGGACTGTTCACCGCCAGGGGCGGCACACTGGCGGGCACCACCACGCTGAA
TAACGGCGCCATACTTACCCTTTCCGGGAAGACGGTGAACAACGATACCCTGACCATCCGTGAAGGCGATGCACTCCTGC
AGGGAGGCTCTCTCACCGGTAACGGCAGCGTGGAAAAATCAGGAAGTGGCACACTCACTGTCAGCAACACCACACTCACC
CAGAAAGCCGTCAACCTGAATGAAGGCACGCTGACGCTGAACGACAGTACCGTCACCACGGATGTCATTGCTCAGCGCGG
TACAGCCCTGAAGCTGACCGGCAGCACTGTGCTGAACGGTGCCATTGACCCCACGAATGTCACTCTCGCCTCCGGTGCCA
CCTGGAATATCCCCGATAACGCCACGGTGCAGTCGGTGGTGGATGACCTCAGCCATGCCGGACAGATTCATTTCACCTCC
ACCCGCACAGGGAAGTTCGTACCGGCAACCCTGAAAGTGAAAAACCTGAACGGACAGAATGGCACCATCAGCCTGCGTGT
ACGCCCGGATATGGCACAGAACAATGCTGACAGACTGGTCATTGACGGCGGCAGGGCAACCGGAAAAACCATCCTGAACC
TGGTGAACGCCGGCAACAGTGCGTCGGGGCTGGCGACCAGCGGTAAGGGTATTCAGGTGGTGGAAGCCATTAACGGTGCC
ACCACGGAGGAAGGGGCCTTTGTCCAGGGGAACAGGCTGCAGGCCGGTGCCTTTAACTACTCCCTCAACCGGGACAGTGA
TGAGAGCTGGTATCTGCGCAGTGAAAATGCTTATCGTGCAGAAGTCCCCCTGTATGCCTCCATGCTGACACAGGCAATGG
ACTATGACCGGATTGTGGCAGGCTCCCGCAGCCATCAGACCGGTGTAAATGGTGAAAACAACAGCGTCCGTCTCAGCATT
CAGGGCGGTCATCTCGGTCACGATAACAATGGCGGTATTGCCCGTGGGGCCACGCCGGAAAGCAGCGGCAGCTATGGATT
CGTCCGTCTGGAGGGTGACCTGATGAGAACAGAGGTTGCCGGTATGTCTGTGACCGCGGGGGTATATGGTGCTGCTGGCC
ATTCTTCCGTTGATGTTAAGGATGATGACGGCTCCCGTGCCGGCACGGTCCGGGATGATGCCGGCAGCCTGGGCGGATAC
CTGAATCTGGTACACACGTCCTCCGGCCTGTGGGCTGACATTGTGGCACAGGGAACCCGCCACAGCATGAAAGCGTCATC
GGACAATAACGACTTCCGCGCCCGGGGCTGGGGCTGGCTGGGCTCACTGGAAACCGGTCTGCCCTTCAGTATCACTGACA
ACCTGATGCTGGAGCCACAACTGCAGTATACCTGGCAGGGACTTTCCCTGGATGACGGTAAGGACAACGCCGGTTATGTG
AAGTTCGGGCATGGCAGTGCACAACATGTGCGTGCCGGTTTCCGTCTGGGCAGCCACAACGATATGACCTTTGGCGAAGG
CACCTCATCCCGTGCCCCCCTGCGTGACAGTGCAAAACACAGTGTGAGTGAATTACCGGTGAACTGGTGGGTACAGCCTT
CTGTTATCCGCACCTTCAGCTCCCGGGGAGATATGCGTGTGGGGACTTCCACTGCAGGCAGCGGGATGACGTTCTCTCCC
TCACAGAATGGCACATCACTGGACCTGCAGGCCGGACTGGAAGCCCGTGTCCGGGAAAATATCACCCTGGGCGTTCAGGC
CGGTTATGCCCACAGCGTCAGCGGCAGCAGCGCTGAAGGGTATAACGGTCAGGCCACACTGAATGTGACCTTCTGA

Protein sequence :
MHRYVQKDIQGTTEPQFRNSHSRTSGTVTLYLPVSTFVGTGFFIHPQSKEKLMKRHLNTCYRLVWNHMTGAFVVASELAR
ARGKRGGVAVALSLAAVTSLPVLAADIVVHPGETVNGGTLANHDNQIVFGTTNGMTISTGLEYGPDNEANTGGQWVQDGG
TANKTTVTSGGLQRVNPGGSVSDTVISAGGGQSLQGRAVNTTLNGGEQWMHEGAIATGTVINDKGWQVVKPGTVATDTVV
NTGAEGGPDAENGDTGQFVRGDAVRTTINKNGRQIVRAEGTANTTVVYAGGDQTVHGHALDTTLNGGYQYVHNGGTASDT
VVNSDGWQIVKNGGVAGNTTVNQKGRLQVDAGGTATNVTLKQGGALVTSTAATVTGINRLGAFSVVEGKADNVVLENGGR
LDVLTGHTATNTRVDDGGTLDVRNGGTATTVSMGNGGVLLADSGAAVSGTRSDGKAFSIGGGQADALMLEKGSSFTLNAG
DTATDTTVNGGLFTARGGTLAGTTTLNNGAILTLSGKTVNNDTLTIREGDALLQGGSLTGNGSVEKSGSGTLTVSNTTLT
QKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTLASGATWNIPDNATVQSVVDDLSHAGQIHFTS
TRTGKFVPATLKVKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNSASGLATSGKGIQVVEAINGA
TTEEGAFVQGNRLQAGAFNYSLNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRIVAGSRSHQTGVNGENNSVRLSI
QGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLMRTEVAGMSVTAGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGY
LNLVHTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGKDNAGYV
KFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRAPLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMRVGTSTAGSGMTFSP
SQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
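
The relationship between the DNA and protein sequences above can be spot-checked with a small translation routine. The sketch below assumes the standard codon assignments and a reading frame starting at the first base; the function name translate is hypothetical, and only short fragments copied from the start and end of the DNA sequence are used.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = dna.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases and last 9 bases of the DNA sequence in this record.
print(translate("ATGCACAGATACGTACAG"))
print(translate("ACCTTCTGA"))
```

The two fragments translate to MHRYVQ and TF, which match the first six and last two residues of the protein sequence. Passing the full DNA sequence to the same function would be the natural way to check the whole record.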

• Homologs from PAI DB

Gene     GenBank Accn  Product                          Virulence or Resistance  PAI or REI   Alignment Type  E-val  Identity (%)
aidA     ADD91708.1    AidA                             Not tested               PAI-I AL862  Protein         0.0    96
flu      CAI43838.1    antigen 43                       Not tested               LEE          Protein         0.0    95
unnamed  CAD66200.1    hypothetical protein             Virulence                PAI III 536  Protein         0.0    89
sap      AAK00474.1    Sap                              Not tested               SHI-1        Protein         0.0    88
SF2991   NP_708765.1   outer membrane fluffing protein  Not tested               SHI-1        Protein         0.0    88
unnamed  AAL08472.1    putative autotransporter         Not tested               SRL          Protein         0.0    58
unnamed  CAE85197.1    antigen43 protein orthologue     Not tested               PAI V 536    Protein         0.0    58
Z1211    NP_286746.1   adhesin                          Not tested               TAI          Protein         0.0    58
Z1651    NP_287154.1   adhesin                          Not tested               TAI          Protein         0.0    58

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn    Product                                                                                ID of source DB  Alignment Type  E-val  Identity (%)
flu   YP_006129336.1  CP4-44 prophage; antigen 43 (Ag43) phase-variable biofilm formation autotransporter  VFG1675          Protein         0.0    89
flu   YP_006129336.1  CP4-44 prophage; antigen 43 (Ag43) phase-variable biofilm formation autotransporter  VFG0655          Protein         0.0    88
flu   YP_006129336.1  CP4-44 prophage; antigen 43 (Ag43) phase-variable biofilm formation autotransporter  VFG1063          Protein         0.0    58