Gene Information

Name : i02_4885a (i02_4885a)
Accession : YP_006152022.1
Strain : Escherichia coli clone D i2
Genome accession: NC_017651
Putative virulence/resistance : Virulence
Product : antigen 43 precursor (AG43) (Fluffing protein)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4911095 - 4914016 bp
Length : 2922 bp
Strand : +
Note : -
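
The Position and Length fields above agree if the coordinates are read as 1-based and inclusive. The record does not state its convention, so that reading is an assumption. A minimal Python sketch of the check follows; the function name `feature_length` is hypothetical and used only for illustration:

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


# Values copied from the Position field of this record.
start, end = 4911095, 4914016

length_bp = feature_length(start, end)
codons = length_bp // 3          # includes the terminal stop codon
residues = codons - 1            # translated residues, excluding the stop

print(length_bp, length_bp % 3, codons, residues)
```

Under that assumption the computed length matches the Length field (2922 bp) and is a multiple of three, as expected for a complete coding sequence.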

DNA sequence :
ATGCTGTTTTGTACCTGCCGGTATCCACTTTTGTGGGTACCGGCTTTTTTATTCACCCTCTGTAAGGAAAAGCTGATGAA
ACGACATCTGAATACCAGCTACAGGCTGGTATGGAATCACATTACGGGCTCCCTGGTGGTGGCTTCCGAACTGGCCCGCT
CACGGGGAAAACGCGCCGGTGTGGCGGTTGCACTGTCTCTTGCTGCTGTTACGTCAGTCCCGGCACTGGCTGCTGACACG
GTTGTACAGGCGGGAGAAACCGTGAACGGCGGAACACTGACAAATCATGACAACCAGATTATCCTCGGTACGGCCAACGG
AATGACCATCAGTACCGGGCTGGAGTATGGGCCGGATAACGAGGCCAATACCGGCGGGCAATGGATACAAAATGGCGGTA
CCGCCAACAACACTACTGTCACCGGTGGTGGTCTTCAGCGAGTGAATGCCGGAGGAAGCGTTTCAGACACGGTTATCAGT
GCCGGGGGCGGACAGAGCCTTCAGGGGCAGGCAGTGAACACCACTCTGAACGGCGGTGAGCAGTGGGTACATGAAGGCGG
CATTGCAACGGGTACCGTCATTAATGAGAAGGGCTGGCAGGCCGTCAAATCCGGCGCACTGGCAACCGACACAGTTGTGA
ATACTGGCGCGGAAGGAGGACCGGATGCTGAAAATGGTGATACCGGGCAGACCGTCTACGGAGATGCCATACGCACCACC
ATCAATAAAAATGGTCGTCAGATTGTGGCTGCTGAAGGAACGGCAAATACCACTGTGGTTTATGCCGGCGGCGACCAGAC
TGTACATGGTCACGCACTGGATACCACGCTGAATGGGGGGTACCAGTATGTGCACAACGGCGGTACAGCGACTGACACTG
TTGTTAACAGTGACGGCTGGCAGGTAGTGAAGGATGGTGCTGTGGCTGAGAATACCACCGTTAACCAGAAAGGCAAACTG
CAGGTGAACGCCGGTGGTACAGCCACGAATGTCACCCTGAAGCAGGGGGGCGCACTGGTCACCAGTACGGCGGCAACCGT
CACCGGCAGCAACCGTCTGGGCAATTTCGCGGTGGAAAACGGTAAGGCTGACGGTGTTGTTCTGGAGTCCGGTGGTCGCC
TGGATGTACTGGAGAGCCATTCAGCACAGAATATCCTGGTGGATGACGGCGGTACCTTGGCAGTGTCTGTCGGCGGTAAG
GCAACAGATGTCACCATGACATCCGGTGGTGCCCTGATTGCAGACAGTGGTGCCACTGTTGAGGGGACCAATGCCAGCGG
TAAGTTCAGTATTGATGGTATATCCGGTCAGGCCAGCGGCCTGCTGCTGGAAAATGGCGGCAGCTTTACGGTTAATGCCG
GAGGACTGGCCAGCAACACCACTGTCGGACATCGTGGAACACTGACGCTGGCTGCCGGGGGAAGTCTGAGTGGCAGAACA
CAGCTCAGTAAAGGTGCCAGCATGGTACTGAATGGCGATGTGGTCAGTACCGGCGATATTGTTAACGCAGGGGAGATTCG
CTTTGATAATCAGACAACACCGGATGCCGCGCTGAGCCGTGCTGTTGCAAAAAGTAACTCCCCGGTAACGTTCCATAAAC
TGACCACCAGTAACCTCACCGGCCAGGGCGGCACCATCAATATGCGTGTTAGCCTTGATGGCAGTAATGACTCAGACCAG
CTGGTAATTAATGGTGGTCAGGCAACCGGCAAAACCTGGCTTGCGTTCACAAATGTCGGAAACAGTAACCTCGGGGTGGC
AACCACCGGACAGGGTATCCGGGTTGTGGATGCACAGAATGGTGCCACCACAGAAGAAGGTGCGTTTGCCCTGAGCCGCC
CGCTTCAGGCCGGCGCCTTTAACTACACCCTGAACCGTGACAGCGATGAAGACTGGTACCTGCGCAGTGAAAATGCTTAT
CGTGCTGAAGTCCCCCTGTATACATCCATGCTGACACAGGCAATGGACTATGACCGGATTCTGGCAGGCTCCCGCAGCCA
TCAGACCGGTGTAAACGGTGAAAATAACAGCGTCCGTCTCAGCATTCAGGGCGGTCATCTCGGTCACGATAACAACGGCG
GTATTGCCCGTGGAGCCACGCCGGAAAGCAGCGGCAGCTATGGCTTCGTCCGTCTGGAGGGTGACCTGCTCAGAACAGAG
GTTGCCGGTATGTCTCTGACGACAGGGGGGTATGGTGCTGCAGGCCATTCTTCCGTTGATGTTAAGGATGATGACGGCTC
CCGTGCCGGCACGGTCCGGGATGATGCCGGCAGCCTGGGCGGATACCTGAATCTGACACACACGTCCTCCGGCCTGTGGG
CTGACATTGTGGCACAGGGAACCCGCCACAGCATGAAAGCGTCATCGGACAATAACGACTTCCGCGCCCGGGGCTGGGGC
TGGCTGGGCTCACTGGAAACCGGTCTGCCCTTCAGTATCACTGACAACCTGATGCTGGAGCCACAACTGCAGTACACCTG
GCAGGGACTCTCCCTGGACGACGGCCAGGATAACGCCGGTTATGTGAAGTTCGGGCATGGCAGTGCACAACATGTGCGTG
CCGGTTTCCGTCTGGGCAGCCACAACGATATGACCTTTGGCGAAGGCACCTCATCCCGTGCCCCCCTGCGTGACAGTGCA
AAACACAGTGTGAGTGAATTACCGGTGAACTGGTGGGTACAGCCTTCTGTTATCCGCACCTTCAGCTCCCGGGGTGACAT
GAGCATGGGTACAGCTGCAGCTGGCAGTAACATGACGTTCTCACCGTCACAGAATGGCACGTCACTGGACCTGCAGGCCG
GACTGGAAGCCCGTGTCCGGGAAACTATCACCCTGGGCGTTCAGGCCGGTTATGTCCACAGCGTCAGCGGCAGCAGCGCT
GAAGGTTATAACGGTCAGGCCACACTGAATATGACTTTCTGA

Protein sequence :
MLFCTCRYPLLWVPAFLFTLCKEKLMKRHLNTSYRLVWNHITGSLVVASELARSRGKRAGVAVALSLAAVTSVPALAADT
VVQAGETVNGGTLTNHDNQIILGTANGMTISTGLEYGPDNEANTGGQWIQNGGTANNTTVTGGGLQRVNAGGSVSDTVIS
AGGGQSLQGQAVNTTLNGGEQWVHEGGIATGTVINEKGWQAVKSGALATDTVVNTGAEGGPDAENGDTGQTVYGDAIRTT
INKNGRQIVAAEGTANTTVVYAGGDQTVHGHALDTTLNGGYQYVHNGGTATDTVVNSDGWQVVKDGAVAENTTVNQKGKL
QVNAGGTATNVTLKQGGALVTSTAATVTGSNRLGNFAVENGKADGVVLESGGRLDVLESHSAQNILVDDGGTLAVSVGGK
ATDVTMTSGGALIADSGATVEGTNASGKFSIDGISGQASGLLLENGGSFTVNAGGLASNTTVGHRGTLTLAAGGSLSGRT
QLSKGASMVLNGDVVSTGDIVNAGEIRFDNQTTPDAALSRAVAKSNSPVTFHKLTTSNLTGQGGTINMRVSLDGSNDSDQ
LVINGGQATGKTWLAFTNVGNSNLGVATTGQGIRVVDAQNGATTEEGAFALSRPLQAGAFNYTLNRDSDEDWYLRSENAY
RAEVPLYTSMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTE
VAGMSLTTGGYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNLTHTSSGLWADIVAQGTRHSMKASSDNNDFRARGWG
WLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRAPLRDSA
KHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPSQNGTSLDLQAGLEARVRETITLGVQAGYVHSVSGSSA
EGYNGQATLNMTF
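
The protein sequence can be related back to the DNA sequence by codon-wise translation. The sketch below assumes the standard codon assignments and translates only the first 30 and last 12 nucleotides of the DNA sequence given above; the helper name `translate` is hypothetical and not part of any database tool:

```python
from itertools import product

# Standard codon table built in TCAG order (assumed to apply to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 12 nt of the DNA sequence in this record.
head = translate("ATGCTGTTTTGTACCTGCCGGTATCCACTT")
tail = translate("ATGACTTTCTGA")
print(head, tail)
```

The translated head corresponds to the start of the protein sequence (MLFCTCRYPL...), and the tail ends in MTF followed by a stop codon, consistent with the final residues listed.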

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
unnamed | CAE85197.1 | antigen43 protein orthologue | Not tested | PAI V 536 | Protein | 0.0 | 95
unnamed | AAL08472.1 | putative autotransporter | Not tested | SRL | Protein | 0.0 | 95
Z1211 | NP_286746.1 | adhesin | Not tested | TAI | Protein | 0.0 | 90
Z1651 | NP_287154.1 | adhesin | Not tested | TAI | Protein | 0.0 | 90
S3195 | NP_838479.1 | outer membrane fluffing protein | Not tested | SHI-1 | Protein | 0.0 | 64
aec67 | AAW51750.1 | Aec67 | Not tested | AGI-3 | Protein | 0.0 | 64
aidA | ADD91708.1 | AidA | Not tested | PAI-I AL862 | Protein | 0.0 | 60
flu | CAI43838.1 | antigen 43 | Not tested | LEE | Protein | 0.0 | 59
unnamed | CAD66200.1 | hypothetical protein | Virulence | PAI III 536 | Protein | 0.0 | 56
sap | AAK00474.1 | Sap | Not tested | SHI-1 | Protein | 0.0 | 56
SF2991 | NP_708765.1 | outer membrane fluffing protein | Not tested | SHI-1 | Protein | 0.0 | 56

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
i02_4885a | YP_006152022.1 | antigen 43 precursor (AG43) (Fluffing protein) | VFG1063 | Protein | 0.0 | 95
i02_4885a | YP_006152022.1 | antigen 43 precursor (AG43) (Fluffing protein) | VFG1675 | Protein | 0.0 | 56
i02_4885a | YP_006152022.1 | antigen 43 precursor (AG43) (Fluffing protein) | VFG0655 | Protein | 0.0 | 56