Gene Information

Name : ECBD_1322 (ECBD_1322)
Accession : YP_003035570.1
Strain : Escherichia coli Escherichia coli BL21-Gold(DE3)pLysS AG
Genome accession: NC_012947
Putative virulence/resistance : Virulence
Product : fimbrial biogenesis outer membrane usher protein
Function : -
COG functional category : N : Cell motility
COG ID : COG3188
EC number : -
Position : 1397578 - 1400223 bp
Length : 2646 bp
Strand : +
Note : PFAM: fimbrial biogenesis outer membrane usher protein; KEGG: ssn:SSON_2394 PapC-like porin protein

DNA sequence :
ATGCCTGACCATTCTCTTTTTCGATTACGGATACTTCCGTGGTGCATTGCGCTGGCAATGTCAGGGAGTTATAGCAGTGT
CTGGGCTGAAGACGACATTCAGTTTGATTCCCGTTTTCTGGAATTAAAAGGCGACACGAAAATTGATCTGAAGCGTTTTT
CCAGTCAGGGATATGTTGAGCCCGGAAAATACAATTTACAGGTTCAACTAAATAAACAGCCATTGGCGGAAGAGCACGAT
ATTTACTGGTACGCTGGTGAAGATGACGCGAGCAAAACTTATGCTTGTCTGACACCGGAACTGGTGGCGCAGTTTGGTTT
AAAAGAAGATGTGGCGAAAAATCTGCAATGGAGCCACGATGGTAAATGCCTGAAACCCGGTCAACTGGAAGGCATGGAAA
TTAAGGCTGATTTAAGCCAGTCCGCATTAGTCATTTCATTACCGCAGGCTTACCTCGAATATACCTGGCCCGACTGGGAT
CCGCCTTCTCGTTGGGATGATGGCATCTCCGGGATCATCGCGGACTACAGCATCACTGCGCAAACACGACACGAAGAAAA
TGGCGGTGATGACTCTAACGAGATCAGCGGCAACGGGACGGTCGGGGTTAACCTGGGGCCGTGGCGTGTGCGTGCCGACT
GGCAGACCGACTATCAACATACCCGCAGTAATGATGATGACGATGAATTTAGCGGCGATGACACACAAAAAAAATGGGAG
TGGAGTCGCTACTATGCCTGGCGGGCGTTACCGTCATTGAAAGCCAAACTGGCGCTGGGCGAAGATTACCTCAATTCCGA
TATTTTCGACGGTTTTAACTATGTTGGCGGCAGTGTCAGTACTGACGATCAAATGTTGCCTCCCAACCTGCGTGGCTACG
CGCCAGACATTTCCGGCGTGGCGCACACCACAGCAAAAGTGACCGTCAGCCAGATGGGGCGTGTGATTTACGAAACGCAG
GTTCCGGCCGGGCCGTTTCGTATTCAGGATCTTGGTGATTCCATCTCCGGTACGTTGCATGTTCGCATTGAAGAACAGAA
CGGCCAGGTGCAGGAATATGACATCAGCACCGCCTCGATGCCATACCTTACTCGCCCAGGCCAGGTTCGTTATAAAGTCA
TGATGGGACGTCCGCAAGAGTGGGGCCACCATGTCGAGGGGGGATTTTTCTCTGGTGCTGAAGCCTCCTGGGGGATCGCT
AACGGTTGGTCGCTATATGGCGGCGCGCTGGGAGATAAAAACTATCAGTCTGCGGCACTTGGCATCGGTCGCGATTTGTC
TACGTTCGGCGCGGTTGCGTTTGATGTTACCCACTCGCATACCAAACTGGATAAAGACACCGCTTATGGCAAAGGTTCGC
TGGACGGTAACTCCTTCCGTGTGAGTTATTCCAAAGACTTTGACCAGCTCAACAGTCGCGTCACCTTCGCTGGATATCGC
TTCTCGGAAGAGAACTTTATGACCATGAGCGAGTACCTGGATGCCAGTGACAGCGAAATGGTCCGCACGGGCAACGACAA
AGAGATGTACACCGCCACGTATAACCAGAACTTCCGCGATGCGGGTGTTTCGGTTTATCTCAACTATACCCGCCATACTT
ACTGGGATCGCGAGGAGCAGACAAACTACAACATCATGCTCTCCCACTATTTCAATATGGGTAGCATTCGCAATATGAGC
GTTTCCCTGACTGGCTACCGCTACGAGTATGACAACCGGGCGGATAAGGGCATGTACATTTCGCTCAGTATGCCGTGGGG
CGATAACAGCACCGTTAGCTATAACGGCAACTATGGAAGTGGGACGGACAGCAGTCAGGTCGGTTATTTCAGCCGTGTCG
ATGACGCGACTCACTATCAGTTGAACGTCGGCACCAGTGACAAACACACCAGCGTTGATGGCTACTACAGCCATGATGGT
TCGCTGGCGCAGGTTGACCTCAGCGCGAACTACCATGAAGGGCAATACACCTCTGCGGGCTTGTCGTTACAGGGTGGCGC
GACGCTTACTGCCCAAGGCGGCGCGCTTCACCGTACCCAGAATATGGGCGGGACACGCCTGTTGATTGATGCCGATGGTG
TTGCCGATGTTCCGGTGGAAGGTAACGGAGCTGCTGTTTATACCAATATGTTTGGTAAAGCCGTCGTTTCTGACGTCAAT
AACTATTACCGCAATCAGGCGTATATCGACCTCAACAAACTGCCAGAAAACGCCGAAGCAACCCAGTCGGTGGTGCAAGC
CACGCTAACTGAAGGAGCCATTGGCTACCGCAAATTTGCTGTCATCAGTGGTCAAAAAGCGATGGCTGTGCTGCGTTTAC
AAGATGGCAGCCATCCACCGTTTGGCGCAGAAGTGAAAAATGATAACCAGCAGACAGTGGGCCTCGTCGATGATGACGGC
AATGTTTATCTGGCTGGGGTGAAACCAGGCGAACATATGAGCGTGTTCTGGAGTGGTGTTGCACATTGCGATATCAACTT
GCCGGACCCGTTACCGGCCGATCTGTTTAACGGCTTGTTACTACCATGCCAGCATAAAGGCAACGTCGCGCCGGTTGTTC
CTGATGACATAAAGCCTGTCATTCAGGAGCAGACGCAGCAGGTGACACCTGGCAATCCCCCTGTTTCTATTTCAGCTAAC
CAATAA

Protein sequence :
MPDHSLFRLRILPWCIALAMSGSYSSVWAEDDIQFDSRFLELKGDTKIDLKRFSSQGYVEPGKYNLQVQLNKQPLAEEHD
IYWYAGEDDASKTYACLTPELVAQFGLKEDVAKNLQWSHDGKCLKPGQLEGMEIKADLSQSALVISLPQAYLEYTWPDWD
PPSRWDDGISGIIADYSITAQTRHEENGGDDSNEISGNGTVGVNLGPWRVRADWQTDYQHTRSNDDDDEFSGDDTQKKWE
WSRYYAWRALPSLKAKLALGEDYLNSDIFDGFNYVGGSVSTDDQMLPPNLRGYAPDISGVAHTTAKVTVSQMGRVIYETQ
VPAGPFRIQDLGDSISGTLHVRIEEQNGQVQEYDISTASMPYLTRPGQVRYKVMMGRPQEWGHHVEGGFFSGAEASWGIA
NGWSLYGGALGDKNYQSAALGIGRDLSTFGAVAFDVTHSHTKLDKDTAYGKGSLDGNSFRVSYSKDFDQLNSRVTFAGYR
FSEENFMTMSEYLDASDSEMVRTGNDKEMYTATYNQNFRDAGVSVYLNYTRHTYWDREEQTNYNIMLSHYFNMGSIRNMS
VSLTGYRYEYDNRADKGMYISLSMPWGDNSTVSYNGNYGSGTDSSQVGYFSRVDDATHYQLNVGTSDKHTSVDGYYSHDG
SLAQVDLSANYHEGQYTSAGLSLQGGATLTAQGGALHRTQNMGGTRLLIDADGVADVPVEGNGAAVYTNMFGKAVVSDVN
NYYRNQAYIDLNKLPENAEATQSVVQATLTEGAIGYRKFAVISGQKAMAVLRLQDGSHPPFGAEVKNDNQQTVGLVDDDG
NVYLAGVKPGEHMSVFWSGVAHCDINLPDPLPADLFNGLLLPCQHKGNVAPVVPDDIKPVIQEQTQQVTPGNPPVSISAN
Q

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
prfC CAD42029.1 PrfC protein Virulence PAI II 536 Protein 7e-146 44
papC YP_854254.1 outer membrane usher protein PapC Virulence PAI I APEC-O1 Protein 1e-144 44
papC AAZ04424.1 outer membrane usher protein Virulence PAI I APEC-O1 Protein 8e-145 44
papC NP_755465.1 PapC protein Virulence PAI I CFT073 Protein 1e-145 44
papC_2 NP_757034.1 PapC protein Virulence PAI II CFT073 Protein 9e-146 44
papC YP_002414015.1 Outer membrane usher protein PapC Not tested Not named Protein 9e-146 44
pixC CAE85159.1 PixC protein Not tested PAI V 536 Protein 1e-150 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
ECBD_1322 YP_003035570.1 fimbrial biogenesis outer membrane usher protein VFG1548 Protein 4e-146 44
ECBD_1322 YP_003035570.1 fimbrial biogenesis outer membrane usher protein VFG0884 Protein 5e-146 44
ECBD_1322 YP_003035570.1 fimbrial biogenesis outer membrane usher protein VFG0895 Protein 4e-146 44