Gene Information

Name : ECUMN_2677 (ECUMN_2677)
Accession : YP_002413386.1
Strain : Escherichia coli UMN026
Genome accession: NC_011751
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : N : Cell motility
COG ID : COG3188
EC number : -
Position : 2751789 - 2754440 bp
Length : 2652 bp
Strand : -
Note : Evidence 4 : Homologs of previously reported genes of unknown function

DNA sequence :
ATGGATACGCCGATGCCAGAACATTCTTTTCTCCGTTTACGGGGGCTTTCGTGGTGGATTGCTCTGGCAATTTCTGGAAG
TCCATTTAATGCCCTGGCAGACGATACCATTCAGTTTGATGGCCGATTTCTTGATTTAAAAGGCAATACCAAAATTGATT
TGGGCCGTTTTTCGCAAAAAGGCTATGTAGAACCGGGAAAATATAACTTACGCGTTCATGTAAATAATCAGCCGTTACCT
GACGACTACGATATCTACTGGTACGCAACTGAAAACGATCCGAATAAATCATATGCCTGCTTATCGCCAGAGCTGGTTGC
CCAGTTTGGTCTGAAGGAAGATATTGCGAAGAATTTGCAATGGATCCGCGATGGTCAGTGTCTGAATACGGCACTTTTAG
CAGGTACAGAAATTAGCGGCGATTTAGGCCAGTCTGCACTCCTTGTTTCTGTACCGCAGGCTTATCTGGAATATACCGAC
AGCGAGTGGGACCCGCCTTCGCGCTGGGATGACGGTATCCCTGGGCTTATTGCCGATTACAGTATCAACGCGCAAACCCG
ACATGAAAATGGGGGTGATGACACTAACGACATCAGCGGTAATGGTACGGTCGGCGTTAACGTTGGGCCGTGGCGCTTAC
GAGCTGACTGGCAGAGCGATTATCAGCATACCCGCAGTAACGATGACGGCGACACTGACGATAGCGGTACGCAAAAAAAC
TGGGAGTGGAGTCGTTACTACGCCTGGCGCGCCTTGCCGTCATTAAAAGCCAAACTTTCGCTGGGCGAAGATTATCTCAA
CTCCGATATTTTTGATGGCTTCAGTTACATTGGCGGCAGCATCAGTACCGATGATCAAATGTTACCGCCAAACCTGCGCG
GCTACGCCCCCGATATTTCAGGTGTTGCACACACTACCGCTAAAGTGACTGTAACCCAAATGGGCCGTGTCATTTACGAA
ACCCAGGTGCCCGCAGGGCCATTCCGCATTCAGGATATTGGCGATTCCGTCTCTGGCACGCTGCATGTCCGCATTGAAGA
ACAAAACGGCCAGGTACAAGAATACGACGTCAGCACGGCGTCCATGCCGTTCCTGACTCGCCCCGGCCAGGTGCGTTATA
AAGTGACAATGGGGCGTCCACAAAACTGGGACCATCAGGTTGCAGGTAGCTTCTTCTCGGGCGGCGAGGCATCGTGGGGG
ATTGCCAATGGCTGGTCGCTCTACGGCGGTGCGTTAGCCGATGAAAACTACCAGTCGGCGGCGCTGGGCCTCGGTCGTGA
CCTGGCGCTGCTAGGCGCGTTAGCGTTTGATGTCACTCACTCTCGTGTTCAACTCGACGATAACAGCGTGTATGGCAACA
AAACGCTGGACGGCAACTCTTATCGCGTCAGCTATGCCAAAGATTTTGATGAACTCAACAGTCGGGTGACGTTTGCTGGA
TACCGCTTCTCTGAAAAAAACTACATGACCATGAGCGAGTATCTGGATGCGAACGACGACGACCGGGCGCGCACCGGTAA
CGACAAAGAGATGTATACGGTCACCTATAACCAGAACTTCACGGATGCGCGCGTTTCGGTCTATCTCAACTATTCCCACC
ATACCTACTGGGATCGCCAGGATCAGACCAACTACAACATGATGCTTTCTCATTATTTCAATTTAGGAAGCCTTCGCAAC
TTGAGCGTTTCTCTGACCGGTTATCGCTACGAATACGATAAAAGTGCTGACAAGGGGGTTTATCTGTCCTTAAGTTTGCC
GTGGGGCGACAACAGCACCATCAGTTACAACGGCAACTATGGCAGCGGTGCCGACAGCAATCAGGTGAGCCTGTACCACC
GCATTGATGACGCCAGCCACTACACGGTAAGCGCCGGAACCAGCGAAAACCACAGCAGCGTGGATGGCTACTACAGCCAC
GATGGTACTCTGGCAAAAGTTGATCTCAGTGCCAACTACCATGAAGGGCAATACACCTCAGCAGGGATCTCTTTACAGGG
CGGGGCAACGTTGACTGCTCACGGCGGGGCGCTGCATCGTACACAAAACATGGGCGGTACACGTCTGTTAATCGACGCTG
ATGGCGTTGCTAATGTGCCGGTTGAAGGTAATGGTTCCGCCGTTTATACCAACATGTTCGGCAAAGCGGTGGTGGCAGAT
GTTAATGATTACTACCGCAACCAGGCTTATATCGATTTGAACAAACTGCCGGAAAACGCCGAAGCCACGAAATCAGTGGT
TCAGGCAACGCTGACGGAAGGCGCTATTGGCTACCGTAAATTTGCGGTGATCAGTGGTGAGAAGGCGATGGCAGTGCTGC
GCTTGCAGGATGGCAGCCATCCGCCATTTGGGGCGGAAGTGAAAAACGATAATCAACAACAAGTTGGATTGGTTGATGAC
GAAGGTAATGTTTATCTGGCAGGCGTTAAACCGGGTGAACATATGACCGTTTTCTGGGAGGGTGAATCTCACTGCGATAT
CAGCCTGCCAGACCCGTTACCGAATGACCTGTTCAATGGCTTGTTGTTGCCATGCCAGCAAAAGGGGGGAAGTTCTCCCG
TCATACCGCATGATATTCAACCCGTCATTCAGGAGCAGACACAACAGGTGACACCAATGGAACCGCCGATGTCTGTTTCA
TCGAACCAATAA

Protein sequence :
MDTPMPEHSFLRLRGLSWWIALAISGSPFNALADDTIQFDGRFLDLKGNTKIDLGRFSQKGYVEPGKYNLRVHVNNQPLP
DDYDIYWYATENDPNKSYACLSPELVAQFGLKEDIAKNLQWIRDGQCLNTALLAGTEISGDLGQSALLVSVPQAYLEYTD
SEWDPPSRWDDGIPGLIADYSINAQTRHENGGDDTNDISGNGTVGVNVGPWRLRADWQSDYQHTRSNDDGDTDDSGTQKN
WEWSRYYAWRALPSLKAKLSLGEDYLNSDIFDGFSYIGGSISTDDQMLPPNLRGYAPDISGVAHTTAKVTVTQMGRVIYE
TQVPAGPFRIQDIGDSVSGTLHVRIEEQNGQVQEYDVSTASMPFLTRPGQVRYKVTMGRPQNWDHQVAGSFFSGGEASWG
IANGWSLYGGALADENYQSAALGLGRDLALLGALAFDVTHSRVQLDDNSVYGNKTLDGNSYRVSYAKDFDELNSRVTFAG
YRFSEKNYMTMSEYLDANDDDRARTGNDKEMYTVTYNQNFTDARVSVYLNYSHHTYWDRQDQTNYNMMLSHYFNLGSLRN
LSVSLTGYRYEYDKSADKGVYLSLSLPWGDNSTISYNGNYGSGADSNQVSLYHRIDDASHYTVSAGTSENHSSVDGYYSH
DGTLAKVDLSANYHEGQYTSAGISLQGGATLTAHGGALHRTQNMGGTRLLIDADGVANVPVEGNGSAVYTNMFGKAVVAD
VNDYYRNQAYIDLNKLPENAEATKSVVQATLTEGAIGYRKFAVISGEKAMAVLRLQDGSHPPFGAEVKNDNQQQVGLVDD
EGNVYLAGVKPGEHMTVFWEGESHCDISLPDPLPNDLFNGLLLPCQQKGGSSPVIPHDIQPVIQEQTQQVTPMEPPMSVS
SNQ

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
prfC CAD42029.1 PrfC protein Virulence PAI II 536 Protein 4e-150 45
papC NP_755465.1 PapC protein Virulence PAI I CFT073 Protein 5e-150 45
papC_2 NP_757034.1 PapC protein Virulence PAI II CFT073 Protein 5e-150 45
papC YP_002414015.1 Outer membrane usher protein PapC Not tested Not named Protein 4e-150 45
papC AAZ04424.1 outer membrane usher protein Virulence PAI I APEC-O1 Protein 2e-148 44
papC YP_854254.1 outer membrane usher protein PapC Virulence PAI I APEC-O1 Protein 2e-148 44
pixC CAE85159.1 PixC protein Not tested PAI V 536 Protein 1e-153 44

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
ECUMN_2677 YP_002413386.1 hypothetical protein VFG1548 Protein 3e-150 45
ECUMN_2677 YP_002413386.1 hypothetical protein VFG0884 Protein 2e-150 45
ECUMN_2677 YP_002413386.1 hypothetical protein VFG0895 Protein 2e-150 45