Gene Information

Name : esp (EFAU085_02821)
Accession : YP_008396341.1
Strain : Enterococcus faecium AUS0085
Genome accession: NC_021994
Putative virulence/resistance : Virulence
Product : Enterococcal surface protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2865948 - 2871146 bp
Length : 5199 bp
Strand : -
Note : -

DNA sequence :
ATGGTTAGCAAGAATAATAAGAGAGTATTTCTAGAAAAAACAAAGAAACGAGTATTGAAGTACAGTATCAAAAAATTAAG
TGTTGGGGTTGCATCTGTTTTGGTTGGTGTTGGTCTTGTCTTGGGAACAACTGAACTAGTTCAAGCACAAGATGAAATTT
CTCCAAGCACCCCACTTGAAACAGCGATAAGCTCCGTACAAGTAGGTGACAAAGTAGCTTCGGGGAATACTTTTCAGGAG
AATCCTGGTTATACGAAAAATTATAACTTTTCGGATTTACAATTTAGTCCTCAAGAACTAACTGGAGACACACTAAAAGG
CAATACAATTGGATTTGAAGTTTACGGAAAACATAATATTGCAGCATCAACTAAAAACTGGGAAATCCGTCTTCAATTAG
ATGAACGATTAGCTAAATATGTAGAAAAAATTCAGGTCGATCCAAAGAAAGGAATTGGATCTTCTAGACGGACCTTTGTA
AGAATTAATGATTCGCTGGGTAGACCTACAAATATTTGGAAGGTCAATTATATTCGTGCAAGTGATGGATTATTTGCTGG
GGCTGAAACAACTGATACACAAACTGCTCCTAATGGGGTAATTACGTTTGAAAAGAGCTTAGATGAAATTTTTAAAGAGA
TAGGCATAGACAATCTCAAGACCGATCGTTTAATGTATCGAATCTACTTGGTAAGTCATCAAGACGATGATAAAATTGTA
CCAGGTATTGATAGTACGGGCTACTTTTTAACAGATTCAGATGATTTCTATAATAGCTTAGATGTATCTGAAAATAATCC
TGACCAGTTTAAACATGGATCAGTAAGTGCAAAATATGAGGAACCTAATACTCAAACAAAAGATGGTTCAGGATCCACAG
GTGCTAATGGAGCTATAATCTTAGATCATAAGTTAACGAAAAATTATAATTTCTCATATTCAGCTTCCGCAAAGGGCACG
CCTTGGTATGCTAATTACAAAATTGATGAACGATTAGTTCCATATGTAGCGGGTATACAGATGCACATGGTTCAGGCTGA
CAAAGTAACATATGATGTTTCTTTTGAATCAGGGAAAAAAGTGGCTGATTTAGCAATTGAACGGCGCAAAGATCATGAAA
ATTATGGTGTAGGTTCAATTACTGATAATGATCTTACCAAACTTATTGATTTTGCTAATGCAAGTCCACGTCCAGTCGTT
ATTCGATATGTTTTACAATTGACTAAGCCATTAGATGAAATCTTAGAAGATATGAAAGCAACAGCACAAGTTGAAGAAAA
TAAACCATTTGGTGAAGATTTCATCTTTGATTCTTGGTTGTCGGATACGAATAAAAAATTAATCCAGAACACTTATGGAA
CAGGTTATTATTATTTGCAAGATATTGATGGTGATGGAAACCCTGACGATAAAGAAGAGAGCGGAGACACGAATCCATAT
ATCGGGAAACCTGAATTAGAAGAAGTATATGATGTTGACACAACAGTTAAGGGGAAAGTATTCATCCACGAGTTAGCGGG
AACAGGTCACAAAGCCCAACTTGTTGATAAAGAAGGTACTGTATTAGCAGAAAAAACTATCGCTCCAAATGAAAAAGATG
GGGCTCCAATTTCAGATACTGTAGAATTTGAATTTACGGGTGTAGATTCAAGTAAATTAATCGCGAAAGATGAATTAAAA
ATCCAAATCGTTTCTCCAGGTTTTGATAAACCAGAAGAAGGTTCAACCGTTATTAAGGAATCACCAAAAGCGGTTGATAA
ACAAACCGTGGTAGTTGGATTTAAACCAGATGCTAAAGAATCAATTCGGAATAATAAAAACTTACCTGAAGATGCAGAGT
ATTCATGGAAAACAGAGCCTGATACTTCTAACGTTACTGATAGTACGAAAGGTATTGTAACTGTTAAGATCGGAAATCGA
ACTTTCGACGTGGATGTAGAGTTTGCTGTAAAAGCTTCTCAAGCTATGGAAAATGATGCAACATACGTACCTATAACAAC
AACTCCAGAAACGACAGTTCAAAGTGGAAAACCTACATTTGATAAACCAGATGTTCCTTTAGCTAAAGATGCCTTTTCAA
TTTTAGATGTCTATAATAAGGACTTCGGTAATGCAAGTGTTGATGCAAATACTGGTATTGTTACATTCACTCCAGCTAAA
GGTGTAGGAGAATCGGAGCCGATTACTGGAACAATTCCTATTAAAATTGTTTACCAAGATGGTTCTGTAGGCACGACCGA
TTTAGCAGTAACTGTAAGTAAAGATATTTATGAAAATCCAGGAGAAAACATTCCTGCAGGCTACCACAAAGTGACCTTCA
CCGCAGGGGAAGGAACAAGTATTGAAAGTGGAACAACGGTCTTTGCAGTGAAAGATGGCGTAAGCTTACCAGAAGATAAA
CTTCCAGTGTTGAAAGCAAAAGATGGTTATACAGATGCGAAATGGCCAGAAGAAGCAACTCAACCAATTACAGCAGATGA
TACAGAATTTGTATCAAGTGCAACAAAACTGGATGATATCATTGAAAATCCAGGAGAAAACATTCCTGCAGGCTACCACA
AAGTAATCTTCACCACAGGGGAAGGAACAAGTATTGAAAGTGGAACAACGGTCTTTGCAGTGAAAGATGGCGTAAGCTTA
CCAGAAGATAAACTTCCAGTGTTGAAAGCAAAAGATGGTTATACAGATGCGAAATGGCCAGAAGAAGCAACTCAACCAAT
TACAGCAGATGATACAGAATTTGTATCAAGTGCAACAAAACTGGATGATATCATTGAAAATCCAGGAGAAAACATTCCTG
CAGGCTACCACAAAGTAATCTTCACCGCAGGGGAAGGAACAAGTATTGAAAGTGGAACAACGGTCTTTGCAGTGAAAGAC
GGCGTAAGCTTACCAGAAGATAAACTTCCAGTGTTGAAAGCAAAAGATGGTTATACAGATGCGAAATGGCCAGAAGAAGC
AACTCAACCAATTACAGCAGATGATACAGAATTTGTATCAAGTGCAACAAAACTGGATGATAAATCTGACGCTGACAAAT
ATAATCCTGAAGGTCAAAAAGTGACTACAGAATTGAATAAGGAACCAGACGCATCTGAGGGAATTAAGAGCAAAGAAGAT
TTACCAAAAGATACTAAGTATACTTGGAAAGAAAAAGTAGATGTTAGCGCAGCTGGAAATAAAAAAGGGACTGTTGTAGT
GACATATCCAGATGGATCATCTGATGAAGTTGAAGTAGATGTTACAGTGACAGACAATCGTTCTGACGCCGATAAATATG
AGCCAACAGTAGAAGGCGAAAAAGTAGAAGTTGGAGGCACAGTAGATCTAACAGATAATGTTACTAACTTGCCAACATTA
CCAGAAGGAACAACAGTAACAGATGTCACTCCTGATGGTACGATTGATACCAACACACCAGGTAATTACGAAGGGGTTAT
CGAAGTAACGTATCCAGATGGTACGAAAGATACAGTGAAAGTTCCAGTAGAAGTGACAGACAATCGTTCTGATGCCGATA
AATATACACCAATGGTAGAAGGTGAAAAAGTAGAAGTTGGAGGCACAGTAGATCTAACAGATAATGTTACTAACTTGCCA
ACATTACCAGAAGGAACAACAGTAACAGATGTCACTCCTGATGGTACGATTGATACCAACACACCAGGTAATTACGAAGG
GGTTATCGAAGTAACGTATCCAGATGGTACGAAAGATACAGTGAAAGTTCCAGTAGAAGTGACAGACAATCGTTCTGATG
CCGATAAATATACACCAATGGTAGAAGGTGAAAAAGTAGAAGTTGGAGGCACAGTAGATCTAACAGATAATGTTACTAAC
TTGCCAACATTACCAGAAGGAACAACAGTAACAGATGTCACTCCTGGTGGTACGATTGATACCAACACACCAGGTAATTA
CGAAGGGGTTATCGAAGTAACGTATCCAGATGGTACGAAAGATACAGTGAAAGTTCCAGTAGAAGTGACAGACAATCGTT
CCGATGCCGATAAATATGAACCAACAGTAGAAGGCGAAAAAGTAGAAGTTGGAGGCACAGTAGATCTAACAGATAATGTT
ACTAACTTGCCAACATTACCAGAAGGAACAACAGTAACAGATGTCACTCCTGGTGGTACGATTGATACCAACACACCAGG
TAATTACGAAGGGGTTATCGAAGTAACGTATCCAGATGGTACGAAAGATACAGTGAAAGTTCCAGTAGAAGTGACAGACA
ATCGTTCTGATGCCGATAAATATACACCAATGGTAGAAGGTGAAAAAGTAGAAGTTGGAGGCACAGTAGATCTAACAGAT
AATGTTACTAACTTGCCAACATTACCAGAAGGAACAACAGTAACAGATGTCACTCCTGATGGTACGATTGATACCAACAC
ACCAGGTAATTACGAAGGGGTTATCGAAGTAACGTATCCAGATGGTACGAAAGATACAGTGAAAGTTCCAGTAGAAGTGA
CAGACAATCGTTCCGATGCCGATAAATACAATCCTGAAGGACAAAAAGTGACTACAGACTTAAATAAAGAGCCAGACGCA
TCTGAGGGAATTAAGAACAAAGAAGATCTACCAAAAGGAACTACTTATACTTGGAAAGAGAAAGTAGATGTTAGTACAGC
TGGAAACAAAAAAGGGATTGTAGTAGTAACATATCCAGATGGATCTAAAGAAGAAGTAGAGGTTACTATTTCTGTAGAAG
ATAAAAAAGCACCGAATAAACCTCAAGTTGATCCTATTACAGAGGGTGACCAAATTGTTACTGGTAAAACTGAACCAAAT
GCAGAGGTGACAGTTACATTACCAAACGGGAGTCAATACCATGGGACAGCTGATAAAAATGGTAATTTTACAGTTAAAGT
TCCTAAATTAGAGGCAGGAACAAAAGTGATAGTAACTGCAACTGATGAATCTGGTAATACTAGTGAACCTACTAATGTAG
TTGTTTCTTCAAACGAAAAAGACAGTGAAAAAGCCGTTAGTAAAGATAATAAGACTGACAATCAAGGTAGCAAGCAAAAC
AAAAATAGAGGCAAAAGCAGTCCTCAAAAGCAGTCCTCAAAAGCTTATCCTAAGACGGGAGAGATTGATAGTAATATCTT
TACTATTAGTGGTGGTTTAATCCTGTTAGGAACTTTAGGGTTATTAGGGTATAAAAATCGAAAAAAAGAGAATGAATAG

Protein sequence :
MVSKNNKRVFLEKTKKRVLKYSIKKLSVGVASVLVGVGLVLGTTELVQAQDEISPSTPLETAISSVQVGDKVASGNTFQE
NPGYTKNYNFSDLQFSPQELTGDTLKGNTIGFEVYGKHNIAASTKNWEIRLQLDERLAKYVEKIQVDPKKGIGSSRRTFV
RINDSLGRPTNIWKVNYIRASDGLFAGAETTDTQTAPNGVITFEKSLDEIFKEIGIDNLKTDRLMYRIYLVSHQDDDKIV
PGIDSTGYFLTDSDDFYNSLDVSENNPDQFKHGSVSAKYEEPNTQTKDGSGSTGANGAIILDHKLTKNYNFSYSASAKGT
PWYANYKIDERLVPYVAGIQMHMVQADKVTYDVSFESGKKVADLAIERRKDHENYGVGSITDNDLTKLIDFANASPRPVV
IRYVLQLTKPLDEILEDMKATAQVEENKPFGEDFIFDSWLSDTNKKLIQNTYGTGYYYLQDIDGDGNPDDKEESGDTNPY
IGKPELEEVYDVDTTVKGKVFIHELAGTGHKAQLVDKEGTVLAEKTIAPNEKDGAPISDTVEFEFTGVDSSKLIAKDELK
IQIVSPGFDKPEEGSTVIKESPKAVDKQTVVVGFKPDAKESIRNNKNLPEDAEYSWKTEPDTSNVTDSTKGIVTVKIGNR
TFDVDVEFAVKASQAMENDATYVPITTTPETTVQSGKPTFDKPDVPLAKDAFSILDVYNKDFGNASVDANTGIVTFTPAK
GVGESEPITGTIPIKIVYQDGSVGTTDLAVTVSKDIYENPGENIPAGYHKVTFTAGEGTSIESGTTVFAVKDGVSLPEDK
LPVLKAKDGYTDAKWPEEATQPITADDTEFVSSATKLDDIIENPGENIPAGYHKVIFTTGEGTSIESGTTVFAVKDGVSL
PEDKLPVLKAKDGYTDAKWPEEATQPITADDTEFVSSATKLDDIIENPGENIPAGYHKVIFTAGEGTSIESGTTVFAVKD
GVSLPEDKLPVLKAKDGYTDAKWPEEATQPITADDTEFVSSATKLDDKSDADKYNPEGQKVTTELNKEPDASEGIKSKED
LPKDTKYTWKEKVDVSAAGNKKGTVVVTYPDGSSDEVEVDVTVTDNRSDADKYEPTVEGEKVEVGGTVDLTDNVTNLPTL
PEGTTVTDVTPDGTIDTNTPGNYEGVIEVTYPDGTKDTVKVPVEVTDNRSDADKYTPMVEGEKVEVGGTVDLTDNVTNLP
TLPEGTTVTDVTPDGTIDTNTPGNYEGVIEVTYPDGTKDTVKVPVEVTDNRSDADKYTPMVEGEKVEVGGTVDLTDNVTN
LPTLPEGTTVTDVTPGGTIDTNTPGNYEGVIEVTYPDGTKDTVKVPVEVTDNRSDADKYEPTVEGEKVEVGGTVDLTDNV
TNLPTLPEGTTVTDVTPGGTIDTNTPGNYEGVIEVTYPDGTKDTVKVPVEVTDNRSDADKYTPMVEGEKVEVGGTVDLTD
NVTNLPTLPEGTTVTDVTPDGTIDTNTPGNYEGVIEVTYPDGTKDTVKVPVEVTDNRSDADKYNPEGQKVTTDLNKEPDA
SEGIKNKEDLPKGTTYTWKEKVDVSTAGNKKGIVVVTYPDGSKEEVEVTISVEDKKAPNKPQVDPITEGDQIVTGKTEPN
AEVTVTLPNGSQYHGTADKNGNFTVKVPKLEAGTKVIVTATDESGNTSEPTNVVVSSNEKDSEKAVSKDNKTDNQGSKQN
KNRGKSSPQKQSSKAYPKTGEIDSNIFTISGGLILLGTLGLLGYKNRKKENE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
esp AAM21183.1 surface protein Not tested Not named Protein 0.0 87
ef0056 AAM75261.1 EF0056 Virulence Not named Protein 0.0 87

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
esp YP_008396341.1 Enterococcal surface protein VFG2179 Protein 0.0 87