Gene Information

Name : BURPS1106A_A2831 (BURPS1106A_A2831)
Accession : YP_001076860.1
Strain :
Genome accession: NC_009078
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3519
EC number : -
Position : 2774607 - 2776478 bp
Length : 1872 bp
Strand : -
Note : identified by match to protein family HMM PF05947

DNA sequence :
ATGGATACGCGCCTGCTCGACTACTACAACCGCGAGCTCGCGTATCTGCGCGAGTTGGGCGGCGAGTTCGCGCAGCAGTT
TCCGAAAGTGGCCGCGCGCCTGCGGATGCACGAATCGGGGCCGCCCGATCCGTACGTCGAGCGGCTGCTCGAAGGCTTCA
GCTTTCTCACCGCGCGCGTGCAACTGAAGATGGACGCGGAGTTTCCGCGCTTCACGCAGGCGCTGCTCGACGCGGTGTAT
CCGGGTTACGTCGCGCCGCTTCCGTCGATGGCGATCGTGCAGTTCACGCCGATGATGAACGAAGGCAGCCTCGCGCAGGG
CTACCGGCTGCCGGCGGGCACCGCGCTGCGCGCGCGGCCCGCCGCGGCCGAACAGACCGCGTGCGAGTTTCGCACCGCGC
ACGATCTGACGCTGTGGCCGCTGGAGCTCGCGGGCGCTTCGGTGACGGGCGCGCCCGCGTATCTGCCGCGTTCGGCGACG
GCCGCGCGCCGCGACGTGCGCGGCGCGCTGCGCATCCGGCTGAAGGCGCGCGGCGGCGCGGGCCTCGCGCAACTGCCGAT
CGATCGGCTGATGTTCCACCTGGCGGGCCCCGAGCGCGACGCGCTGCATCTGCTCGAACTGATCGCCGGGCATACGATCG
GCGTCGTCTGCCACGACGCGGCGCAGCCGCCGCGCTGGCTGCACGCGCTTGGCGCGCACGCGCTCGCGCATCAGGGCTTC
GACGCCGATCAGGCGCTGCTGCCCGACGAAGGCCGCAGCTTCCACGGTTACCGGCTGCTGCGCGAGTACTTCGCGTTTCC
CGCGCGCTTCCTGTTCTTCAGCATCGAAGGATTGCGGCCCGCGCTCGCGCGCGCGACGGGCGACACGTTCGAGCTGACGC
TGCTGCTCGATCGGCACGACGCGGCGCTCGAGAACAGCGTCGATGCGCGGCACCTCGCGTTGAACTGCACGCCGGCCGTC
AACCTGTTCGCGCGGCGCGCGGACCGCATTCCGGTCCATCCGGGCGCGCGCGAGCATCATGTCGTCGTCGATCGCAGCCG
GCCGCTCGACTACGAGGTCTACGCGGTGCGGCGGCTCGCGGGCGAGCAGCGCGACGACGGGCAGATGCGCGCGTTCCGGC
CGTTCCATGCGTCGTTCGCGGGCGACGGCGGCAATTACGGCGCGTACTACACGGTGCGCCGCGAGCCGCGCCTCGTGTCC
GCGCAGGCGCGCGCGAACGGCACGCGCACCGGCTACGTCGGCAGCGAGACGTTCGTGTCGCTCGTCGATAGCGCGTGCGC
GCCGTATGACGAATCGATCCGCTATCTGTCCGTCGACACGCTGTGCACGAACCGCGATCTCGTCCTGCTGTTGCCGGCGG
GCGACGCGAACGCGTTCACGCTGCGCGTGTCGGCGCCCGTCGAGCGGATCGCCATGATCCGCGGGCCGTCGCGGCCGCGC
CCGCCGCTCGCCGACGCGCAGAGCGCGTGGCGGCTCGTGAGCCATCTCGGGCTCGCGCGCCACACGCTGACCGATGTCGA
CGACGAAGAAGGCGCGCGCGTGCTGCGCGAATTGCTCGGCCTGCACGCGGACCCGGCCGATGCGGCGATGCGCCGGCAGA
TCGACGGCGTGCATCGTGTCGCGTTCGCGCCGGTGTTTCGCCGGCTGCCCGCCGCCGGGCCGCTGATGTTCGGGCGCGGC
GTGCAGGTGGACGTGACCGTCGACGATCATGCGTTCTCCGGCGACAGCCCCTATTTGCTCGGCGCGGTGCTCGAGCAGTT
TTTCGCGCGGCACGTGTCGATCAACTCGTTCGCCGAATGCGTGCTGAGCAGCGCGCAGCGCGGCAGGCTCGCGCAATGGC
CGGCGCGCGTCGGCAGGCGGCCCGCGATATGA

Protein sequence :
MDTRLLDYYNRELAYLRELGGEFAQQFPKVAARLRMHESGPPDPYVERLLEGFSFLTARVQLKMDAEFPRFTQALLDAVY
PGYVAPLPSMAIVQFTPMMNEGSLAQGYRLPAGTALRARPAAAEQTACEFRTAHDLTLWPLELAGASVTGAPAYLPRSAT
AARRDVRGALRIRLKARGGAGLAQLPIDRLMFHLAGPERDALHLLELIAGHTIGVVCHDAAQPPRWLHALGAHALAHQGF
DADQALLPDEGRSFHGYRLLREYFAFPARFLFFSIEGLRPALARATGDTFELTLLLDRHDAALENSVDARHLALNCTPAV
NLFARRADRIPVHPGAREHHVVVDRSRPLDYEVYAVRRLAGEQRDDGQMRAFRPFHASFAGDGGNYGAYYTVRREPRLVS
AQARANGTRTGYVGSETFVSLVDSACAPYDESIRYLSVDTLCTNRDLVLLLPAGDANAFTLRVSAPVERIAMIRGPSRPR
PPLADAQSAWRLVSHLGLARHTLTDVDDEEGARVLRELLGLHADPADAAMRRQIDGVHRVAFAPVFRRLPAAGPLMFGRG
VQVDVTVDDHAFSGDSPYLLGAVLEQFFARHVSINSFAECVLSSAQRGRLAQWPARVGRRPAI

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0289 NP_454871.1 hypothetical protein Not tested SPI-6 Protein 2e-123 45

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
BURPS1106A_A2831 YP_001076860.1 hypothetical protein VFG2074 Protein 3e-135 48