Gene Information

Name : BURPS668_A2956 (BURPS668_A2956)
Accession : YP_001063947.1
Strain :
Genome accession: NC_009075
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3519
EC number : -
Position : 2808897 - 2810768 bp
Length : 1872 bp
Strand : -
Note : identified by match to protein family HMM PF05947
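
The Position and Length fields above agree with each other if the coordinates are read as 1-based and inclusive. A minimal Python sketch of that arithmetic follows. The function name feature_length is hypothetical, and the inclusive-coordinate convention is an assumption, not something the record states.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Values copied from the Position field of this record.
length_bp = feature_length(2808897, 2810768)

# A coding sequence of this length holds length_bp // 3 codons;
# the final codon is the stop codon and adds no residue.
codons = length_bp // 3
residues = codons - 1

print(length_bp, codons, residues)  # 1872 624 623
```

The 1872 bp result matches the Length field, and 623 residues matches the length of the protein sequence listed in this record.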

DNA sequence :
ATGGATACGCGCCTGCTCGACTACTACAACCGCGAGCTCGCGTATCTGCGCGAGTTGGGCGGCGAGTTCGCGCAGCAGTT
TCCGAAAGTGGCCGCGCGCCTGCGGATGCACGAATCGGGGCCGCCCGATCCGTACGTCGAGCGGCTGCTCGAAGGCTTCA
GCTTTCTCACCGCGCGCGTGCAACTGAAGATGGACGCGGAGTTTCCGCGCTTCACGCAGGCGCTGCTCGACGCGGTGTAT
CCGGGTTACGTCGCGCCGCTTCCGTCGATGGCGATCGTGCAGTTCACGCCGATGATGAACGAAGGCAGCCTCGCGCAGGG
CTACCGGCTGCCGGCGGGCACCGCGCTGCGCGCGCGGCCCGCCGCGGCCGAACAGACCGCGTGCGAGTTTCGCACCGCGC
ACGATCTGACGCTGTGGCCGCTGGAGCTCGCGGGCGCTTCGGTGACGGGCGCGCCCGCGTATCTGCCGCGTTCGGCGACG
GCCGCGCGCCGCGACGTGCGCGGCGCGCTGCGCATCCGGCTGAAGGCGCGCGGCGGCGCGGGCCTCGCGCAACTGCCGAT
CGATCGGCTGATGTTCCACCTGGCGGGCCCCGAGCGCGACGCGCTGCATCTGCTCGAACTGATCGCCGGGCATACGATCG
GCGTCGTCTGCCACGACGCGGCGCAGCCGCCGCGCTGGCTGCACGCGCTCGGCGCGCACGCGCTCGCGCATCAGGGCTTC
GACGCCGATCAGGCGCTGCTGCCCGACGAAGGCCGCAGCTTCCACGGCTACCGGCTGCTGCGCGAGTACTTCGCGTTTCC
CGCGCGCTTCCTGTTCTTCAGCATCGAAGGATTGCGGCCCGCGCTCGCGCGCGCGACGGGCGACACGTTCGAGCTGACGC
TGCTGCTCGATCGGCACGACGCGGCGCTCGAGAACAGCGTCGATGGGCGGCGCCTCGCGTTGAACTGCACGCCGGCCGTC
AACCTGTTCGCGCGGCGCGCGGACCGCATTCCGGTCCATCCGGGCGCCCGCGAGCATCATGTCGTCGTCGATCGCAGCCG
GCCGCTCGACTACGAGGTCTACGCGGTGCGGCGGCTCGCGGGCGAGCAGCGCGACGACGGGCGGACGCGCGCGTTCCGGC
CGTTCCATGCGTCGTTCGCGGGCGACGGCGGCAATTACGGCGCGTACTACACGGTGCGCCGCGAGCCGCGCCTCGTGTCC
GCGCAGGCGCGCGCGAACGGCACGCGCACCGGCTACGTCGGCAGCGAGACGTTCGTGTCGCTCGTCGATAGCGCGTGCGC
GCCGTATGACGAATCGATCCGCTATCTGTCCGTCGACACGCTGTGCACGAACCGCGATCTCGTCCTGCTGTTGCCGGCGG
GCGACGCGAACGCGTTCACGCTGCGCGTGTCGGCGCCCGTCGAGCGGATCGCCATGATCCGCGGGCCGTCGCGGCCGCGC
CCGCCGCTCGCCGACGCGCAGAGCGCGTGGCGGCTCGTGAGCCATCTCGGGCTCGCGCGCCACACGCTGACCGATGTCGA
CGACGAAGAAGGCGCGCGCGTGCTGCGCGAATTGCTCGGTCTGCACGCGGACCCGGCCGATGCGGCGATGCGCCGGCAGA
TCGACGGCGTGCATCGTGTCGCGTTCGCGCCGGTGTTTCGCCGGCTGCCCGCCGCCGGGCCGCTGATGTTCGGGCGCGGC
GTGCAGGTGGACGTGACCGTCGACGATCATGCGTTTTCCGGCGACAGCCCCTATTTGCTCGGCGCGGTGCTCGAGCAGTT
TTTCGCGCGGCACGTGTCGATCAACTCGTTCGCCGAATGCGTGCTGAGCAGCGCGCAGCGCGGCAGGCTCGCGCAATGGC
CGGCGCGCGTCGGCAGGCGGCCCGCGATATGA

Protein sequence :
MDTRLLDYYNRELAYLRELGGEFAQQFPKVAARLRMHESGPPDPYVERLLEGFSFLTARVQLKMDAEFPRFTQALLDAVY
PGYVAPLPSMAIVQFTPMMNEGSLAQGYRLPAGTALRARPAAAEQTACEFRTAHDLTLWPLELAGASVTGAPAYLPRSAT
AARRDVRGALRIRLKARGGAGLAQLPIDRLMFHLAGPERDALHLLELIAGHTIGVVCHDAAQPPRWLHALGAHALAHQGF
DADQALLPDEGRSFHGYRLLREYFAFPARFLFFSIEGLRPALARATGDTFELTLLLDRHDAALENSVDGRRLALNCTPAV
NLFARRADRIPVHPGAREHHVVVDRSRPLDYEVYAVRRLAGEQRDDGRTRAFRPFHASFAGDGGNYGAYYTVRREPRLVS
AQARANGTRTGYVGSETFVSLVDSACAPYDESIRYLSVDTLCTNRDLVLLLPAGDANAFTLRVSAPVERIAMIRGPSRPR
PPLADAQSAWRLVSHLGLARHTLTDVDDEEGARVLRELLGLHADPADAAMRRQIDGVHRVAFAPVFRRLPAAGPLMFGRG
VQVDVTVDDHAFSGDSPYLLGAVLEQFFARHVSINSFAECVLSSAQRGRLAQWPARVGRRPAI
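
As a quick consistency check between the DNA and protein sequences above, the sketch below translates the first 30 and last 12 nucleotides of the DNA sequence with the standard codon assignments. The names CODON_TABLE and translate are hypothetical helpers written for this illustration. The record gives the strand as "-", but the sequence as listed starts with ATG and ends with TGA, so it appears to be in coding orientation already; under that assumption no reverse complement is applied.

```python
# Standard codon assignments, built in the usual TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 12 nucleotides copied from the DNA sequence of this record.
first_30 = "ATGGATACGCGCCTGCTCGACTACTACAAC"
last_12 = "CCCGCGATATGA"

print(translate(first_30))  # MDTRLLDYYN
print(translate(last_12))   # PAI
```

Both fragments reproduce the corresponding ends of the listed protein sequence (MDTRLLDYYN at the start, PAI at the end).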

• Homologs from PAI DB

Gene     GenBank Accn  Product               Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity (%)
STY0289  NP_454871.1   hypothetical protein  Not tested               SPI-6       Protein         9e-124  45

• Homologs from VFDB (virulence genes)

Gene            GenBank Accn    Product               ID of source DB  Alignment Type  E-val   Identity (%)
BURPS668_A2956  YP_001063947.1  hypothetical protein  VFG2074          Protein         4e-135  48