Gene Information

Name : SBG_1083 (SBG_1083)
Accession : YP_004729964.1
Strain : Salmonella bongori NCTC 12419
Genome accession: NC_015761
Putative virulence/resistance : Virulence
Product : hemoglobin protease
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3468
EC number : -
Position : 1200390 - 1204544 bp
Length : 4155 bp
Strand : +
Note : -

DNA sequence :
ATGAATAAAATATACGCTCTGAAGTATAGCGTCAGGCAAGGCGCGCTTGTGCCTGTTTCCGAACTGGCAACCCACGTAAA
AAAATCATCTCGCACAGGTTTGATAAAAAAAATTATTCCGTCGCTGCTCATTAATACCATTCTGTTAGGGTATTCGGTCT
CATCGTTGGCCTCAGTAGTCAGGTATGATCTCCCTTACCAGACGATAAGAGATTTTTCTGAGAATAAAGGGCAATTCACG
CCGGGCAGTTTGAATATTCCCATTTACAACAAAACAGACGCGATTATTGGCTATTTGAATAAAGCGCCAATGCCTGATTT
CAGCAGTGCCAATCATCAGTCCGCGGTTGCTACTCTTGTTTCACCACAGTATATCGTAAGCGTAAAACATAATGGCGGTT
ATCAAAGCGTCAGTTTTGGTGATGGCGAAAATAAATATCGTCTTGTTGACAGAAATAATCAGCCGGGACGAGATTTTCAT
GCCCCACGTTTAAATAAGTTAGTTACAGAAGTTGAACCTTCTTTGATGACGCAGTCAGGGATGGTGTCTGGCGCTTATAG
TGATAAAAATCGCTATCCCACATTCTACCGGATAGGGTCCGGGACCCAGGAAATTAAACAGACAGACGGGCAAATCATCT
CCCTTTCCGGCGCATACAATTACCTGACCGGAGGAACGGCGGGTTCACTGGGGTCCTATGCTCAGGGCCAGATGATCAGT
GCCAATACCAATAAACAGTTGTATAACCTGGCGCAGGGTCCAATGGGAACCCATCCCCGCAGTGGTGATAGCGGTTCACC
TTTATTTGCCTATGATTCTGTTTTACAACAGTGGGTCATTGTTGGCGTGGATAGCTCCGGCGGTGGCGGAGGAACTAACT
GGACGGTGGTTGATGCGGATTTTGTAAACCAGTCCATTCAGGAAGATACCGATGCGCCAGTAACCTTTGTGGCCGGGCAG
GGGGCCTTACGCTGGGCATTTGACTCGACAGACGGCACGGGGACTCTTACCCAGCAAGAAACTGTTTATCAGATGCATGG
TCAGAAAGACGCAGACCTGAATGCGGGTAAGAATCTGGTATTTAATGGTGCTGATGGCCAGATTGTCCTGGAAGACTCGG
TTAACCAGGGGGCGGGGGCGCTGACGTTTAACGGCAATTATACAGTATCCACTAATAACGGCTCTACCTGGCAGGGCGCA
GGTCTGGATATTTCCCGGGATGCGGAGGTGATATGGCAGGTGAACGGGGTTCAGGGCGACAACCTGCATAAGATTGGCGA
GGGAGTGCTGAAGGTCAATGGTACCGGTATTAACCCGGGCGGGCTGAAGGTCGGGGATGGGACGGTTATTCTGGCTCAGC
GTCCGGATGAGGACGGCAAGGCACAGGCCTTCAGTTCGGTGAATATCGCCAGTGGCCGACCCACGGTGGTTCTTACCGAC
AGCCGGCAGGTTAATCCCGATAACATTAGCTGGGGGTTCAGGGGAGGCCGGCTTGATATAAACGGCAATGACGTCACTTT
TCATAAACTTAACGCCGCGGATAATGGTGCGAATATCATTAATGCCAGCGATACCTTTGCCACCGTTTCAATTAAGCCCA
TCACGGACATGACGGTGACGATTAATGACTGGGATAAAAATAAAGCTTCCGGCGGTGCTGCCGGGTTATTGTATAAATAT
AATAATCCTTACGCCCATACAGTGGATTATTTTATCCAGAAGAGGAAAGGATATGGATTTTATCCTGTCAATCAGTCAGA
TAACGACAGTTGGGAGTATGTCGGGCACAATGAGACGCAGGCCATTGAACAGGTAAAATCGCGGCGGCCCGTCGATGATC
GGATGTATCATGGTAATCTTGCTGGCAATATTGATCTTAATATTGATACCTCTCGCAGTAGCGGCGGCGTTATTTTTGAT
GGCAATATAGATACGCCTGAGGGCGGATTAATGCAGTCTGGGGGCCAACTGACGTTCCAGGGCCATCCGGTTATCCACGC
CTATAATAATAAAGGAGTGGCAGATAAGCTTAAATCTCTTGGTGACGATTCGGTCAGGACTCAACCCACCTCTTTTGACC
AGCCTGACTGGGAGGGCCGGACATTTCGTCTGAAAACGCTGTTGCTGAAAAATACCGATTTCGGCCTGGCCAGAAACGCG
TCGCTGAACGGAGATATTGAGGCAGTGCATTCGTCCGTGACGCTGGGTACGCCGAATGTGTATATCGATCTGAATGACGG
GAATGGCACAAAAGTAACGCCACAAAAGGGAACCTCAGTTGCCAGGCAAGACACGGACAGGAGCCGCTATGCCGGGAAAG
TGACGCTTGGCGAACAATCAACTCTGGACGTACGTGAAATTTTCACCGGCAGTATCCAAAGTCAGGATAGCGCTGTCACG
GTGTCCTCCCGCCATGCGACACTGGATGGTTACAGCCGGTTCGGCAACACGTCACTGGCTCTTCAGGAGGGAGCCCGATT
GACCGCTACCGGTGGGTGGTGGAGTGATTCAGACGTCATTGTCGGACCAGCCGCTACGTTGAGTCTGGCCGGTACGTCCG
TAACCGGCCAGCCGGGGCAGGTGAGTCCGGCGTTTTATTCCACCGATTATGGCGCAGGTTATCAACTGGATACCGGCAGC
CAGTTACACTTCTCTCCCTACACTTTTGTCACGGGTGATATTCGGGCAAAGGGGGATACAGGGATCTCTATTGGTGGTGA
GGACGGTGTTGCCCTGGCAGATAACCTGCCCCTGGGGGAACAGATGATGTACAGTCTGTTCAATGGCTTTCGAAACGTTT
ATTCGGGTAATGTCAGCGTCCCGCAAGGGCGAATGACAATGACGGATACGCAGTGGCAAATGCCCGGTGATTCTCATACC
GGCGCACTGCGTATGATACGGTCGCTGGCCGGCTTTACCGGTCGCGGATTTAATTCCCTGACTACGAATACGTTACAGGC
CAACCAGTCCGCTTTTGCGCTCAGAACGGACCTGAAGGACAGCGACAAAATTGTGGTGAACCAGAAAGCAGAGGGCCGGG
ATAACACCCTGTTTGTGAATTTCCTGAAAAAACCATCCGGGCAGGAGCCTCTGAATATTCCACTGGTCAGCGCCCCGGCG
GGGACAAATCCGGCGATGTTTAAGGCCGCCGAGCGGGTGACCGGGTTTAGTCTGGTGACGCCGACTCTGCACACGACAGA
ACAGGATGGCAAAATACAGTGGGTACTGGATGGCTTTAAGTCCGCGCCGGACAAGGGGTCAGCCACCTCGGCCAACAGCT
TTATGGGCATGGGATATAAAAACTTCATGACCGAAGTCAACAACCTGAACAAGCGTATGGGGGATCTGCGTGATACTCAG
GGCGAGGACGGAATGTGGGTACGTATCATGAACGGCGCCGGAACCGGTGACGCCGGATATTCTGATCGTTACACCCATCT
GCAAACGGGGTTTGATAAAAAACACCGGTTGTCAGGTGCTGACTTGTTCACTGGTGTGTTGATGAGTTATACCGACAGCA
GCGCCAGTGGACGGGCCTACAGCGGCGACACGCATTCGCTCGGGGGTGGGATGTACGCATCCGTGATGTTTGATTCGGGG
ATATATATGGATGTTATCGGCAAGTATATTCATCATGATAATGACTATAACGCCGGTTTTGCTGGTCTGGGCAAACGGAA
TTACGGTACACACTCATGGTATGCTGGCCTGGAAGGCGGATACCGTTACCGTCTGACAGAAAGCCTGTATATTGAGCCGC
AGGCGGAACTGGTATATGGAACCGTCTCCGGAACAACGCTGAAATGGAATGATAATGGTATGGATGTGTCGATGCGCAGC
AAAACGTATAATCCGTTGATAGGGCGTACAGGCGTGGCATTGGGCAAAACGTTCAGTGGCAAGGACTGGAGCGTTACGGC
CCGTACAGGTGTGGATTACCAGTTTGACCTGGTGGCCAATGGCGAGACGGCGCTACGCGATGCCTCCGGCGAGAAACGTT
TCACTGGTGAAAAAGACAGCAGAATGCTGTACAACGTGGGGCTGAATGCGCAGGTGAAGGACAATGTGCGCTTTGGACTG
GAGCTGGAGCAGTCGGCATTTGGCAAATATAATGTTGACCATGCCATAAACGCCAACTTCCGCTACATGTTCTGA

Protein sequence :
MNKIYALKYSVRQGALVPVSELATHVKKSSRTGLIKKIIPSLLINTILLGYSVSSLASVVRYDLPYQTIRDFSENKGQFT
PGSLNIPIYNKTDAIIGYLNKAPMPDFSSANHQSAVATLVSPQYIVSVKHNGGYQSVSFGDGENKYRLVDRNNQPGRDFH
APRLNKLVTEVEPSLMTQSGMVSGAYSDKNRYPTFYRIGSGTQEIKQTDGQIISLSGAYNYLTGGTAGSLGSYAQGQMIS
ANTNKQLYNLAQGPMGTHPRSGDSGSPLFAYDSVLQQWVIVGVDSSGGGGGTNWTVVDADFVNQSIQEDTDAPVTFVAGQ
GALRWAFDSTDGTGTLTQQETVYQMHGQKDADLNAGKNLVFNGADGQIVLEDSVNQGAGALTFNGNYTVSTNNGSTWQGA
GLDISRDAEVIWQVNGVQGDNLHKIGEGVLKVNGTGINPGGLKVGDGTVILAQRPDEDGKAQAFSSVNIASGRPTVVLTD
SRQVNPDNISWGFRGGRLDINGNDVTFHKLNAADNGANIINASDTFATVSIKPITDMTVTINDWDKNKASGGAAGLLYKY
NNPYAHTVDYFIQKRKGYGFYPVNQSDNDSWEYVGHNETQAIEQVKSRRPVDDRMYHGNLAGNIDLNIDTSRSSGGVIFD
GNIDTPEGGLMQSGGQLTFQGHPVIHAYNNKGVADKLKSLGDDSVRTQPTSFDQPDWEGRTFRLKTLLLKNTDFGLARNA
SLNGDIEAVHSSVTLGTPNVYIDLNDGNGTKVTPQKGTSVARQDTDRSRYAGKVTLGEQSTLDVREIFTGSIQSQDSAVT
VSSRHATLDGYSRFGNTSLALQEGARLTATGGWWSDSDVIVGPAATLSLAGTSVTGQPGQVSPAFYSTDYGAGYQLDTGS
QLHFSPYTFVTGDIRAKGDTGISIGGEDGVALADNLPLGEQMMYSLFNGFRNVYSGNVSVPQGRMTMTDTQWQMPGDSHT
GALRMIRSLAGFTGRGFNSLTTNTLQANQSAFALRTDLKDSDKIVVNQKAEGRDNTLFVNFLKKPSGQEPLNIPLVSAPA
GTNPAMFKAAERVTGFSLVTPTLHTTEQDGKIQWVLDGFKSAPDKGSATSANSFMGMGYKNFMTEVNNLNKRMGDLRDTQ
GEDGMWVRIMNGAGTGDAGYSDRYTHLQTGFDKKHRLSGADLFTGVLMSYTDSSASGRAYSGDTHSLGGGMYASVMFDSG
IYMDVIGKYIHHDNDYNAGFAGLGKRNYGTHSWYAGLEGGYRYRLTESLYIEPQAELVYGTVSGTTLKWNDNGMDVSMRS
KTYNPLIGRTGVALGKTFSGKDWSVTARTGVDYQFDLVANGETALRDASGEKRFTGEKDSRMLYNVGLNAQVKDNVRFGL
ELEQSAFGKYNVDHAINANFRYMF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
pic NP_838464.1 serine protease precurser Virulence SHI-1 Protein 0.0 53
she AAB58244.1 mucinase Virulence SHI-1 Protein 0.0 52
pic NP_708747.3 serine protease Not tested SHI-1 Protein 0.0 52
pic AAK00464.1 Pic Virulence SHI-1 Protein 0.0 52
unnamed CAD66214.1 putative hemoglobin protease Not tested PAI III 536 Protein 0.0 51
vat YP_851472.1 vacuolating autotransporter Not tested PAI III APEC-O1 Protein 0.0 51
vat AAO21903.1 vacuolating autotransporter toxin Virulence Not named Protein 0.0 51
unnamed CAC39286.1 hypothetical protein Not tested LPA Protein 0.0 46

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
SBG_1083 YP_004729964.1 hemoglobin protease VFG0903 Protein 0.0 52
SBG_1083 YP_004729964.1 hemoglobin protease VFG0635 Protein 0.0 52
SBG_1083 YP_004729964.1 hemoglobin protease VFG0861 Protein 0.0 52
SBG_1083 YP_004729964.1 hemoglobin protease VFG1689 Protein 0.0 51
SBG_1083 YP_004729964.1 hemoglobin protease VFG0904 Protein 0.0 51