Gene Information

Name : SBG_1083 (SBG_1083)
Accession : YP_004729964.1
Strain : Salmonella bongori NCTC 12419
Genome accession: NC_015761
Putative virulence/resistance : Virulence
Product : hemoglobin protease
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3468
EC number : -
Position : 1200390 - 1204544 bp
Length : 4155 bp
Strand : +
Note : -
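
The Position and Length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption that is consistent with the listed length of 4155 bp); the function name `feature_length` is hypothetical and not part of any database tooling.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(1200390, 1204544)

# A complete coding sequence is a whole number of codons;
# the final codon is the stop codon and encodes no residue.
codons = length_bp // 3
residues = codons - 1

print(length_bp, codons, residues)
```

Under that assumption the gene spans 1385 codons, so the translated product should contain 1384 residues.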

DNA sequence :
ATGAATAAAATATACGCTCTGAAGTATAGCGTCAGGCAAGGCGCGCTTGTGCCTGTTTCCGAACTGGCAACCCACGTAAA
AAAATCATCTCGCACAGGTTTGATAAAAAAAATTATTCCGTCGCTGCTCATTAATACCATTCTGTTAGGGTATTCGGTCT
CATCGTTGGCCTCAGTAGTCAGGTATGATCTCCCTTACCAGACGATAAGAGATTTTTCTGAGAATAAAGGGCAATTCACG
CCGGGCAGTTTGAATATTCCCATTTACAACAAAACAGACGCGATTATTGGCTATTTGAATAAAGCGCCAATGCCTGATTT
CAGCAGTGCCAATCATCAGTCCGCGGTTGCTACTCTTGTTTCACCACAGTATATCGTAAGCGTAAAACATAATGGCGGTT
ATCAAAGCGTCAGTTTTGGTGATGGCGAAAATAAATATCGTCTTGTTGACAGAAATAATCAGCCGGGACGAGATTTTCAT
GCCCCACGTTTAAATAAGTTAGTTACAGAAGTTGAACCTTCTTTGATGACGCAGTCAGGGATGGTGTCTGGCGCTTATAG
TGATAAAAATCGCTATCCCACATTCTACCGGATAGGGTCCGGGACCCAGGAAATTAAACAGACAGACGGGCAAATCATCT
CCCTTTCCGGCGCATACAATTACCTGACCGGAGGAACGGCGGGTTCACTGGGGTCCTATGCTCAGGGCCAGATGATCAGT
GCCAATACCAATAAACAGTTGTATAACCTGGCGCAGGGTCCAATGGGAACCCATCCCCGCAGTGGTGATAGCGGTTCACC
TTTATTTGCCTATGATTCTGTTTTACAACAGTGGGTCATTGTTGGCGTGGATAGCTCCGGCGGTGGCGGAGGAACTAACT
GGACGGTGGTTGATGCGGATTTTGTAAACCAGTCCATTCAGGAAGATACCGATGCGCCAGTAACCTTTGTGGCCGGGCAG
GGGGCCTTACGCTGGGCATTTGACTCGACAGACGGCACGGGGACTCTTACCCAGCAAGAAACTGTTTATCAGATGCATGG
TCAGAAAGACGCAGACCTGAATGCGGGTAAGAATCTGGTATTTAATGGTGCTGATGGCCAGATTGTCCTGGAAGACTCGG
TTAACCAGGGGGCGGGGGCGCTGACGTTTAACGGCAATTATACAGTATCCACTAATAACGGCTCTACCTGGCAGGGCGCA
GGTCTGGATATTTCCCGGGATGCGGAGGTGATATGGCAGGTGAACGGGGTTCAGGGCGACAACCTGCATAAGATTGGCGA
GGGAGTGCTGAAGGTCAATGGTACCGGTATTAACCCGGGCGGGCTGAAGGTCGGGGATGGGACGGTTATTCTGGCTCAGC
GTCCGGATGAGGACGGCAAGGCACAGGCCTTCAGTTCGGTGAATATCGCCAGTGGCCGACCCACGGTGGTTCTTACCGAC
AGCCGGCAGGTTAATCCCGATAACATTAGCTGGGGGTTCAGGGGAGGCCGGCTTGATATAAACGGCAATGACGTCACTTT
TCATAAACTTAACGCCGCGGATAATGGTGCGAATATCATTAATGCCAGCGATACCTTTGCCACCGTTTCAATTAAGCCCA
TCACGGACATGACGGTGACGATTAATGACTGGGATAAAAATAAAGCTTCCGGCGGTGCTGCCGGGTTATTGTATAAATAT
AATAATCCTTACGCCCATACAGTGGATTATTTTATCCAGAAGAGGAAAGGATATGGATTTTATCCTGTCAATCAGTCAGA
TAACGACAGTTGGGAGTATGTCGGGCACAATGAGACGCAGGCCATTGAACAGGTAAAATCGCGGCGGCCCGTCGATGATC
GGATGTATCATGGTAATCTTGCTGGCAATATTGATCTTAATATTGATACCTCTCGCAGTAGCGGCGGCGTTATTTTTGAT
GGCAATATAGATACGCCTGAGGGCGGATTAATGCAGTCTGGGGGCCAACTGACGTTCCAGGGCCATCCGGTTATCCACGC
CTATAATAATAAAGGAGTGGCAGATAAGCTTAAATCTCTTGGTGACGATTCGGTCAGGACTCAACCCACCTCTTTTGACC
AGCCTGACTGGGAGGGCCGGACATTTCGTCTGAAAACGCTGTTGCTGAAAAATACCGATTTCGGCCTGGCCAGAAACGCG
TCGCTGAACGGAGATATTGAGGCAGTGCATTCGTCCGTGACGCTGGGTACGCCGAATGTGTATATCGATCTGAATGACGG
GAATGGCACAAAAGTAACGCCACAAAAGGGAACCTCAGTTGCCAGGCAAGACACGGACAGGAGCCGCTATGCCGGGAAAG
TGACGCTTGGCGAACAATCAACTCTGGACGTACGTGAAATTTTCACCGGCAGTATCCAAAGTCAGGATAGCGCTGTCACG
GTGTCCTCCCGCCATGCGACACTGGATGGTTACAGCCGGTTCGGCAACACGTCACTGGCTCTTCAGGAGGGAGCCCGATT
GACCGCTACCGGTGGGTGGTGGAGTGATTCAGACGTCATTGTCGGACCAGCCGCTACGTTGAGTCTGGCCGGTACGTCCG
TAACCGGCCAGCCGGGGCAGGTGAGTCCGGCGTTTTATTCCACCGATTATGGCGCAGGTTATCAACTGGATACCGGCAGC
CAGTTACACTTCTCTCCCTACACTTTTGTCACGGGTGATATTCGGGCAAAGGGGGATACAGGGATCTCTATTGGTGGTGA
GGACGGTGTTGCCCTGGCAGATAACCTGCCCCTGGGGGAACAGATGATGTACAGTCTGTTCAATGGCTTTCGAAACGTTT
ATTCGGGTAATGTCAGCGTCCCGCAAGGGCGAATGACAATGACGGATACGCAGTGGCAAATGCCCGGTGATTCTCATACC
GGCGCACTGCGTATGATACGGTCGCTGGCCGGCTTTACCGGTCGCGGATTTAATTCCCTGACTACGAATACGTTACAGGC
CAACCAGTCCGCTTTTGCGCTCAGAACGGACCTGAAGGACAGCGACAAAATTGTGGTGAACCAGAAAGCAGAGGGCCGGG
ATAACACCCTGTTTGTGAATTTCCTGAAAAAACCATCCGGGCAGGAGCCTCTGAATATTCCACTGGTCAGCGCCCCGGCG
GGGACAAATCCGGCGATGTTTAAGGCCGCCGAGCGGGTGACCGGGTTTAGTCTGGTGACGCCGACTCTGCACACGACAGA
ACAGGATGGCAAAATACAGTGGGTACTGGATGGCTTTAAGTCCGCGCCGGACAAGGGGTCAGCCACCTCGGCCAACAGCT
TTATGGGCATGGGATATAAAAACTTCATGACCGAAGTCAACAACCTGAACAAGCGTATGGGGGATCTGCGTGATACTCAG
GGCGAGGACGGAATGTGGGTACGTATCATGAACGGCGCCGGAACCGGTGACGCCGGATATTCTGATCGTTACACCCATCT
GCAAACGGGGTTTGATAAAAAACACCGGTTGTCAGGTGCTGACTTGTTCACTGGTGTGTTGATGAGTTATACCGACAGCA
GCGCCAGTGGACGGGCCTACAGCGGCGACACGCATTCGCTCGGGGGTGGGATGTACGCATCCGTGATGTTTGATTCGGGG
ATATATATGGATGTTATCGGCAAGTATATTCATCATGATAATGACTATAACGCCGGTTTTGCTGGTCTGGGCAAACGGAA
TTACGGTACACACTCATGGTATGCTGGCCTGGAAGGCGGATACCGTTACCGTCTGACAGAAAGCCTGTATATTGAGCCGC
AGGCGGAACTGGTATATGGAACCGTCTCCGGAACAACGCTGAAATGGAATGATAATGGTATGGATGTGTCGATGCGCAGC
AAAACGTATAATCCGTTGATAGGGCGTACAGGCGTGGCATTGGGCAAAACGTTCAGTGGCAAGGACTGGAGCGTTACGGC
CCGTACAGGTGTGGATTACCAGTTTGACCTGGTGGCCAATGGCGAGACGGCGCTACGCGATGCCTCCGGCGAGAAACGTT
TCACTGGTGAAAAAGACAGCAGAATGCTGTACAACGTGGGGCTGAATGCGCAGGTGAAGGACAATGTGCGCTTTGGACTG
GAGCTGGAGCAGTCGGCATTTGGCAAATATAATGTTGACCATGCCATAAACGCCAACTTCCGCTACATGTTCTGA

Protein sequence :
MNKIYALKYSVRQGALVPVSELATHVKKSSRTGLIKKIIPSLLINTILLGYSVSSLASVVRYDLPYQTIRDFSENKGQFT
PGSLNIPIYNKTDAIIGYLNKAPMPDFSSANHQSAVATLVSPQYIVSVKHNGGYQSVSFGDGENKYRLVDRNNQPGRDFH
APRLNKLVTEVEPSLMTQSGMVSGAYSDKNRYPTFYRIGSGTQEIKQTDGQIISLSGAYNYLTGGTAGSLGSYAQGQMIS
ANTNKQLYNLAQGPMGTHPRSGDSGSPLFAYDSVLQQWVIVGVDSSGGGGGTNWTVVDADFVNQSIQEDTDAPVTFVAGQ
GALRWAFDSTDGTGTLTQQETVYQMHGQKDADLNAGKNLVFNGADGQIVLEDSVNQGAGALTFNGNYTVSTNNGSTWQGA
GLDISRDAEVIWQVNGVQGDNLHKIGEGVLKVNGTGINPGGLKVGDGTVILAQRPDEDGKAQAFSSVNIASGRPTVVLTD
SRQVNPDNISWGFRGGRLDINGNDVTFHKLNAADNGANIINASDTFATVSIKPITDMTVTINDWDKNKASGGAAGLLYKY
NNPYAHTVDYFIQKRKGYGFYPVNQSDNDSWEYVGHNETQAIEQVKSRRPVDDRMYHGNLAGNIDLNIDTSRSSGGVIFD
GNIDTPEGGLMQSGGQLTFQGHPVIHAYNNKGVADKLKSLGDDSVRTQPTSFDQPDWEGRTFRLKTLLLKNTDFGLARNA
SLNGDIEAVHSSVTLGTPNVYIDLNDGNGTKVTPQKGTSVARQDTDRSRYAGKVTLGEQSTLDVREIFTGSIQSQDSAVT
VSSRHATLDGYSRFGNTSLALQEGARLTATGGWWSDSDVIVGPAATLSLAGTSVTGQPGQVSPAFYSTDYGAGYQLDTGS
QLHFSPYTFVTGDIRAKGDTGISIGGEDGVALADNLPLGEQMMYSLFNGFRNVYSGNVSVPQGRMTMTDTQWQMPGDSHT
GALRMIRSLAGFTGRGFNSLTTNTLQANQSAFALRTDLKDSDKIVVNQKAEGRDNTLFVNFLKKPSGQEPLNIPLVSAPA
GTNPAMFKAAERVTGFSLVTPTLHTTEQDGKIQWVLDGFKSAPDKGSATSANSFMGMGYKNFMTEVNNLNKRMGDLRDTQ
GEDGMWVRIMNGAGTGDAGYSDRYTHLQTGFDKKHRLSGADLFTGVLMSYTDSSASGRAYSGDTHSLGGGMYASVMFDSG
IYMDVIGKYIHHDNDYNAGFAGLGKRNYGTHSWYAGLEGGYRYRLTESLYIEPQAELVYGTVSGTTLKWNDNGMDVSMRS
KTYNPLIGRTGVALGKTFSGKDWSVTARTGVDYQFDLVANGETALRDASGEKRFTGEKDSRMLYNVGLNAQVKDNVRFGL
ELEQSAFGKYNVDHAINANFRYMF
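
The protein sequence above corresponds to the DNA sequence read codon by codon on the + strand. As a rough illustration, the sketch below translates a DNA string with the standard genetic code (the bacterial table assigns the same amino acids to codons). The helper name `translate` and the choice to stop at the first stop codon are assumptions made for this example, not something the record specifies.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = dna.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last seven codons of the DNA sequence in this record.
print(translate("ATGAATAAAATATACGCT"))
print(translate("AACTTCCGCTACATGTTCTGA"))
```

The two fragments translate to the first six and last six residues of the listed protein, which suggests the two sequences are in register.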

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
pic | NP_838464.1 | serine protease precursor | Virulence | SHI-1 | Protein | 0.0 | 53
pic | NP_708747.3 | serine protease | Not tested | SHI-1 | Protein | 0.0 | 52
she | AAB58244.1 | mucinase | Virulence | SHI-1 | Protein | 0.0 | 52
pic | AAK00464.1 | Pic | Virulence | SHI-1 | Protein | 0.0 | 52
vat | YP_851472.1 | vacuolating autotransporter | Not tested | PAI III APEC-O1 | Protein | 0.0 | 51
unnamed | CAD66214.1 | putative hemoglobin protease | Not tested | PAI III 536 | Protein | 0.0 | 51
vat | AAO21903.1 | vacuolating autotransporter toxin | Virulence | Not named | Protein | 0.0 | 51
unnamed | CAC39286.1 | hypothetical protein | Not tested | LPA | Protein | 0.0 | 46

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
SBG_1083 | YP_004729964.1 | hemoglobin protease | VFG0861 | Protein | 0.0 | 52
SBG_1083 | YP_004729964.1 | hemoglobin protease | VFG0903 | Protein | 0.0 | 52
SBG_1083 | YP_004729964.1 | hemoglobin protease | VFG0635 | Protein | 0.0 | 52
SBG_1083 | YP_004729964.1 | hemoglobin protease | VFG0904 | Protein | 0.0 | 51
SBG_1083 | YP_004729964.1 | hemoglobin protease | VFG1689 | Protein | 0.0 | 51