Gene Information

Name : SBG_3912 (SBG_3912)
Accession : YP_004732695.1
Strain : Salmonella bongori NCTC 12419
Genome accession: NC_015761
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG1112
EC number : -
Position : 4360324 - 4363839 bp
Length : 3516 bp
Strand : +
Note : -
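
The Position and Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for GenBank-style records); the helper name `feature_length` is hypothetical and not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end coordinate must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Position field of this record.
length_bp = feature_length(4360324, 4363839)

# A complete coding sequence should be a whole number of codons.
codons, remainder = divmod(length_bp, 3)

print(length_bp, codons, remainder)
```

Under that assumption the span works out to 3516 bp, matching the Length field, which corresponds to 1172 codons including the stop codon.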

DNA sequence :
ATGGATGAAAATGCTTTAGGGTTTGCCTCATACTGGCGCAACTCGCTGGCAGATGCTGAGTCAGGAAAGGGCAGTTTTGA
ACGCAAAGACGCCAAAAATTTCGCTCACTGGCATGGGATAGCGGCGGGACGTCTTGACGAAGCGATCGTCGATAAATTTT
TTGAAGGAGAAAAAGACGACGTCGAAACAGTCGATGTCATCTTGCGGCCAAAGGTTTATTTCCGGTTACTGCAGCATGGA
AAGGACCGTTCCGCTGGTGCGCCTGATATTGTTACCCCGATAGTGACGCCAGCCCTGTTGAGCCGTGAAGGTTTTTTATA
TCCGACGCCAGCGACCTCCATTCCCAGAGACCTGCTTGAACCTTTGCCAAAAGGGGCATTTTCGATTGGTGAGATTGAGC
AGTATGACAAATACAAAACGACACATACCTCATTCTCTATCAACTTTGATGACAGCGTTGATAAGACCGCCGAAACAGAT
GAAGAACGAGAAGCACGATATGCCGCCTTACAGCAGGAGTGGCGTCAATATCTGGATGATTCAGAGAGGCTACTGAAGAA
TGTTGTCGGCGACTGGATTAAAAATCCCGAGCAATATGAACTCGCTGAGCATGGTTATATTGTTAAAACGGCGCAATCTG
GCGGCGCCAGTTTCCATATTCTTTCGCTTTATGACCACCTGCTTGTTTGCAAAAAGGATGTGCCGCTTTTCAATCGTTTT
GCCTCTCGGGAGGTTCATGCGGCAGAGTCTTTGCTTACTCCCGAAGCAAAATTCAGCGACAGGCTTGGACACTCCGGGGA
TAAGTTCCCGCTGGCACAGGCTCAGCGCGATGCCTTAAGCCATTTTCTTGATGCGAGGCATGGCGATATCCTTGCCGTTA
ACGGCCCTCCCGGAACCGGGAAAACCACGCTGGTGCTTTCTATCATCGCCACGCAGTGGGCCAGAGCGGCTCTCGAAAAA
GCAGAACCTCCGGTTATTATCGCGACTTCAACGAATAACCAGGCTGTAACGAACATTATCGAGGCGTTCGGGAAAGATTT
TTCCCAAGGCACTGGTGCAATGGCCGGACGATGGTTGCCTGAGCTGAAAAGCTTCGGCGCTTATTTCCCCTCAAGCACTC
GTAAAGCCGAGGCAGCCAAAAAATACCAAACTGAAGATTTCTTCAACCAGGTTGAGTCAAAAGAGTATGTAGAGAATGCA
CTGCTGTTTTATCTGGAGAAAGCTAAGGCAGCCTTCCCTGAAAAAGATTGTTCAACCCCTGAAAAGGTCATTGAACTCCT
GCATGGTCAGTTGGCAGCAAAATACGAACAACTGGTAAGACTAAACGCAGCATGGCAAACGTTAAGCCAGGTTCGGGCTG
CACGTGAGCTTATTGCTAACGACATTGAGCAATATCTCGATAATTTAAATAAATTACTCTCCGGGCAAGAACAAAAAGTC
ACTCAACTAAAGAGTGCTAAAGCGGAATGGAAAAAATATCGCGCCGGTGAATCACTGATCTATTCATTATTTTCCTGGCT
CCCGGCGGTTCGCAGTAAACGGCAGTACCAAATACAGCTGTTCCTCGAAGATAAATTAGGTGCGCTGATTGCGGGAAATC
AGTGGTTTGATCCTGAAACCATCGAACGTAATATAGATGGACTGCTCAACTCCGCTGAGCGCGAGCAAACAACCTACCGA
CAGCAGATTGACTCCGCGCATGAAATCGTTCTTAAAGAACAGCAGGCGGCTCAGGAATGGCAGAGGCTGGCACTTGATTT
AGGACATGAGAGCGACGAGGAACTGAGCTTCTCACAGGCAGATGAACTGGCTGATACGCAGATTCGCTTCCCTGCATTCT
TACTGACGACCCACTACTGGGAAGGTCGTTGGCTAATGGATATGGCGAAGATCGATGATCTGCAAAAAGAGAAGGGCAAG
AAAGGAGCTAAAGGGGTAACCGTCCGCTGGCAACGCCGAATGAAACTCACACCATGTGTGGTCATGACCTGCTATATGCT
GCCCGGCAATATGCAGATAAGTGAACACAAAGGGCAGCGTAAATTCGAGAAAAGCTATTTATATGACTTCGCTGATTTAC
TCATTGTCGATGAAGCCGGGCAGGTGCTTCCTGAAGTGGCTGCTGCCTCGTTTGCCTTAGCTAAAAAGGCATTAGTGATT
GGTGATACGGAACAGATCCCGCCAATATGGAGTATTGCTCCTGCTATTGATATAGGTAACATGCTGGCGGAAAAAATTCT
GTCAGGCAGTACGCAAGAAGAGATTACTGAGAAATATGCGGCAATCGCAGAGCTTGGTAAAAGCGCCGCATCTGGTAGCG
TCATGAAAATAGCGCAGTGTGCTTCACGCTATCAATATGATCCCGAATTGGCTCGTGGTATGTACTTATATGAGCACCGC
CGGTGCTTCGATAATATTATTGGATACTGCAATACGCTCTGCTATCACGGTAAGTTGTTGCCTAAAAGAGGGTGTGAAGA
GAACAATTTAATGCCCGCAATGGGTTATCTCCATATTGATGGTAAAGGAGAGCTGGCAAGTAGCGGTAGTCGATATAATT
TGCTGGAGGCTGAAACGATAGCGGCCTGGCTGACAGATAACCAGCAAGATATTGAAACGCATTACGGCAAATCGCTTCAT
GAGGTTGTCGGTATCGTGACGCCTTTTAGCGCGCAGGTATCGACCATCAAACAGGCGCTGGGTAAACAAGGTATCAGCAC
TGGCGCGAATGAAACGTCGCTCACGGTGGGCACCGTGCATTCTCTTCAGGGGGCGGAAAGAGCGATTGTTTTATTCTCGC
CAGTCTATTCAAAGCATGAGGACGGTGGGTTTATTGATAGTGATAACAGCATGCTGAACGTTGCTGTCTCCCGTGCGAAG
GACAGTTTCCTGGTCTTCGGCGATATGGACCTGTTTGAGATCCAGCCAGCCTCATCTCCGCGGGGATTACTGGCAAAATA
TCTCTTTGAGTCAGAGAAGAATGCGCTCACTTTTGATTATAAAGAGCGTAAGGATTTAAAAACTTCCGAGACCAAAATCT
ACACACTCCATGGTGTGGAGCAGCATGATAATTTCCTGAATCAGACATTTGAAAATACCGGTAAACACATCACGATAGTT
TCTCCATGGCTGACCTGGCAAAAACTGGAGCAAACCGGTTTTCTTGATTCTATGATTGCGGCGTGTTCACGTGGTATTAA
CGTCACGATTGTCACTGACAGAAGCTACAACACTGAACATAATGATTTTGAGAAGCGAAAAGAGAAGCAGCAGAATCTCA
AAGCGGCGCTGGATAAACTGAACGCCCTTGGTATTGCGACAAAACTGGTCAATCGTGTTCATAGCAAAATTGTTATTGGG
GATGATGGTTTGCTGTGCGTGGGATCGTTCAACTGGTTTAGTGCGACGCGTGAAGCGCGATATGAACGATACGATACATC
GATGGTTTATTGCGGTGATAACCTGAAGGGGGAGATTGAGGCAATTTATAATAGCCTTGAAAGGCGTCAGGTTTAG

Protein sequence :
MDENALGFASYWRNSLADAESGKGSFERKDAKNFAHWHGIAAGRLDEAIVDKFFEGEKDDVETVDVILRPKVYFRLLQHG
KDRSAGAPDIVTPIVTPALLSREGFLYPTPATSIPRDLLEPLPKGAFSIGEIEQYDKYKTTHTSFSINFDDSVDKTAETD
EEREARYAALQQEWRQYLDDSERLLKNVVGDWIKNPEQYELAEHGYIVKTAQSGGASFHILSLYDHLLVCKKDVPLFNRF
ASREVHAAESLLTPEAKFSDRLGHSGDKFPLAQAQRDALSHFLDARHGDILAVNGPPGTGKTTLVLSIIATQWARAALEK
AEPPVIIATSTNNQAVTNIIEAFGKDFSQGTGAMAGRWLPELKSFGAYFPSSTRKAEAAKKYQTEDFFNQVESKEYVENA
LLFYLEKAKAAFPEKDCSTPEKVIELLHGQLAAKYEQLVRLNAAWQTLSQVRAARELIANDIEQYLDNLNKLLSGQEQKV
TQLKSAKAEWKKYRAGESLIYSLFSWLPAVRSKRQYQIQLFLEDKLGALIAGNQWFDPETIERNIDGLLNSAEREQTTYR
QQIDSAHEIVLKEQQAAQEWQRLALDLGHESDEELSFSQADELADTQIRFPAFLLTTHYWEGRWLMDMAKIDDLQKEKGK
KGAKGVTVRWQRRMKLTPCVVMTCYMLPGNMQISEHKGQRKFEKSYLYDFADLLIVDEAGQVLPEVAAASFALAKKALVI
GDTEQIPPIWSIAPAIDIGNMLAEKILSGSTQEEITEKYAAIAELGKSAASGSVMKIAQCASRYQYDPELARGMYLYEHR
RCFDNIIGYCNTLCYHGKLLPKRGCEENNLMPAMGYLHIDGKGELASSGSRYNLLEAETIAAWLTDNQQDIETHYGKSLH
EVVGIVTPFSAQVSTIKQALGKQGISTGANETSLTVGTVHSLQGAERAIVLFSPVYSKHEDGGFIDSDNSMLNVAVSRAK
DSFLVFGDMDLFEIQPASSPRGLLAKYLFESEKNALTFDYKERKDLKTSETKIYTLHGVEQHDNFLNQTFENTGKHITIV
SPWLTWQKLEQTGFLDSMIAACSRGINVTIVTDRSYNTEHNDFEKRKEKQQNLKAALDKLNALGIATKLVNRVHSKIVIG
DDGLLCVGSFNWFSATREARYERYDTSMVYCGDNLKGEIEAIYNSLERRQV
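
The relationship between the DNA and protein sequences above can be spot-checked by translating a few codons. This is a minimal sketch, assuming the standard genetic code applies to this + strand gene; the function name `translate` and the compact codon-table construction are illustrative choices, not taken from the record. Only the first 21 bases and the last 6 bases of the DNA sequence are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: the
# standard table is appropriate for this bacterial gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First seven codons and last two codons of the DNA sequence in this record.
n_terminus = translate("ATGGATGAAAATGCTTTAGGG")
c_terminus = translate("GTTTAG")

print(n_terminus, c_terminus)
```

The first fragment translates to MDENALG, the start of the protein sequence, and the final two codons give V followed by a stop, consistent with the protein ending in V.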

• Homologs from PAI DB

Gene         GenBank Accn    Product                     Virulence or Resistance  PAI or REI     Alignment Type  E-val  Identity
ORF_2        AAZ04413.1      superfamily I DNA helicase  Not tested               PAI I APEC-O1  Protein         0.0    96
APECO1_3532  YP_854230.1     superfamily I DNA helicase  Not tested               PAI I APEC-O1  Protein         0.0    96
S3169        NP_838460.1     superfamily I DNA helicase  Not tested               SHI-1          Protein         0.0    95
SF2965       NP_708739.1     superfamily I DNA helicase  Not tested               SHI-1          Protein         0.0    95
unnamed      CAD42018.1      hypothetical protein        Not tested               PAI II 536     Protein         0.0    95

• Homologs from VFDB (virulence genes)

Gene      GenBank Accn    Product               ID of source DB    Alignment Type  E-val  Identity
SBG_3912  YP_004732695.1  hypothetical protein  VFG0627            Protein         0.0    95
SBG_3912  YP_004732695.1  hypothetical protein  VFG1537            Protein         0.0    95