Gene Information

Name : STU288_22510 (STU288_22510)
Accession : YP_007905581.1
Strain : Salmonella enterica U288
Genome accession: NC_021151
Putative virulence/resistance : Virulence
Product : superfamily I DNA helicase
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4726843 - 4730358 bp
Length : 3516 bp
Strand : +
Note : COG1112 Superfamily I DNA and RNA helicases and helicase subunits
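
The Position and Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this, but it matches the listed length); the function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
start, end = 4726843, 4730358
length_bp = feature_length(start, end)

# A complete coding sequence should be a whole number of codons;
# the final codon is the stop codon and is not translated.
codons, remainder = divmod(length_bp, 3)
protein_len = codons - 1

print(length_bp, remainder, protein_len)
```

Under that assumption the computed length agrees with the Length field (3516 bp), and the implied protein length is 1171 residues.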

DNA sequence :
ATGGATGACAATGCTTTAGGGTTTGCCTCATACTGGCGCAATTCGCTGGCAGATGCTGAGTCAGGAAAGGGCAGTTTTGA
ACGCAAAGACGCCAAAAATTTCACCCACTGGCATGGGATAGCGGCGGGACGTCTTGACGAAACGATCGTTGGTAAATTTT
TTGAGGGAGAAAAAGACGACGTCGAAACAGTCGATGTCATCTTGCGGCCAAAGGTTTATTTCCGGTTACTGCAGCGTGGA
AAGGACCATTCCGCTGGTGCGCCTGATATTGTTACCCCGATAGTGACGCCAGCCCTATTGAGCCGTGAAGGTTTTTTATA
TCCGACGCCAGCGACCTCCATTCCCAGAGACCTGCTTGAACCTTTGCCAAAAGGGGCATTTTCAATTGGTGAGATTGAGC
AGTACGACAAATACAAAACGACACATACGTCATTCTCTATCAACTTTGATGACCGCGTTGATAAGACCGCCGAAACAGAT
GAAGAACGAGAAGCACGATATGCAGCCTGGCAGCAGGATTGGCGTCAATATCTGGATGATTCAGAAAGGCTGCTGAAGAA
CGTTGTCGGCGACTGGATTAAAAATCCTGAGCAATATGAACTCGCTGAGCACGGTTATATTGTTAAAACGGCGCAATCTG
GCGGCGCCAGTTTCCATATCCTTTCACTTTATGATCACCTGCTTGTTTGCAAAAAGGATGTGCCGCTCTTCAATCGTTTC
GCCTCGCGAGAGGTTCATGCTGCAGAGTCATTACTTCCTCCGGAAGCAAAATTCAGCGACAGGATTGGACACTCCGGGGA
TAAGTTTCCGCTGGCAAAGGCTCAGCGCGATGCCTTAAGCCATTTTCTGGATGCGAGGCATGGCGATATCCTTGCCGTTA
ATGGTCCCCCGGGAACCGGAAAAACCACGCTGGTGCTTTCTATCATCGCCACGCAGTGGGCCCGAGCGGCTCTCGAAAAA
GCGGAACCTCCGGTTATTATCGCGACTTCAACGAATAACCAAGCTGTAACGAACATTATCGAGGCGTTCGGGAAAGATTT
TTCCCAAGGCACTGGTGCAATGGCCGGACGATGGTTGCCGGAGCTGAAAAGCTTCGGCGCTTATTTTCCCTCAAGCACTC
GTAAAGCCGAGGCCGCCAAAAAATATCAAACTGAAGATTTCTTCAACCAGGTTGAGTCAAAAGAGTATGTAGAGGATGCA
CTGCTGTTTTATCTGGAGAAAGCTAAGGCAGCCTTTCCTGAAAAAGAGTGTTCATCCCCTGAAAAGGTTATTGAACTCCT
GCATGGTCAGTTGGCAGCAAAATCCGAGCAACTGGTAAGACTGAACGCAACATGGCAAACGTTAAGCCAGGTTCGGGCAA
CGCGAGAGCTTATTGATAACGACATTGAGCAATATCTCGATAATTTAAATAAATTACTCTCCGGGCAAGAACAAAAAGTT
ACTCAACTAAAAAGTGCTAAAGCGGAATGGAAAAAATATCGGGCCAGTGAATCACTGATCTATCCATTATTTTCATGGCT
ACCAGTGGTTCGCAGTAAGCGGCAGTACCAAATACAACTGTTTCTCGAAGATAAATTAGGTGCGCTGATTGCGGGAAATC
AATGGTCGGATCCTGAAACCATCGAACGTAATATCGATAGGTTGCTTAATTCCGCCGAGCGCGAGCAAACAACCTACCGG
CAGCAGATTGACTCCGCGCATGAAATCGTTCTTAAAGAACAGCAGGCGGCTCAGGAATGGCAGAGGCTGGCACTTGATTT
AGGGCATGAGGGCGACGAGGAACTGAGCTTCTCACAGGCAGATGAGCTGGCTGATACGCAGATTCGCTTCCCTGCATTCT
TACTGACGACCCACTACTGGGAAGGTCGTTGGTTGATGGATATGGCCGGCATTGATGATCTGCAGAAAGAAAAGGGCAAG
AAAGGTGCTAAAGGGGTAACAGCTCGCTGGCAACGCCGAATGAAACTTACCCCATGCGTGGTCATGACCTGCTATATGCT
GCCCGGCAATATGCAGATAAGTGAACATAAAGGGCAGCGTAAATTCGAGAAAAGCTATTTATATGACTTCGCCGATTTAC
TCATTGTCGATGAAGCTGGGCAGGTGCTTCCTGAAGTGGCTGCTGCCTCGTTTGCCTTAGCTAAAAAGGCATTAGTGATT
GGTGATACGGAACAGATCCCGCCAATATGGAGTATTACTCCTGCTATTGATATAGGTAACATGCTGGCGGAAAAAATTCT
GTCAGGCAGTACGCAAGAGGAGATTACTGAGAAATATACGGCAATCGCAGAGCTTGGTAAAAGCGCCGCATCTGGCAGCG
TCATGAAAATAGCGCAGTGTGCCTCACGCTATCAATATGATCCCGAACTGGCTCGTGGAATGTACTTATATGAACACCGC
CGGTGCTTCGATAATATTATTGGATACTGCAATACGCTCTGCTATCACGGTAAGTTGTTGCCTAAAAGAGGGTGTGAAGA
GAGCAATTTAATGCCAGCAATGGGTTATCTCCATATTGATGGTAAAGGAGAGCTGGCAAGTAGCGGAAGTCGATATAATT
TGCTGGAGGCTGAAACGATAGCGGCCTGGCTGACAGATAACCAGCAAAGTATTGAAGCGCATTATGGTAAATCGCTTCAT
GAAGTTGTCGGTATCGTGACGCCTTTTAGTGCGCAGGTACCGACCATCAAACAGGCGCTGGATAAACAAGGCATCAGCGC
AGGCACCAATGAAACGTCGCTCACGGTGGGCACAGTCCATTCTCTTCAGGGCGCTGAAAGAGCGATTGTTATATTCTCGC
CAGTCTATTCAAAGCATGAAGACGGCGCGTTTATTGATAGCGATAACAGCATGCTGAATGTTGCTGTCTCCCGAGCTAAG
GACAGTTTCCTGGTCTTCGGCGATATGGACCTGTTTGAGATTCAGCCAGCCTCATCTCCGCGGGGATTACTGGCAAAATA
TCTCTTTGAGTCAGAGAAGAATGCGCTCACTTTTGATTATAAAGAGCGTAAGGATTTAAAAACTGCCGAGACCAAAATCT
ACACACTCCATGGTGTGGAGCAGCATGATAATTTCCTGAATCAGACGTTTGAAAATACCGATAAACACATCACGATAGTT
TCTCCATGGCTAACCTGGCAAAAACTGGAGCAAACCGGTTTTCTTGATTCCATGATTACGGCGTGTTCACGTGGTATTAA
CGTCACGGTAGTCACTGACAGAAGCTACAACACTGAACATAATGATTTTGAGAAGCGAAAAGAGAAGCAGCAGAACCTTA
AAGCGGCGCTGGAGAAACTGAACGCCCTTGGTATTGCGACAAAACTGGTCAATCGTGTTCATAGCAAAATTGTTATTGGT
GATGATGGTTTGCTGTGCGTGGGATCGTTCAACTGGTTTAGCGCGACACGTGAAGCGCGATATGAACGATACGATACATC
GATGGTTTATTGCGGTGATAACCTGAAGGGTGAGATTGAGGCTATTTATAATAGTCTTGATAGGCGTCAGGTTTAG

Protein sequence :
MDDNALGFASYWRNSLADAESGKGSFERKDAKNFTHWHGIAAGRLDETIVGKFFEGEKDDVETVDVILRPKVYFRLLQRG
KDHSAGAPDIVTPIVTPALLSREGFLYPTPATSIPRDLLEPLPKGAFSIGEIEQYDKYKTTHTSFSINFDDRVDKTAETD
EEREARYAAWQQDWRQYLDDSERLLKNVVGDWIKNPEQYELAEHGYIVKTAQSGGASFHILSLYDHLLVCKKDVPLFNRF
ASREVHAAESLLPPEAKFSDRIGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGPPGTGKTTLVLSIIATQWARAALEK
AEPPVIIATSTNNQAVTNIIEAFGKDFSQGTGAMAGRWLPELKSFGAYFPSSTRKAEAAKKYQTEDFFNQVESKEYVEDA
LLFYLEKAKAAFPEKECSSPEKVIELLHGQLAAKSEQLVRLNATWQTLSQVRATRELIDNDIEQYLDNLNKLLSGQEQKV
TQLKSAKAEWKKYRASESLIYPLFSWLPVVRSKRQYQIQLFLEDKLGALIAGNQWSDPETIERNIDRLLNSAEREQTTYR
QQIDSAHEIVLKEQQAAQEWQRLALDLGHEGDEELSFSQADELADTQIRFPAFLLTTHYWEGRWLMDMAGIDDLQKEKGK
KGAKGVTARWQRRMKLTPCVVMTCYMLPGNMQISEHKGQRKFEKSYLYDFADLLIVDEAGQVLPEVAAASFALAKKALVI
GDTEQIPPIWSITPAIDIGNMLAEKILSGSTQEEITEKYTAIAELGKSAASGSVMKIAQCASRYQYDPELARGMYLYEHR
RCFDNIIGYCNTLCYHGKLLPKRGCEESNLMPAMGYLHIDGKGELASSGSRYNLLEAETIAAWLTDNQQSIEAHYGKSLH
EVVGIVTPFSAQVPTIKQALDKQGISAGTNETSLTVGTVHSLQGAERAIVIFSPVYSKHEDGAFIDSDNSMLNVAVSRAK
DSFLVFGDMDLFEIQPASSPRGLLAKYLFESEKNALTFDYKERKDLKTAETKIYTLHGVEQHDNFLNQTFENTDKHITIV
SPWLTWQKLEQTGFLDSMITACSRGINVTVVTDRSYNTEHNDFEKRKEKQQNLKAALEKLNALGIATKLVNRVHSKIVIG
DDGLLCVGSFNWFSATREARYERYDTSMVYCGDNLKGEIEAIYNSLDRRQV
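
The protein sequence above corresponds to the translation of the DNA sequence. The following is a minimal sketch of that relationship, assuming the standard codon assignments (which the bacterial translation table shares for elongation); the helper name `translate` is hypothetical, and only the first and last few codons of the record are used as input.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the DNA sequence in this record.
n_term = translate("ATGGATGACAATGCTTTAGGGTTTGCCTCA")
c_term = translate("CGTCAGGTTTAG")
print(n_term, c_term)
```

The two fragments translate to the start (MDDNALGFAS) and end (RQV) of the listed protein sequence; passing the full DNA sequence to the same function would be the natural next check.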

• Homologs from PAI DB

Gene         GenBank Accn  Product                     Virulence or Resistance  PAI or REI     Alignment Type  E-val  Identity (%)
APECO1_3532  YP_854230.1   superfamily I DNA helicase  Not tested               PAI I APEC-O1  Protein         0.0    96
ORF_2        AAZ04413.1    superfamily I DNA helicase  Not tested               PAI I APEC-O1  Protein         0.0    96
S3169        NP_838460.1   superfamily I DNA helicase  Not tested               SHI-1          Protein         0.0    95
SF2965       NP_708739.1   superfamily I DNA helicase  Not tested               SHI-1          Protein         0.0    95
unnamed      CAD42018.1    hypothetical protein        Not tested               PAI II 536     Protein         0.0    95

• Homologs from VFDB (virulence genes)

Gene          GenBank Accn    Product                     ID of source DB  Alignment Type  E-val  Identity (%)
STU288_22510  YP_007905581.1  superfamily I DNA helicase  VFG1537          Protein         0.0    95
STU288_22510  YP_007905581.1  superfamily I DNA helicase  VFG0627          Protein         0.0    95