Gene Information

Name : SeSA_A4746 (SeSA_A4746)
Accession : YP_002117410.1
Strain : Salmonella enterica CVM19633
Genome accession: NC_011094
Putative virulence/resistance : Virulence
Product : superfamily I DNA helicase
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG1112
EC number : -
Position : 4609593 - 4613108 bp
Length : 3516 bp
Strand : +
Note : identified by match to protein family HMM PF00614
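
The Position and Length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this explicitly, but it is consistent with the listed length). The function name `gene_length` is illustrative and is not part of any database tooling.

```python
# Values copied from the record above; gene_length is a hypothetical helper.
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1

start, end = 4609593, 4613108
length_bp = gene_length(start, end)

# A complete coding sequence should be a whole number of codons.
in_frame = length_bp % 3 == 0

# One codon is the stop codon, so the protein is one residue shorter
# than the codon count.
protein_residues = length_bp // 3 - 1

print(length_bp, in_frame, protein_residues)
```

Under these assumptions the computed length matches the Length field (3516 bp), and the residue count matches the 1171 amino acids of the protein sequence listed below.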

DNA sequence :
ATGGATGACAATGCTTTAGGGTTTGCCTCATACTGGCGCAATTCGCTGGCAGATGCTGAGTCAGGAAAGGGCAGTTTTGA
ACGCAAAGATGCCAAAAATTTCACTCACTGGCATGGGATAGCGGCGGGACGTCTTGACGAAACGATCGTCGGTAAATTTT
TTGAGGGAGAAAAAGACGACGTCGAAACAGTCGATGTCATCTTGCGGCCAAAGGTTTATTTCCGGTTACTGCAGCGTGGA
AAGGACCATTCCGCTGGTGCGCCTGATATTGTTACCCCGATAGTGACGCCAGCCCTGTTGAGCCGTGAAGGTTTTTTATA
TCCGACGCCAGCGACCTCCATTCCCAGAGACCTGCTTGAACCTTTGCCAAAAGGGGCATTTTCAATTGGTGAGATTGGGC
AGTATGATAAATATAAAACGACACATACCTCATTCTCTATCAACTTTGATGACAGCGTTGATAAGACCGCCGAAACAGAT
GAAGAACGAGAAGCACGATATGCAGCCTGGCAGCAGGATTGGCGTCAATATCTGGATGATTCAGAGAGGCTACTGAAGAA
TGTTGTCGGCGACTGGATTAAAAATCCCGAGCAATATGAACTCGCTGAGCACGGTTATATTGTTAAAACGGCGCAATCTG
GCGGCGCCAGTTTCCATATCCTTTCGCTTTATGATCACCTGCTTATTTGCAAAAAGGATGTGCCGCTCTTCAATCGCTTC
GCCTCGCGAGAGGTTCATGCTGCAGAGTCATTACTTCCTCCGGAAGCAAAATTCAGCGACAGACTTGGACACTCCGGGGA
TAAGTTTCCGCTGGCAAAGGCTCAGCGCGATGCCTTAAGCCATTTTCTGGATGCGAGGCATGGCGATATCCTTGCCGTTA
ATGGTCCACCAGGAACCGGGAAAACCACGCTGGTGCTTTCTATCATCGCCACGCAGTGGGCCCGAGCGGCTCTCGAAAAA
GCGGAGCCTCCGGTTATTATCGCGACTTCAACGAATAACCAGGCTGTAACGAACATTATCGAGGCGTTCGGGAAAGATTT
TTCCCAGGGCACTGGTGTAATGGCCGGACGATGGTTGCCAGAGCTGAAAAGCTTCGGCGCTTATTTTCCCTCAAGCACTC
GTAAAGCCGAGGCCGCCAAAAAATATCAAACTGAAGATTTCTTCAACCAGGTTGAGTCAAAAGAGTATGTAGAGGATGCA
CTGCTGTTTTATCTGGAGAAAGCTAAGGCAGCCTTTCCTGAAAAAGAGTGTTCATCCCCTGAAAAGGTTATTGAACTCCT
GCATGGTCAGTTGGCAGCAAAATCCGAACAACTGGTAAGGCTGAACACAACATGGCAAACGTTAAGCCAGGTTCGGGCAG
CGCGTGAGCTTATTGCTAACGACATTAAGCAATATCTCGATAATTTAAATAAATTACTTGTCGGGCAAGAACAAAATGTC
ACTCTACTAAAAAGTGCTAAAGCGGAATGGAAAAAATATCGGGCCAGTGAATCACTGATCTATCCATTATTTTCATGGCT
ACCAGCGGTTCGCAGTAAGCGGCAGTACCAAATACAACTGTTTCTCGAAGATAAATTAGGTGCGCTGATTGCGGGAAATC
AGTGGTCTGATCCTGAAACCATCGAACGTAATATCGATAGGTTACTTAATTCCGCCGAGCGCGAGCAAACAACCTACCGG
CAGCAGATTGACTCCGCGCATGAAATCGTTCTTAAAGAACAGCAGGCGGCTCAGGAATGGCAGAGGCTGGCACTTGATTT
AGGGTATGAGGGGGGCGAGGAACTGAGCTTCTCACAGGCAGATGAGCTGGCTGATACGCAGATTCGCTTCCCTGCATTCT
TACTGGCGACCCACTACTGGGAAGGTCGTTGGTTGATGGATATGGCCAGCATTGATGATCTGCAGAAAGAAAAGGGCAAG
AAAGGTGCTAAAGGAGTAGCCGCTCGCTGGCAACGCCGAATGAAACTTACCCCATGCGTGGTCATGACCTGTTATATGCT
GCCCGGCAATATGCAGATAAGTGAACACAAAGGACAACGTAAATTCGAGAAAAGTTATTTATATGATTTTGCCGATTTAC
TCATTGTCGATGAAGCCGGGCAGGTGCTTCCTGAAGTGGCTGCTGCTTCGTTTGCTCTGGCTAAAAAGGCATTAGTGATT
GGTGATACGGAACAGATCCCGCCAATATGGAGTATTACTCCTGCTATTGATATAGGTAACATGCTGGCGGAAAAAATTCT
GTCAGGCAGTACGCAAGAAGAGATTACTGAGAAATATACGGCAATCGCAGAGCTTGGTAAAAGCGCCGCATCTGGCAGCG
TCATGAAAATAGCGCAGTGTGCTTCACGCTATCAATATGATCCCGAACTGGCACGTGGCATGTACTTATATGAACACCGC
CGGTGCTTCGATAATATTATTGGATACTGCAATACGCTTTGCTATCACGGTAAGTTGTTGCCTAAAAGAGGATGTGAAGA
GAGCAATTTAATGCCCGCAATGGGTTATCTCCATATTGATGGTAAAGGAGAGCTGGCAAGTAGCGGAAGCCGATATAATT
TGCTGGAGGCTGAAACGATAGCGGCCTGGCTGACAGATAACCAGCAAAGTATTGAAGCGTATTATGGTAAATCGCTTCAT
GAAGTTGTCGGTATCGTGACGCCTTTTAGTGCGCAGGTATCGACCATCAAACAGGCGCTGGATAAACAAGGCATCAGCGC
AGGCGCGAATGAAAAGTCGCTTACAGTAGGCACCGTGCATTCTCTTCAGGGCGCTGAAAGAGCGATTGTTATATTCTCGC
CAGTCTATTCAAAGCATGAAGACGGCGCGTTTATTGATAGCGATAACAGCATGCTGAACGTTGCTGTCTCCCGAGCTAAG
GACAGTTTCCTTGTCTTCGGCGATATGGACCTGTTTGAGATCCAGCCAGCCTCATCTCCGCGGGGATTACTGGCAAAATA
TCTCTTTGAGTCAGAGAAAAATGCACTCTTTTTTGACTATAAAGAGCGTGAGGATTTAAAAACTTCCGAGACCAAAATCT
ACACACTCCATGGTGTGGAGCAGCATGATAATTTCCTGAATCAGACGTTTGAAAATACCGGTAAACACATCACGATAGTC
TCTCCATGGCTGACCTGGCAAAAGCTGGAGCAAACCGGTTTTCTTGATTCCATGATTGCGGCGTGTTCACGTGGTATTAA
CGTCACGGTAGTCACTGACAGAAGCTACAACACTGAACATAATGATTTTGAGAAGCGAAAAGAGAAGCAGCAGAACCTCA
AAGCGGCGCTGGAGAAACTGAACGCCCTTGGTATTGCGACAAAACTGGTCAATCGTGTTCATAGCAAAATTGTTATTGGT
GATGATGGTTTGCTGTGCGTGGGATCGTTCAACTGGTTTAGCGCGACACGTGAAGCGCGATATGAACGATACGATACATC
GATGGTTTATTGCGGTGATAACCTGAAGGGCGAGATTGAGGCAATTTACAATAGCCTTGAGAGGCGTCAGGTTTAG

Protein sequence :
MDDNALGFASYWRNSLADAESGKGSFERKDAKNFTHWHGIAAGRLDETIVGKFFEGEKDDVETVDVILRPKVYFRLLQRG
KDHSAGAPDIVTPIVTPALLSREGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTTHTSFSINFDDSVDKTAETD
EEREARYAAWQQDWRQYLDDSERLLKNVVGDWIKNPEQYELAEHGYIVKTAQSGGASFHILSLYDHLLICKKDVPLFNRF
ASREVHAAESLLPPEAKFSDRLGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGPPGTGKTTLVLSIIATQWARAALEK
AEPPVIIATSTNNQAVTNIIEAFGKDFSQGTGVMAGRWLPELKSFGAYFPSSTRKAEAAKKYQTEDFFNQVESKEYVEDA
LLFYLEKAKAAFPEKECSSPEKVIELLHGQLAAKSEQLVRLNTTWQTLSQVRAARELIANDIKQYLDNLNKLLVGQEQNV
TLLKSAKAEWKKYRASESLIYPLFSWLPAVRSKRQYQIQLFLEDKLGALIAGNQWSDPETIERNIDRLLNSAEREQTTYR
QQIDSAHEIVLKEQQAAQEWQRLALDLGYEGGEELSFSQADELADTQIRFPAFLLATHYWEGRWLMDMASIDDLQKEKGK
KGAKGVAARWQRRMKLTPCVVMTCYMLPGNMQISEHKGQRKFEKSYLYDFADLLIVDEAGQVLPEVAAASFALAKKALVI
GDTEQIPPIWSITPAIDIGNMLAEKILSGSTQEEITEKYTAIAELGKSAASGSVMKIAQCASRYQYDPELARGMYLYEHR
RCFDNIIGYCNTLCYHGKLLPKRGCEESNLMPAMGYLHIDGKGELASSGSRYNLLEAETIAAWLTDNQQSIEAYYGKSLH
EVVGIVTPFSAQVSTIKQALDKQGISAGANEKSLTVGTVHSLQGAERAIVIFSPVYSKHEDGAFIDSDNSMLNVAVSRAK
DSFLVFGDMDLFEIQPASSPRGLLAKYLFESEKNALFFDYKEREDLKTSETKIYTLHGVEQHDNFLNQTFENTGKHITIV
SPWLTWQKLEQTGFLDSMIAACSRGINVTVVTDRSYNTEHNDFEKRKEKQQNLKAALEKLNALGIATKLVNRVHSKIVIG
DDGLLCVGSFNWFSATREARYERYDTSMVYCGDNLKGEIEAIYNSLERRQV

• Homologs from PAI DB

Gene         GenBank Accn  Product                     Virulence or Resistance  PAI or REI     Alignment Type  E-val  Identity (%)
APECO1_3532  YP_854230.1   superfamily I DNA helicase  Not tested               PAI I APEC-O1  Protein         0.0    96
ORF_2        AAZ04413.1    superfamily I DNA helicase  Not tested               PAI I APEC-O1  Protein         0.0    96
S3169        NP_838460.1   superfamily I DNA helicase  Not tested               SHI-1          Protein         0.0    95
SF2965       NP_708739.1   superfamily I DNA helicase  Not tested               SHI-1          Protein         0.0    95
unnamed      CAD42018.1    hypothetical protein        Not tested               PAI II 536     Protein         0.0    95

• Homologs from VFDB (virulence genes)

Gene        GenBank Accn    Product                     ID of source DB  Alignment Type  E-val  Identity (%)
SeSA_A4746  YP_002117410.1  superfamily I DNA helicase  VFG1537          Protein         0.0    95
SeSA_A4746  YP_002117410.1  superfamily I DNA helicase  VFG0627          Protein         0.0    95