Name : S3169 (S3169)
Accession : NP_838460.1
Strain : Shigella flexneri 2457T
Genome accession : NC_004741
Putative virulence/resistance : Virulence
Product : superfamily I DNA helicase
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG1112
EC number : -
Position : 3046628 - 3050143 bp
Length : 3516 bp
Strand : +
Note : residues 1 to 1171 of 1171 are 92.99 pct identical to residues 1 to 1171 of 1171 from GenPept : >gb|AAL23307.1| (AE008910) superfamily I DNA helicases [Salmonella typhimurium LT2]
DNA sequence :
ATGGATGAAAATGCTTTAGGGTTTGCCTCATACTGGCGCAACTCGCTTGCGGATGCTGAGTCAGGAAAGGGCAGTTTTAA ACGGAAAGACGCCCAAAATTTCACTCACTGGCATGGGATAGCGGCGGGACGTCTTGACGAAGCGATTGTCAGTAAATTTT TTGAGGGAGAAAAAGACGATGTCGAAACGGTCGATGTCATCTTGCGCCCAAAAGTTTATTTCCGGTTACTGCAGCATGGT AAGGACCGTTCCGCAGGCGCGCCTGATATTGTTACCCCGATAGTGACGCCAGCCTTGCTAAGCCGTGAGGGTTTTTTATA TCCGACGCCAGCGACCTCCATTCCCAGAGACCTGCTTGAACCTTTGCCAAAAGGAGCATTTTCGATTGGTGAGATTGGGC AGTATGACAAATACAAGACGACCCATACCACGTTCTCTATCAACTTTGATGACAGCGTTGATAAGACTGCCGAAACGGAT GAAGAACGGGAAGCACGATATGCCGCCTTGCAGCAGGAGTGGCGTCAATATCTGTATGACTCAGAGAGGCTACTGAAGAG CGTTGCCGGCGACTGGATTGAAAAACCTGAGCAATATGAACTCGCTGAGCACGGTTATATTGTTAAAACGGCTCAATCTG GCGGTGCCAGTTCCCATATCCTTTCTCTTTATGATCACCTGCTTGTTTGCAATAAGGATGTGCCGCTCTTCAATCGCTTC GCCTCGCGAGAGGTTCATGCTGCAGAGTCTTTGCTGGCCCCAGGAGCAAAATTCAGCGACAGGCTTGGACACTCCGGAGA TAAGTTTCCGCTGGCAAAGGCTCAGCGCGATGCCTTAAGCCATTTTCTGGATGCAAGACATGGCGATATCCTTGCTGTTA ATGGCCCTCCGGGAACCGGAAAAACCACGCTGGTGCTTTCTATCATCGCCACGCAGTGGGCCAGAGCGGCTCTCGAAAAA TCTGAGCCTCCGGTTATTATCGCGACTTCAACGAATAACCAGGCTGTAACGAACATTATTGAGGCATTCGGGAAAGACTT TTCGCAAGGTTCAGGTGCGATGGCCGGGCGATGGTTGCCAGAGCTGAAAAGCTTCGGTGCTTATTTTCCCTCAAGCAGTC GTAAAGCTGAGGCAGCCAAAAAATATCAAACTGAAGATTTCTTCAACCAGGTTGAGTCAAAAGAGTATGTAGAGGATGCA CTGCTGTTTTATCTGGAAAAGGCTAAGGCAGCCTTTCCTGGAAAAGAGTGTTCATCCCCTGAAAAGGTCATTGAACTCTT GCATGGTCAGTTGGCAGCAAAATCTGAGCAACTGATAAGACTGAACGCAACATGGCAAACGTTAAGCCAGATTCGGGCTG
CGCGTGAGCTTATTGCTAATGATATTGAGCAATATCTCGATAATTTAAATAAATTACTTTCCGGACAAGAACAAAAAGTC ACTCTACTGAAGAGTGCTAAAACGGAATGGAAAAAATATCGCGCCGGTGAATCACTGATCTATTCATTATTTTCCTGGCT CCCGGCGGTTCGCAATAAGCGACAGTACCAAATACAGCTGTTTCTCGAAGATAAATTAGGCGCGCTGATTGCAGGAAATC AGTGGTCTGATCCTGAAACTATCGAACGTAATATTGATGGGCTGCTCAATTCCGCTGAGCGCGAGCAAACAACATACCGG CAGCAGATTGACTCCGCCCATGAAATCGTTCTTAAAGAACAGCAGGCGGTTCAGGAGTGGCAGAGGCTGGCATTTGATTT AGGGTATGAGGGCGACGAGGAACTGAGCTTCTCACAGGCCGATGAACTGGCTGATACGCAGATTCGCTTCCCTGCATTTT TACTGACGACTCACTACTGGGAAGGTCGTTGGCTGATGGATATGGCCAGCATTGATGATCTGCAGGACGAGAAGAAGAAA AAAGGTGCTAAAGGGGTAACCGCCCGTTGGCAACGTCGAATGAAACTCACGCCATGTGTGGTGATGACATGCTATATGCT GCCCGGTAATATGCAGATAAGTGAGCACAAAGGACAACGTAAATTCGAGAAAAGTTATTTGTATGATTTTGCCGATTTAC TCATTGTCGATGAAGCCGGGCAGGTGCTTCCTGAAGTGGCTGCTGCCTCGTTTGCATTAGCTAAGAAGGCATTAGTGATT GGCGATACGGAGCAGATCCCGCCAATATGGAGTATTGCTCCTGCGATTGATGTCGGTAACATGCTGGCGGAAAAAATTCT GTCTGGCAGTACGCAAGAAGAGATTACCGAGAAATATACGGCAATCGCAGACCTTGGTAAAAGTGCCGCATCTGGCAGCG TTATGAAAATAGCGCAGTTTGCTTCGCGCTATCAATATGATCCCGAACTGGCTCGTGGTATGTACCTATATGAACACCGC CGGTGCTACGACAATATTATTGGATACTGTAATACGCTCTGCTATCACGGTAAGTTGTTGCCTAAAAGAGGGCGTGAAGA GAGCAATTTAATGCCCGCAATGGGGTATCTCCATATTGATGGTAAAGGAGAGCTGGCAAGTAGTGGAAGTCGATATAATT TGCTTGAGGCTGAAACGATAGCGGTCTGGTTGGCAGAGAACCAGCAAAATATTGAAGCGCATTACGGTAAATCGCTTCAT GAAGTTGTCGGTATTGTGACGCCTTTTAGCGCTCAGGTATCCACTATCAAACAGGTGCTGGGCAAACAAGATATCAGTAC AGGCACGAATGAAAAGTCGCTCACAGTGGGCACCGTGCACTCTCTTCAGGGAGCGGAAAGAGCGATTGTGATATTCTCGC CAGTCTATTCAAAACATGAAGACGGCGGGTTTATTGATAGCGATAACAGCATGCTGAATGTTGCAGTCTCCCGTGCGAAG GACAGTTTTCTGGTCTTCGGCGATATGGACCTGTTTGAGGTCCAGCCAGCCTCATCGCCACGGGGATTACTGGCAAAATA CCTCTTTGAGTCAGAGAAGAATGCGCTCTCTTTTGATTATAAAGAGCGTAAGGATTTAAAAACCGCCGGGACCAAAATCT ACACACTTCATGGTGTGGAGCAACATGATAATTTCCTGAATCAGACATTTGAAAATACCAGTAAACACATCACGATAATT TCTCCATGGCTGACCTGGCAAAGGCTGGAGCAAACCGGTTTTCTTGATTCCATGATTGCGGCGTGTTCACGTGGAATTAA CGTCACGATAGTCACTGACAGAAGCTACAACACTGAACATAATGATTTTGAGAAGCGAAAAGAGAAGCAGCAGAACTTTA 
AAGCGGCGCTGGAGAAACTGAATGCGCTGGGTATTGCTACAAAGCTGGTAAACCGTGTTCATAGCAAAATTGTTATTGGT GATGATGGTTTGCTGTGTGTGGGATCGTTCAACTGGTTTAGTGCGACACGGGAAGCGCGATATGAACGATACGATACATC AATGGTTTATTGCGGTGATAACCTGAAGGGTGAGATTGAGGCTATTTATAATAGTCTTGAGAGGCGTCAGGTTTAG Protein sequence : MDENALGFASYWRNSLADAESGKGSFKRKDAQNFTHWHGIAAGRLDEAIVSKFFEGEKDDVETVDVILRPKVYFRLLQHG KDRSAGAPDIVTPIVTPALLSREGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTTHTTFSINFDDSVDKTAETD EEREARYAALQQEWRQYLYDSERLLKSVAGDWIEKPEQYELAEHGYIVKTAQSGGASSHILSLYDHLLVCNKDVPLFNRF ASREVHAAESLLAPGAKFSDRLGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGPPGTGKTTLVLSIIATQWARAALEK SEPPVIIATSTNNQAVTNIIEAFGKDFSQGSGAMAGRWLPELKSFGAYFPSSSRKAEAAKKYQTEDFFNQVESKEYVEDA LLFYLEKAKAAFPGKECSSPEKVIELLHGQLAAKSEQLIRLNATWQTLSQIRAARELIANDIEQYLDNLNKLLSGQEQKV TLLKSAKTEWKKYRAGESLIYSLFSWLPAVRNKRQYQIQLFLEDKLGALIAGNQWSDPETIERNIDGLLNSAEREQTTYR QQIDSAHEIVLKEQQAVQEWQRLAFDLGYEGDEELSFSQADELADTQIRFPAFLLTTHYWEGRWLMDMASIDDLQDEKKK KGAKGVTARWQRRMKLTPCVVMTCYMLPGNMQISEHKGQRKFEKSYLYDFADLLIVDEAGQVLPEVAAASFALAKKALVI GDTEQIPPIWSIAPAIDVGNMLAEKILSGSTQEEITEKYTAIADLGKSAASGSVMKIAQFASRYQYDPELARGMYLYEHR RCYDNIIGYCNTLCYHGKLLPKRGREESNLMPAMGYLHIDGKGELASSGSRYNLLEAETIAVWLAENQQNIEAHYGKSLH EVVGIVTPFSAQVSTIKQVLGKQDISTGTNEKSLTVGTVHSLQGAERAIVIFSPVYSKHEDGGFIDSDNSMLNVAVSRAK DSFLVFGDMDLFEVQPASSPRGLLAKYLFESEKNALSFDYKERKDLKTAGTKIYTLHGVEQHDNFLNQTFENTSKHITII SPWLTWQRLEQTGFLDSMIAACSRGINVTIVTDRSYNTEHNDFEKRKEKQQNFKAALEKLNALGIATKLVNRVHSKIVIG DDGLLCVGSFNWFSATREARYERYDTSMVYCGDNLKGEIEAIYNSLERRQV |
Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity |
S3169 | NP_838460.1 | superfamily I DNA helicase | Not tested | SHI-1 | Protein | 0.0 | 100 |
SF2965 | NP_708739.1 | superfamily I DNA helicase | Not tested | SHI-1 | Protein | 0.0 | 100 |
unnamed | CAD42018.1 | hypothetical protein | Not tested | PAI II 536 | Protein | 0.0 | 98 |
APECO1_3532 | YP_854230.1 | superfamily I DNA helicase | Not tested | PAI I APEC-O1 | Protein | 0.0 | 98 |
ORF_2 | AAZ04413.1 | superfamily I DNA helicase | Not tested | PAI I APEC-O1 | Protein | 0.0 | 98 |
Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity |
S3169 | NP_838460.1 | superfamily I DNA helicase | VFG0627 | Protein | 0.0 | 100 |
S3169 | NP_838460.1 | superfamily I DNA helicase | VFG1537 | Protein | 0.0 | 98 |
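
The Position, Length, and protein-length values in this record can be cross-checked with simple arithmetic. The following is a minimal sketch. It assumes that the Position coordinates are 1-based and inclusive, and that the DNA length covers the 1171 coding codons plus one stop codon; neither convention is stated in the record. The function names `span_length` and `coding_length` are hypothetical helpers introduced for illustration only.

```python
# Values copied from the record for S3169 (NP_838460.1).
START, END = 3046628, 3050143   # "Position" field
GENE_LENGTH_BP = 3516           # "Length" field
PROTEIN_LENGTH_AA = 1171        # residue count given in the "Note" field


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(n_residues: int) -> int:
    """Nucleotides needed to encode n_residues, assuming one trailing stop codon."""
    return (n_residues + 1) * 3


if __name__ == "__main__":
    # Both checks should agree with the "Length" field if the assumptions hold.
    print(span_length(START, END) == GENE_LENGTH_BP)
    print(coding_length(PROTEIN_LENGTH_AA) == GENE_LENGTH_BP)
```

Under these assumptions both comparisons agree with the 3516 bp Length field. A mismatch in a similar record would suggest a different coordinate convention or a truncated sequence.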