Gene Information

Name : Shal_3494 (Shal_3494)
Accession : YP_001675698.1
Strain : Shewanella halifaxensis HAW-EB4
Genome accession: NC_010334
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : R : General function prediction only
COG ID : COG1744
EC number : -
Position : 4256966 - 4261486 bp
Length : 4521 bp
Strand : -
Note : KEGG: vvu:VV1_0767 hypothetical protein

DNA sequence :
ATGAAAAAAACAATTTTAACACTGACGTTATTAATACTGATTGGAGGTTGTGGTGATGATGAACCTGTTGCGCCGTCGAT
CCCCTTAATTCCATTGGAGCCATCGACGCCAGTGGTTCCCTCTATTCCACTAGAGCCATCTACACCGATTAAACTTGAAA
TTAGCGTGTCACTAGACGGAACATTACTATTGGGGGAGCCTATTCGTTGTAATGACCAAAAGCCTAATGGTTTTGAAATC
GAAATCGGCGACACGGTTTCTTGTCTCTATAACAATATGCCGCTACTAACATTCAGTGACGTGAGAGCGGATATAGCACG
AGCAGCAGCGCCAACTGAGCGTAAACAATTGCTCTTAATTGATGCCGATGAATTTATCGAACACCCTGAAAATGCATCGA
ATGCTCAAACTCTGATTAAGACCATGGGAGTTGTGCGGGGCGGGCAAATCGATATAGAGCTATCGACATTGCAGGCATTG
CAGTTTGAAAATAACTACCTCAATAATTTATCCATGGCTGAGGAGGATTTCAAAAACTTGCTAGAGCAGCAAGCCAACGA
TGCTCAAGCTGACAAACTCCCCTCTACCCATCTTCCCGACATAAAGCCTGAGGTGAGTACTGGTACATCTACGGATTTAA
ATGCTCACTTTGTCGCGGCCGATGCAGAGGCGGCATACCAGTATAAGCCTGAGCAGAATATTCTATCCACTGCGATACTG
ACTGACAGCCAAGGCAAACCTGTTGCGGGTATTACCTATTTCAGCCGCAGCAGCCGTGGGATTACGGACTTAGATGGTTC
GTTCCAATTTTCTTGGGGCGATAGCGTCAGCTTTGGCATCGATACTTTTGAGCTTGGAGAGGTTCGCGGTAATCAGAAAA
GTTACACATTGAGTCAGCTTGGCGAAGGTAATAACGGCCTCAATGCCGAGGCGTTAGTGTTACGTTATGCCACAAGCCAA
GGGCAAGCTTGGCAATTGCCTGCGCGGGTTGCTGAAGTCTTTGCTAGGTATCCTAACGTGATAAATGAAATCATCTCTTT
ATCTCTAAGTAGTCAAGAGAAGGTGTTAGATACGGGTAATGGAGAGCAAATAATCGCTGCCAAATTTAGTAAACAATTTG
AGTTAGGGCTCGCCAGAGATATTGATAGCGTGTTGTGTGACATCAACTGTCAACAAACAGCTAAAGCAAGAGAGCTATTA
CCTGAAGTGCAATCTGAGGGCGCAGACATTGTTAGAGCCTTAAACAATGTCACCGATGACAGTGGTCAGATTCTAGCTGA
CATTCAGCGTCTATGGGGCACAACATCTGATGCTGGTTGGAATAAAGTCAGTGCTTTTCATGTGTTTCATGACAGTACCA
ACTTTTATGGCAGTACAGGTAATGCTCGTGGTCAGGCTACCGTTAATATCGCTAACAGCGCTTTTCCTGTGATGATGGCC
CGTAACGATAATAACTACTGGATCCCTTTTGGTGCTAACAAAGCGTGGGATAAAGATACCCTCGCGTATATTACCGAAGC
ACCATCGACAGTGGTACCTGACAAGGTCAGTGGCGAAACCGCCACCTTTAATCTGCCTTTTATCAGCATTGGTGAAATAG
GTTTAGGCAAGATTATGGTCATGGGTAATGCTCGCTATAACTCAGTGCTAGTTTGCCCTAATGGCTATAGTTGGAATGGT
GGCATCGACGACAAGAGCCAGTGCCGAGTCGCTAGTGACAGTGATGATATGAAGCACTTCTTCCAAAATAGCCTCAAATA
TCTGACTAATGGCAAGTCTGGCTACAGCGTTGGTACCAACATGCCCTACGTTTATTTTAAGCGTGGTGGTCAGGTAACAG
GTGAGCAAGAACCTTTTATCATCGATCCTGTCTTTGGTGTGACGACTGAGCAGGTGGTAAGTTTTGCAGGCATTGACCCG
CAAACTATGCCGCTACTCATCCTCAATGGTTTTGAATACACTATTAATACCAGCGGCAGTCACTATACTCTGCCGATGCA
AGCCGATACGACTCAGCCAAAGTTGACTCAAGATGATGTCACGGCGTTAATTGATTACGTCAATAACGGCGGCAACATCA
TGATTATGGAGACTCTTGCTGGTACCAATAATACGGGCGCCTTGTCGCGTTTACTCGACAGTGCTGGTATTGCCTTTGGT
ATGGGAGGCTCAGTCGTTGCTAATGGTAACGGCCCTAGTGGTGGATATGCCGATCGCGTCCGTAACCAGCGTGGTGATGG
GATCTGGATCATTGAGCGTTATGCAGCCGTAGAAGGCGAAGATGGCACACCGACATCTCCTTATATCATTAACCAAGCCG
ACGGCAGCGTAGATTGGCTCTATAAGCAGCAGAGTAAGCCTGATGATAAACCAAGTCTATCCGTTGCCAAATGGGTTGAG
ACTGATGAGGATGGAGTTGAAACGACTCACACTGCGTTTATTGATGAAAGCCAAACCGATGATCTCAATGCGGCTAAAGC
GCTCATCTTGTCTGCTTTCACCAAGTCAGATGGCTCGCCTGCCTATCAGGAGTGTACCAACCCCAATTTTCATTATGAGA
TCAACTGCCTTGAATATAGGCCCGGTAACAATATCCCGGTTACCGGAGGCATGTTTGTCCCTCGGTATACACAGTTAGAA
CTGGGTGATGCACAAGCTCGCGCCATGGTAAAAGCGGCTGATTTAGGCACCAATATTGAACGCTTATATCAACATGAGCT
CTATTTCCGCAGTCGAGCTCAACAAGGCGTACGCTTATCGAATGTTGATCTCAACCGTACCTATCAAAATATGACGGTAT
GGTTGTGGAACAATCTCGATTATCGCTATGAGTCAGGCCTTGATGATGAGTTAGGCTTCGAGCGCTTTACTCAGTTTTTA
AACTGTTATAGCGACAATCGGGCTCAAGGCGGCACTCACTGCCCCGTTGAGTTGCAACAGCAGATGTTGTCATTGCAGAT
GATCCTCGGTCAAGAGAGTGGTGAATACAAAGGATACATGAATCCGAGTTATCCTTTGAACTACATGGAAAAACCACTGA
CTCGTTTGATGCTAGGCCGCTCTTTCTGGGATCTAGATATTAAAGTTGACCCACGTCTGTTCCCAGGGGAAGCAATGGGC
TCTCAAGGTGGAGGACAAGTTACATTCGACTTGAGTCATAACACTGCTGCCTGGTTTGCCGGAAATCGTCAACCTACAGG
GCAATGGGCTGTTGCACAGCAGTCGTTTACCGTATCGGTTAATGGCAGCGCAGACCCCGTTACCATTACTGTGGCTTTGG
CCGATGATCTTACCGGGCGCGAAAAGCACGAACTTGGCCTTAAGCGACCACCGCGTATGACTCAGTCATTCACACTAAAC
TCAGGTGATATTGGTAGCAGCTACAGTTTAACGTCACCCTATGGCGGTTTAATTTATGTCCAAGGTCAAGGTAGCCAAGA
AGGACATAATGTAACCCTGAACTTTACTGGTAGTGTCGACGCGCCGCTTTATCAGTATGATGGTAAAGGCGGACAATGGC
TGAATCCGCTAGATTCCCCTGCGCCTATTGGTGAAGTCATATCTCAAAGCTTTGTTTATACCGCAGCTAAAGCCAACCTT
AACGCGGCCAATTACAACGGCAGCCCAGAGCAGTTTGCTAAAGAGCTGGATACTTTTGCTGAGGATATGAATAACTTCTA
TAGCCGCGATGAAGCGCTACAAGGAAATCTAAACCGTAAAGCGACCGATAGTAGTATTCCAACTAATCGTCATCACTTTG
TTAATGACATCGCTATCAGTATCGGAGCGGCCCACTCAGGTTATCCGGTGATGAATGCAAGCTTTAACGCTGAGAGTCAT
AACCTTGCGCTCACACCGTTAAACTCTTGGTTGCTATGGCACGAGGTTGGCCATAATGCTGCTGAGGCACCGTTCAATGT
CGATGGTGCGACGGAAGTGGTTAACAATCTGCTCGCGCTTTATATGCAAGATAAACATCTCGGCAAGATGAGTAGAGTTG
AGCAGGATATCCGCATCGCACCTGATTTCGTCGCATCTGAAAATGGTCATGCTTGGGCTGCGGGCGGCGCGGGTGAGCGT
TTGGTGATGTTTGCTCAGTTAAAAGAGTGGGCCGAAACTGAATTTAATATCGATAATTGGTATTCAGAAGGGGTTTTACC
CGCTTATTACGATAGCGCAACAGGTTTGCAAGGCTGGAACCTATTTAAACTGATGCACCGTTTAACCCGTAATATTGATG
ACAAGCAGTTTGTGCTCAAAGGCGATAATCTGTGTCATCGGCAAGAGGGTATAGGTAAGAGTGATCAGCTCTTACTCTGC
GCCTCTTATGCGACTCAAATGGATCTCAGTGATTTCTTCCGCGCTTGGAACCCGGGAAGCAAAGCCTTTATTTATCCTGG
TGACCTAACTCCTCAGTATGAGGGGGGGATCTCTGCGGCGGGATTAGCAAAAGTTGCCGCATTAAACTTACCTAAGCCAA
CACGCGATCCACTTCAAATTGACCGCATTACCGTGAGATAG

Protein sequence :
MKKTILTLTLLILIGGCGDDEPVAPSIPLIPLEPSTPVVPSIPLEPSTPIKLEISVSLDGTLLLGEPIRCNDQKPNGFEI
EIGDTVSCLYNNMPLLTFSDVRADIARAAAPTERKQLLLIDADEFIEHPENASNAQTLIKTMGVVRGGQIDIELSTLQAL
QFENNYLNNLSMAEEDFKNLLEQQANDAQADKLPSTHLPDIKPEVSTGTSTDLNAHFVAADAEAAYQYKPEQNILSTAIL
TDSQGKPVAGITYFSRSSRGITDLDGSFQFSWGDSVSFGIDTFELGEVRGNQKSYTLSQLGEGNNGLNAEALVLRYATSQ
GQAWQLPARVAEVFARYPNVINEIISLSLSSQEKVLDTGNGEQIIAAKFSKQFELGLARDIDSVLCDINCQQTAKARELL
PEVQSEGADIVRALNNVTDDSGQILADIQRLWGTTSDAGWNKVSAFHVFHDSTNFYGSTGNARGQATVNIANSAFPVMMA
RNDNNYWIPFGANKAWDKDTLAYITEAPSTVVPDKVSGETATFNLPFISIGEIGLGKIMVMGNARYNSVLVCPNGYSWNG
GIDDKSQCRVASDSDDMKHFFQNSLKYLTNGKSGYSVGTNMPYVYFKRGGQVTGEQEPFIIDPVFGVTTEQVVSFAGIDP
QTMPLLILNGFEYTINTSGSHYTLPMQADTTQPKLTQDDVTALIDYVNNGGNIMIMETLAGTNNTGALSRLLDSAGIAFG
MGGSVVANGNGPSGGYADRVRNQRGDGIWIIERYAAVEGEDGTPTSPYIINQADGSVDWLYKQQSKPDDKPSLSVAKWVE
TDEDGVETTHTAFIDESQTDDLNAAKALILSAFTKSDGSPAYQECTNPNFHYEINCLEYRPGNNIPVTGGMFVPRYTQLE
LGDAQARAMVKAADLGTNIERLYQHELYFRSRAQQGVRLSNVDLNRTYQNMTVWLWNNLDYRYESGLDDELGFERFTQFL
NCYSDNRAQGGTHCPVELQQQMLSLQMILGQESGEYKGYMNPSYPLNYMEKPLTRLMLGRSFWDLDIKVDPRLFPGEAMG
SQGGGQVTFDLSHNTAAWFAGNRQPTGQWAVAQQSFTVSVNGSADPVTITVALADDLTGREKHELGLKRPPRMTQSFTLN
SGDIGSSYSLTSPYGGLIYVQGQGSQEGHNVTLNFTGSVDAPLYQYDGKGGQWLNPLDSPAPIGEVISQSFVYTAAKANL
NAANYNGSPEQFAKELDTFAEDMNNFYSRDEALQGNLNRKATDSSIPTNRHHFVNDIAISIGAAHSGYPVMNASFNAESH
NLALTPLNSWLLWHEVGHNAAEAPFNVDGATEVVNNLLALYMQDKHLGKMSRVEQDIRIAPDFVASENGHAWAAGGAGER
LVMFAQLKEWAETEFNIDNWYSEGVLPAYYDSATGLQGWNLFKLMHRLTRNIDDKQFVLKGDNLCHRQEGIGKSDQLLLC
ASYATQMDLSDFFRAWNPGSKAFIYPGDLTPQYEGGISAAGLAKVAALNLPKPTRDPLQIDRITVR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
VC0395_A0370 YP_001216326.1 lipoprotein Not tested VPI-1 Protein 0.0 58
acfD AAK20802.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 56
VC0845 NP_230493.1 hypothetical protein Not tested VPI-1 Protein 0.0 56
acfD ACK75652.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 55
acfD ACK75655.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 55
acfD ACK75646.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 55
acfD ACK75664.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 55
acfD ACK75649.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 55
acfD ACK75670.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 55
acfD ACK75667.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 55
acfD ACK75658.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 55
acfD ACK75661.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 55
unnamed CAE85238.1 hypothetical protein Not tested PAI V 536 Protein 0.0 50

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Shal_3494 YP_001675698.1 hypothetical protein VFG0106 Protein 0.0 56