Gene Information

Name : VPBB_A0402 (VPBB_A0402)
Accession : YP_007300431.1
Strain :
Genome accession: NC_019971
Putative virulence/resistance : Virulence
Product : Putative superfamily I DNA helicase
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 424870 - 428388 bp
Length : 3519 bp
Strand : -
Note : -
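
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the stated length of 3519 bp but is not stated by the record itself. The function name feature_length is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates taken from the "Position" field of this record.
start, end = 424870, 428388

length_bp = feature_length(start, end)

# A complete coding sequence should be a multiple of 3 bp.
remainder = length_bp % 3

# One codon is the stop codon, so the protein is one residue shorter
# than the number of codons.
protein_len = length_bp // 3 - 1

print(length_bp, remainder, protein_len)
```

Under that assumption the result agrees with the Length field (3519 bp) and with a 1172-residue protein sequence.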

DNA sequence :
ATGAATAATCAGGGCTGGATTCACTATTGGCGTAATTCTCTTGCCGATGCCGATAGCGCGAAAGGTGCATTAAAAAAGCA
AGACCTGAAAAATTATGTGCGAACGACAACCGACGAGTTCAAAGAAGGTAAGCTTAAACCTGATTCTACATTGCTAGAAG
ATCTGTTTCGCAATGAAGCAGATACCCTAACCGCCGTACAAATTCATCATCGACCGGTCACCTATTATCTTCGGAAAGTT
CATGGGAAAGATTACAGCGGCAATATGCCTGGTGTACTGACACCAATTGTCTGCTCTTTATGGGTCAATCGTGAAGGACT
ACTATTTCCCAACACCACGCCATTCATACCTCGTGATTTGCTTGCTCCTCAAGGAAATGACACCTTTACTATCGCTGACG
TCGATAAGCTCGATGAATTCCTAACAACAAATGAAATTCCAGCGCAGTCTACTGAAAGTATTCCAGCCAAATTCGAACAA
GAAGAACAGTATCAGAACCATCAAAAAGATTGGCACAACTACTACGGACTCACCCAAAAACTATTTGCCGATTACTGCGA
TAGAAACCGAATCAAGCAGTTCTACGAACACATTGAAAGTCGTGGCTTAGTCAACAAAATCAGCGAGTGCTTGGGAGCTT
CAAGGCATATTTTAAAACTGTACGATAACCTGAGCAATTCAAGCACAGCCTTGCCTCTGCTTGATAGTTACGCGGTAAAA
ACAGTAACTAACCACGATGAATGTATTGATGTATCTCAGACAGTGAGTTCGCGTTTTGGCCACTCAAATAGTCAATTCCC
GCTAGCTAAAGCGCAATGCGATGCCTTGGCCCATACCTTAGCCATGCAAGAAGGTGATATTCTTGCGGTAAATGGTCCTC
CTGGCACAGGTAAAACAACGTTTGTTCTATCTGTAGTCGCGTCACTGTGGATTGAATCTGCGCTAAAAGAGAGTCAGCCG
CCGCTAATCATTGCGGCATCGACTAATAATCAAGCAGTAACCAACATCATCGATGCTTTTGGTAAAGACTTCGATGAAGG
CGACGACGAACTATCAGGCCGATGGTTGCCCGACATTTTTAGCTATGGTGGCTACCTGCCATCGGCGTATGGAGAATTGG
AAGCAGCCAAAAGTTACCAAACTAAGCATTTCTATGAAAAAGTCGAGCAGTTGGATTTTCTCGATCAAGCCCAAGCGCAT
TATTTGGATAGAGCAAAACAGGCGTTTCCACAACAGAATTTTGCTGATGTAACGCAAGTTAAAGCGTATCTCCTTGCCGA
ATTGCGCCAACATCAAAACCAGTTAGATCACATCCAAAACAATTGGCATCACTACAACCGCCAGCTAAATGACATCCATT
CTCGGCTGGGGAACAATCCCCAACAAACATTGGCAGATCAACAACAAGCCGTTTCTAATGCGCAAGCATTGAAAGATAAC
GCCAAAGAACAGTTAACTGCGTGGCGAAGCTACCTCGGCAATGAATCTACATGGTTAACTCTGTTTAAGTGGTTACCGCC
AATTAAGAATAAACTTGAACTCCAGCGCAGGAGTTTCATGTATAACCTTATTGAGCACGATGAAGAGCAAATTGAAAATC
TGTCATCAGACCTATTTGAGAGCCTACTCAAGCAAGTATTTTCTAGTAAAAAAGACGATTTTGATGATCAAAAAAATCGG
TATCAATCTTGGCTAGAGCAGTATCAAGAGTTTGAGCAATCACAACTGAATTGGCTTGATTCTATCAATAACTTTACTGA
AGACTCTCCAGAGCAAACTATTCCACAACTGACTGACATCGATTCGGTTTTAGATATCACGACCCGTTTTCGAATGTTCC
GCTTAGCGGTCCATTACTGGGAAGCATGCTGGCTGCTCAGTTGCCGAGATTTGGGTCAAGAGTTGAACAAGCAGGCCCGG
AAAACAGGTTTGAAAACAGTTCGTCCTCGCTGGCAACGCCGAATGATGCTTACTCCTTGCATTGTATCAACGTTCCATTC
TCTGCCTTCTCATATGACTTACCAGTCTCACGTTGGAAATAATGAATTTGAAACTGACTATCTGCTTAATGAAATCGACC
TTTTGATCGTTGATGAAGCAGGCCAAGTAGCTCCAGAAGTTGCCGCAGCCTCGTTCTCACTAGCCAAAAAAGCGTTGGTG
ATTGGTGATATTTACCAAATCCCGCCGATTAGAAATGTGTGCTCATCAATTGACCGGGGTAATTTAAAGCAGCATAAAGT
TATCAGCTCTGATGATGAATACACAGTCATTCAAGAGGAAGGCCGAAGTGTAGTGACTGGCAGTGTGATGCATGTAGCGC
AACAAGCAAGCCGCTTTCATTATATGCCGGAAGCTGAGCCTGGAATGTTTTTGCAAGAACATAGACGCTGCTACGATGAA
TTAATCTCATACTGCAATGACCTTTGCTATCAAGGTATCCTGATCCCCAAGCGAGGACAAGCGACCGAAGATAGCCTCTA
CTCTCCTTTCAGTCATTTGCACGTCGACGGTATCGCGGAATCATTTAGCGGCAGCCGACGCAATAAGTTAGAAGCAGAAA
CTATCGCTGCATGGCTACATGCCAATAAGGTTGAAATAGAGAATTACTACGGCGAGCCACTGGCCAAGTGTGTGGGTATC
ATTACTCCATTTTCAGCTCAAGTGAACCAGATCAAGACCGCGTGTGGTGAGTTCGATATCAAAGCAGGCAAGGGGGATGA
CCAGCTAACCGTTGGCACCGTTCACTCTCTTCAAGGTGCTGAACGCAAAATCATTATTTTTTCCCAGGTGTACACCAGAC
ACAACGATGGCGGGTTTATCGATATGGATCCGTCAATGCTCAACGTCGCTGTCTCTCGAGCCAAAGATGCATTCTTGGTG
TTTGGTGATTTGGATATCATTGAAGCTGCGCCATCATCTTCTCCACGAGGATTACTCGCCAAGTATCTCTTCACTGACGA
GCGAAATGAGCTTGAGTTTAGTGTTGGTCAACGCCCCGATTTATTACAAATATGCGGCCATCCGAAACTGCTAACCAATG
CCGAAGAGCACGATGCGTTCTTATCAAAACTGCTGAGGGAAGTACAACGAAAAATAGATATCGTCTCGCCTTGGTTACTG
CTAGACAAACTGCAATCAACTGGTCAGTTAGAACTACTAAAAACAGCACTGCACAAAGGTGTCCAGATAACCATACACAC
CGACAGACATTTCAATACCACGGTCGCAAACCACCCGGATACTAATAAAGTAAAAGCGTTTCAACATTGCTGCGCGACCT
TGGAACAGCTTGGTATCGTTATCAATGTGATAAATGGTGTCCATAGTAAAAGCGTTTTTGCTGATGATCGGTATATGGCG
GTGGGGTCATTCAATTGGTTTAGTGCAAGCAGAGGCGGAAAATACGCTAATATCGAGACATCATTAATCTATGTCGGTGA
GTTAGAGAAGGAGATCAAAACTCAACTCGACTTCTTAAATAGCCGAAGCTGCAACACAAACAAGCAGCCCGTGACGTAG

Protein sequence :
MNNQGWIHYWRNSLADADSAKGALKKQDLKNYVRTTTDEFKEGKLKPDSTLLEDLFRNEADTLTAVQIHHRPVTYYLRKV
HGKDYSGNMPGVLTPIVCSLWVNREGLLFPNTTPFIPRDLLAPQGNDTFTIADVDKLDEFLTTNEIPAQSTESIPAKFEQ
EEQYQNHQKDWHNYYGLTQKLFADYCDRNRIKQFYEHIESRGLVNKISECLGASRHILKLYDNLSNSSTALPLLDSYAVK
TVTNHDECIDVSQTVSSRFGHSNSQFPLAKAQCDALAHTLAMQEGDILAVNGPPGTGKTTFVLSVVASLWIESALKESQP
PLIIAASTNNQAVTNIIDAFGKDFDEGDDELSGRWLPDIFSYGGYLPSAYGELEAAKSYQTKHFYEKVEQLDFLDQAQAH
YLDRAKQAFPQQNFADVTQVKAYLLAELRQHQNQLDHIQNNWHHYNRQLNDIHSRLGNNPQQTLADQQQAVSNAQALKDN
AKEQLTAWRSYLGNESTWLTLFKWLPPIKNKLELQRRSFMYNLIEHDEEQIENLSSDLFESLLKQVFSSKKDDFDDQKNR
YQSWLEQYQEFEQSQLNWLDSINNFTEDSPEQTIPQLTDIDSVLDITTRFRMFRLAVHYWEACWLLSCRDLGQELNKQAR
KTGLKTVRPRWQRRMMLTPCIVSTFHSLPSHMTYQSHVGNNEFETDYLLNEIDLLIVDEAGQVAPEVAAASFSLAKKALV
IGDIYQIPPIRNVCSSIDRGNLKQHKVISSDDEYTVIQEEGRSVVTGSVMHVAQQASRFHYMPEAEPGMFLQEHRRCYDE
LISYCNDLCYQGILIPKRGQATEDSLYSPFSHLHVDGIAESFSGSRRNKLEAETIAAWLHANKVEIENYYGEPLAKCVGI
ITPFSAQVNQIKTACGEFDIKAGKGDDQLTVGTVHSLQGAERKIIIFSQVYTRHNDGGFIDMDPSMLNVAVSRAKDAFLV
FGDLDIIEAAPSSSPRGLLAKYLFTDERNELEFSVGQRPDLLQICGHPKLLTNAEEHDAFLSKLLREVQRKIDIVSPWLL
LDKLQSTGQLELLKTALHKGVQITIHTDRHFNTTVANHPDTNKVKAFQHCCATLEQLGIVINVINGVHSKSVFADDRYMA
VGSFNWFSASRGGKYANIETSLIYVGELEKEIKTQLDFLNSRSCNTNKQPVT
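
The relationship between the DNA and protein sequences above can be spot-checked by translating the first and last codons. This is a minimal sketch using the standard genetic code. Only the first 30 and last 12 nucleotides of the DNA sequence are used, and the helper name translate is hypothetical rather than part of the record or of any library.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any incomplete trailing codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        for i in range(0, usable, 3)
    )


# First 30 nt and last 12 nt of the DNA sequence in this record.
first_30 = "ATGAATAATCAGGGCTGGATTCACTATTGG"
last_12 = "CCCGTGACGTAG"

print(translate(first_30))
print(translate(last_12))
```

The first fragment corresponds to the start of the protein sequence (MNNQGWIHYW) and the second to its final residues followed by the stop codon.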

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
SF2965 | NP_708739.1 | superfamily I DNA helicase | Not tested | SHI-1 | Protein | 0.0 | 42
S3169 | NP_838460.1 | superfamily I DNA helicase | Not tested | SHI-1 | Protein | 0.0 | 42
unnamed | CAD42018.1 | hypothetical protein | Not tested | PAI II 536 | Protein | 0.0 | 42
ORF_2 | AAZ04413.1 | superfamily I DNA helicase | Not tested | PAI I APEC-O1 | Protein | 0.0 | 42
APECO1_3532 | YP_854230.1 | superfamily I DNA helicase | Not tested | PAI I APEC-O1 | Protein | 0.0 | 42

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
VPBB_A0402 | YP_007300431.1 | Putative superfamily I DNA helicase | VFG0627 | Protein | 0.0 | 42
VPBB_A0402 | YP_007300431.1 | Putative superfamily I DNA helicase | VFG1537 | Protein | 0.0 | 42