Gene Information

Name : VPA1376 (VPA1376)
Accession : NP_800886.1
Strain :
Genome accession: NC_004605
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1444991 - 1449484 bp
Length : 4494 bp
Strand : -
Note : similar to GB:AAF94008.1 (AE004169) percent identity 60 in 1529 aa

DNA sequence :
ATGAAAAAGACGTTACTGTCGACAATAGTAGTAAGTTTATTGTTTGGTTGTAACTACGATAGTTATCATACCGATACAGT
TGTACCCGAGCTACCGCCACCTCCAGAAGCCACAATTAATGTCGGTCTATCGCTGGATGGTTATCTTAAATTTGTTAATT
CCTCAATTCTATGTAATGGGGAACCAGCAGAGCACTTCACTGTTGGGCCATCTGATCATGTGAGCTGCGTTCTCAAGGCA
GATGGAACTCCAATCGCAACTTATTATTCTCCATTTGATTCTGGTACGGACAATCAGGCCCGTATCACTTTAGAGTATCT
GAAGTTAACTGATGCAGAAGAATATAAAGATAGCTCAACTCGTGTCCAGAATATACAGACGTTAATTAAAACTATGGGTA
CGATCCATGGTGATGAGCTTGACCTTAGTTTGGAAAAAACTTCTCATCGTTTAATTTTTAAAAACTACTTAAATAATCAA
CTTGATGTTGAAATAGATACTTTTAAGAAGCTCCTTCAAGAAAGGTTAAGTAATGATTCTCAAACAGACAAACAACCATC
AACACATACACCAGAAGTTGAGCCTGCGGTAACCCCTGGTGCATCGAGCGATCTTTCACAAGCTTTTGTTTCTGCTAATG
CTGAAAAAAGCTTAGAGTACAAACCAAAGGAACTTATCTTAACGACGGGATATTTGGTTGATAGTTTTGGTCGTTCTGTA
AATGGAATTGCCTACTTTACTTCAAAAGGGCGAGGTCTAACTGGCTACAAGGATGGGCGATTAATTGGTGATGGCTCTTT
AGAATTCTCTTGGGGTGATACTATCAATTTTGGTATTGATACATTTGAATTAGGTAGTACTAGAGGCAACAAGAATACAA
TAAAATTACAAGATCTTGGCAGTGGAAATGAAGGAAAAAATATTGAAAGCCTTGTAATGCGCTTTTCCGAGGAGAACGAT
CAATCAGTTTTTGTTACAGATAAAGTTACGGAAGTATTTTCTAAATATCCTAATGTAATTAATGAAGCAATTTCACTTTC
ATTGTCGAATGAAGATATCCAATTAGATTTAGGTAATGGCAATACCGAAGTAGTTAAAGGGGAGTTTGAAAAGCAGTTTG
AATCTGGCTTAGCTGAAGAGATTGATAAAGAGTTAGGGAGGCAGAAGTTAGCATTTGGGGAGCAGTATAGAAAACCGAAA
CAAATTAAGGCTGTTGATTCTGACGCACAAAACGTTCAAAGGGATGTCGAGCGTTTATGGGGTGCAACTCAGCAGGCTCA
GAGAGAAGGTTGGAAACCGGTAGAGCGATTCCATATTTTTCATGATAGTACGAACTTCTATGGAAGTACGGGAAGTGCTC
GTGCTCAGGCTGCTGTAAACATCTCGAATAAGGCATTTCCTGTAGTAATGGCACGGAATGACAAAAACTATTGGCTCGAT
TTTGATAAGCCTCAGGCTTGGGACGAAAATGGTCTTGCTTATATTACAGAAGCACCTTCAAAGGTTAAACCTAAAAAGGT
AGATGCATCGAATGCGACCTTCAATTTACCCTTCATCAGTATTGGTGACTTAGGTAAAGGTAAAGTGATGGTTATGGGGA
ATGCTCGGTATAACAGTGTGCTAGTCTGTCCAAATGGATTTAGTTGGAATGGAGGGGTTAACGACCAAGGGCAGTGTACT
GGCAATACCGATAGTGATGACATGGCAAACTTCTTTAATAACGCTTTTCAATATCTAACTGGTAAAAAGGCCGGAACATT
TAGCGTTGCAACTAACATCCCACATGTTTACTTTAAACGAGGTGGACAAGTATTGGGCAGCAAAGCCAGTTACCTGATAG
ATAAACGTTTTGCTCAAGACACGCAACAATTGGATTCGTTTTCGGGTCTTGATCCTAACGATATCCCATTGGTTATTTTA
AATGCGTATAGCTACTTGGGGGAGCAGGGGGGCTTAGGTGCATATGATTTACCTATGCAAGCAAATCTTGATGCGCCGAA
ACTCACCCAACAAGATATTTCAGATTTGATTGCCTATGTGGAAGACGGTGGTAGTGTCTTGATGATGGAGACCATCAAAG
GCCAAAAAGATTCTGGTGTAGTCTCTCGTTTACTTGATGCCGCTGGTATTGCTTTTGGGATTGGGGAAAGTGTAGCTCGG
GACGGTAATGGTCCAAATGGAGGCTACCCTGATCGGGTTCGTAGCCAGCGTCAACAAGGTATTTGGGTTTTGGAACGTTA
TGCTGCTGAAGATAGTAGTAATGGCGAAGGGCCAAGCCTTCCATATGTGATAAAGGAAGATGGTAGTGTTGAATGGAAGT
ATATTATTGAGAACCGTCCAGATGATAAACCTAAGTTAGAAGTCGCTAAATGGACAGAAATTAATGAGCAGGGAGATAGT
AAAGTTCAAGTAGCATTTATCGATGAAGCGAACTTTTATCAAGACGGTACGTTTGATAACGAGGCACTAACAGTTGCTAA
AAACCGCATTCTAGATGCATTTAAAGATAATTCAGGAAAACGAGCATATGAAGAATGTACAAATAATGAATATCATTATG
AGGTCAACTGTTTAGAATATCGCCCTGGTAACAAGATTCCTATTAGTGGTGGACTGTATGTCCCTAACTATACGGAGATG
AAGCTCGGTGAACATGAAGCAAAAGCGATGGTAAAAGCGGCCAATATTGGTAGCAATATTGAAGCGCTTTATCAGCATGA
GCGATACTTTAGAACAAAAGGTAAGCAAGGCTTCCGCTTGAATAGCGTCGATATGAGTCGTATGTACCAAAACTTGTCTG
TATGGCTCTGGAACGACCTTAGATATTCTTATGATCAAGAAAAAAATGATGAGTTAGGATTTAAGCGCTTTACAGAGTTT
CTCAACTGTTATACGGATGACAAGGCTGGTGGTAATACGATTTGTCCAGAGAGCTTGAAACTTGAACTTCAGAAAATGGA
TATGATCTACGCTGAAGGTGAATATGCTGGTTATATGAATCCTAGTTACCCATTGAACTATATGGAAAAACCATTAACAC
GTTTAATGCTTGGGCGTTCATTCTGGGATCTTGATGTCAAGGTTGACACTAGACAGTTCCCTGGAGTTGCATCATCGAGT
GGTAGTAATGGTGGTACAATTACTCTAGATATGAGTAATAACGTCACAGCTTGGTTTGCAGGTAGCCGACAAGCGACGGG
CCAATGGGCACAAGCTCATGTTCCTTTCACTGTCAGTGTAAGTGGCGCAAAAGCCCCTGTAACTATCACGGTTGCGTTAG
CAGATGATTTGACTGCTCGTGAGAAACATGAAGTTGGTCTAAAGCGACCACCAAGAATGACTAAGTCATTTATTATTGGA
GGTAATAAAGCGACAAGTGAAACAATCACGGTACCTTACGGCGGTTTGATTTATGCTCAAGGTGGTAATTCTGAGAGTGT
ACAATTAACCTTCACGGGAACATTGGCGGCACCACTGTTCATTGATGGCAGTTGGAAAAATGATTTAGATTCTCCAGCAC
CTGTTGGGGAGGTGGTGAGTAAATCATTCATCTATACTGGTCCTAAAGCTAACCTTCGTGCAGAAAACTACCCAGGAGGT
ATTGAGCAGTTTGCTAAGGATTTGGATCAGTTCGCATCGGATCTCAATGATTTTTATGCGAGAGATGAAGGCCTTGATGG
CCAAGCTAACCGTAAAGTGACAGGGGATGAGAATCCAAACAGCCGGCATCATTTTGTCAACGATGTGGCAATTAGTATTG
GTGCTGCGCACTCAGGTTATCCTGTTATGAACAGTAGTTATAATCTTAACAGTAGCAATATTAACACGACACCACTGAAT
GATTGGTTGTTATGGCATGAAGTAGGGCACAACGCCGCGGAAGCACCATTTGTTGTAGAAGGCGCGACAGAGGTAGTCAA
TAACTTGCTAGCTCTATACATGCAAGATTTGCATATCGGCAAAATGACGCGTGTTGAGCAGGATATCCAGGTTGCCCCCG
AGTTTGTCAGAACTGAACATGGTCATGCTTGGGCTGCTGGAGGTGCCGCAGAGCGCTTAGTTATGTTCGCTCAGCTAAAA
GAGTGGGCTGAGAGTGAGTTTGATATTCGTGATTGGTACCAAGGAGAGCTACCAAGCTATTATTCTGAGGTTGAGGGTGT
AAAAGGTTGGAATCTGTTCAAGTTGATGCATCGCTTAACACGTAATGAGAGTGATGGCATTTTTTATCTTAAAAGTACTA
ATGCCTGCCGTTGGCAAGGATTAAGTAAGAGTGACCAACTAATGGTTTGTGCTTCTTATGCAGCACAAACTGATTTGTCT
GATTTCTTCTTAGCATGGAACCCGGGGGCAAGAAGCTTTATTTATCCAGGAAGCTCAGAGCCAAGTTATGAGGGAGGTGT
CACTCAGAAAGGTTTAGATGTAGTGAGAAAACTTGGATTGAAAAAGCCAAGTTTAGATCCGGAGGAGATTAACACTATTA
CTGTAAGGAAATGA

Protein sequence :
MKKTLLSTIVVSLLFGCNYDSYHTDTVVPELPPPPEATINVGLSLDGYLKFVNSSILCNGEPAEHFTVGPSDHVSCVLKA
DGTPIATYYSPFDSGTDNQARITLEYLKLTDAEEYKDSSTRVQNIQTLIKTMGTIHGDELDLSLEKTSHRLIFKNYLNNQ
LDVEIDTFKKLLQERLSNDSQTDKQPSTHTPEVEPAVTPGASSDLSQAFVSANAEKSLEYKPKELILTTGYLVDSFGRSV
NGIAYFTSKGRGLTGYKDGRLIGDGSLEFSWGDTINFGIDTFELGSTRGNKNTIKLQDLGSGNEGKNIESLVMRFSEEND
QSVFVTDKVTEVFSKYPNVINEAISLSLSNEDIQLDLGNGNTEVVKGEFEKQFESGLAEEIDKELGRQKLAFGEQYRKPK
QIKAVDSDAQNVQRDVERLWGATQQAQREGWKPVERFHIFHDSTNFYGSTGSARAQAAVNISNKAFPVVMARNDKNYWLD
FDKPQAWDENGLAYITEAPSKVKPKKVDASNATFNLPFISIGDLGKGKVMVMGNARYNSVLVCPNGFSWNGGVNDQGQCT
GNTDSDDMANFFNNAFQYLTGKKAGTFSVATNIPHVYFKRGGQVLGSKASYLIDKRFAQDTQQLDSFSGLDPNDIPLVIL
NAYSYLGEQGGLGAYDLPMQANLDAPKLTQQDISDLIAYVEDGGSVLMMETIKGQKDSGVVSRLLDAAGIAFGIGESVAR
DGNGPNGGYPDRVRSQRQQGIWVLERYAAEDSSNGEGPSLPYVIKEDGSVEWKYIIENRPDDKPKLEVAKWTEINEQGDS
KVQVAFIDEANFYQDGTFDNEALTVAKNRILDAFKDNSGKRAYEECTNNEYHYEVNCLEYRPGNKIPISGGLYVPNYTEM
KLGEHEAKAMVKAANIGSNIEALYQHERYFRTKGKQGFRLNSVDMSRMYQNLSVWLWNDLRYSYDQEKNDELGFKRFTEF
LNCYTDDKAGGNTICPESLKLELQKMDMIYAEGEYAGYMNPSYPLNYMEKPLTRLMLGRSFWDLDVKVDTRQFPGVASSS
GSNGGTITLDMSNNVTAWFAGSRQATGQWAQAHVPFTVSVSGAKAPVTITVALADDLTAREKHEVGLKRPPRMTKSFIIG
GNKATSETITVPYGGLIYAQGGNSESVQLTFTGTLAAPLFIDGSWKNDLDSPAPVGEVVSKSFIYTGPKANLRAENYPGG
IEQFAKDLDQFASDLNDFYARDEGLDGQANRKVTGDENPNSRHHFVNDVAISIGAAHSGYPVMNSSYNLNSSNINTTPLN
DWLLWHEVGHNAAEAPFVVEGATEVVNNLLALYMQDLHIGKMTRVEQDIQVAPEFVRTEHGHAWAAGGAAERLVMFAQLK
EWAESEFDIRDWYQGELPSYYSEVEGVKGWNLFKLMHRLTRNESDGIFYLKSTNACRWQGLSKSDQLMVCASYAAQTDLS
DFFLAWNPGARSFIYPGSSEPSYEGGVTQKGLDVVRKLGLKKPSLDPEEINTITVRK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
VC0395_A0370 YP_001216326.1 lipoprotein Not tested VPI-1 Protein 0.0 62
acfD AAK20802.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 61
VC0845 NP_230493.1 hypothetical protein Not tested VPI-1 Protein 0.0 61
acfD ACK75664.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 60
acfD ACK75670.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 60
acfD ACK75652.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 60
acfD ACK75655.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 60
acfD ACK75649.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 60
acfD ACK75646.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 60
acfD ACK75661.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 60
acfD ACK75667.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 60
acfD ACK75658.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 60
unnamed CAE85238.1 hypothetical protein Not tested PAI V 536 Protein 0.0 50

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
VPA1376 NP_800886.1 hypothetical protein VFG0106 Protein 0.0 61