Gene Information

Name : Kvar_4478 (Kvar_4478)
Accession : YP_003441384.1
Strain : Klebsiella variicola At-22
Genome accession: NC_013850
Putative virulence/resistance : Unknown
Product : filamentous hemagglutinin family outer membrane protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4739572 - 4747434 bp
Length : 7863 bp
Strand : -
Note : TIGRFAM: filamentous haemagglutinin family outer membrane protein; adhesin HecA family; PFAM: filamentous haemagglutinin domain protein; Haemagluttinin repeat-containing protein; protein of unknown function DUF638 hemagglutinin/hemolysin KEGG: plu:plu1149

DNA sequence :
ATGCTGATGGTGGTGGCGGAAACGACCCGCTCTCATCGGGCGGGCGTGTCGCCGCAATCCGGCGCCGACGCGCGAACCGG
GTCGATGCTGACCTCCACCCTGGCGCCGCTGGCCTTTAGCTTTCTGCTCGCTTTTTCCTGTCTTACCCCGGCTCAGGCGG
CGATTGTCGCCGATAACCATGCGCCAGGCGGCCAGCAACCGCAGATCGCCAATAGCGCTAATGGCACGCCGCAGGTCAAT
ATTCAGACACCGAGCGGCGCAGGCGTTTCACGCAACGTTTACAGCCAGTTTGACGTCGATGGGCGCGGCGTGGTGCTGAA
TAACAGCCGTGCCAACACTTCAACCCAACTGGCCGGGATGGTCGCCGGGAACCCGAATCTGGCGAAAGGGGAAGCCAGAG
TGATCCTCAACGAGGTGAATACCCGCGATCCGAGCCGGCTGAATGGCTATATCGAGGTTGCCGGACAAAAAGCGCAGGTG
GTTATCGCTAACCCTGCCGGAATTAGCTGCGACGGCTGCGGGTTCATCAATGCCAACCGCGCAACGCTGACGACAGGCCA
GGTACAGTACGGTAACGGGCAGATTAGCGGTTATGATGTCAATCGCGGCGAAATTCTCGTACAGGGCGGCGGCCTTGATG
CGAGTTCGGTAGACAGTACCGACCTGCTCGCCCGCGCGGTGAAAATCAACGCCGGGGTTTGGGCGCAGGAGCTAAAGGTC
ACCGCAGGGCGTAATCAGATCGATGCGGCGCACAGCCGAACGACGGCAAAATCAGCCGACGGCAGTGCGCTGCCTGCGGT
GGCGATAGACGTCAGCGCGCTGGGCGGGATGTACGCTCATAAAATTCGTCTTGTCGGTACCGAGCGCGGCGTAGGCGTTC
ACAATGCCGGCAATATCGGCGCGGCGGCGGGGGATGTAGCAATCAGCGCTGATGGCGCGCTCAGCAACAGCGGCGTGATT
CAGTCCGCGCAGAATCTGCAACTGTCCGTTAAGGGCGATCTTCACAACCAGGGGCAGCTGTACGCCGGAAAAGATAGCGT
GGTGACGGCCAGCGCGACGCTAACCAATGACGGCATGATCGCCGCGCAAGGCGATACGCAGATTGCCGCCAACGCTTTGC
GCAGCACGCAAAACAGCACGCTCGCCGCCGGATTAAATAGCGATGGTTCTACCGCCAGCAGCGGCGCGCTGACGCTGAAT
AGCCAGAGCTCGCTGGCGCTGAATGGCCGCAATCTGGCGGCAGGAACGCTGACGGCGCAGGGCAGGACGGTGGATTTTGA
TAACAGTCGAACCTCGGGGGCGCGGATCGTGGTCAGCGCCGCGTCCGGCGATATCACCACCCGAGATGCGGTGGTGGTGG
CCAGCGAAAAACTGCAGTTGGCCGCCAGCGGCAAACTCAACAACCACAGTGGCCTGCTGACGGCCAGCCAGCTTGAGCTG
AAAGCAATGGCGCTGGATAACCAACAGGGCGTAATCCAGCAGACGGGCGAGGACGATCTACGCCTTGATTTTCGCGCGGG
ATTGGATAATCGCGGCGGTGAGATTGCCAGCAACAGCCATGCTTTGACGCTCAGCACCAGCCAGTTGCTCAATCAAAACG
GAACGCTGCTGCATACCGGCAGCGGCGGCATGTCTATTACCATTGATGGCGCGCTGGACAACGGCGAAGGCACTATTGCC
GCTAACGGCAATATCGTTTTGTACTCTGATAACATTAATAATCGCAGCGGTAAAATCAGCACGACGCAGGGCAATGCTCA
GCTCACTACCCGTCACGAGCTGGAAAATTCGCAAGGGAATATCGTTGCTGGCGGCAGCCTTTCTCTACAAGTGGCCAGCC
TTCGGAACCAGCAGGGGCAATTGATTGCGGCGCAGGGCGACCTGGCCATGAGCGCTGAAGGAGGGCTGGATAACGGCGAA
GGCGTTCTTGCCGCAAATGGCAATATCAAGCTTGACGCCGACAATCTGACCAACCATGGCGGAAAAATCAGCGCGGCGCA
GGGCGACATTCAGCTCACTGCGCGCCATGGGGTGGATAACTCGCAGGGGAATATCATCGCCAGCGGCGACATCCAACTAC
AGGCTGAAAATCTTAATAATCGTCACGGGCAGATAGGCACGGCGCAGCGGGGCAGCGTCAACCTGACGACGAGCGGTTTA
CTGGATAACCAGCAGGGCACAATCACCGCCTTTGATGCCCTGGGTATTCAGAGCGCCACCGTTGATAACCGGCAGGGAGA
ACTGCAGTCCGGCGGTAATCTCAACATCACCATTCACAACCGGGGGCTGGATAACCGACAGGGGCAGATCGTCTCCGCCA
CCGCTCTGGAAATAGCAGGCGTCAACCTGGCGTTGGCTAATACGGGCGGAACGTTACTTGCCGCGAGCAAACTTAGCCTC
GATGCCGATTCCCTCAGCGGCGATGGCGAGGTGCTCTCTCAGGGCGATATGTCGCTCACGTTGCGTCAGGCATTTCATAA
CGCCGGGAGGGTCATCGCCAACGGCAACCTGCAGTGGAATCTCTCCGGACTGGGACTGATTAATCAGGGGGTTATCAGCG
CCGGCCGGGCGCTCAATATCTATGCGGCAAAGCTGGATAACCGTCAGGAAGGTGAAATCAGCGCTGGCGAGAACCATTTG
ACGGTCAGCGGCGAGTTGGTCAATCGGGGGCTGATTGACGGTGGTTTAACCCATATTGTCGCCACGACGTTAACCAATAT
CGGCAGCGGGCGCCTGTACGGCGACGCCGTCGCGCTGCAGGCGGCAACGCTGACTAACGCCGCGGAGAACGGCGTGGCGG
CGACGATTGCCGCACGGGCATCACTGGCAATGGGCGTCGGCACGCTCAATAACCAGGATCACGCGCTAATCTACAGCGAC
GGTACGCTCGCTATTGGCGGCCAGCTTGCCGAAGACGGTTCGCTCAGCGGCTGGGCGGGAGTGTTTAATAACCACAGCGC
AACGCTTGAGTCCGCCGGTGATATGGCGCTCGATATCCAACAAATCAATAACTACAACGACCATCTGGTGACGAAAGATG
TGATGGTCGAGCAGTCATGGCGCCATGAAGCGGCGCTGAAAGGATCGGTACAGCGCTTCGACTGGTCGCTGGTCGATACC
AGCTATAAAAACAAATACGGCGTTCACGACGCCATCATGCCGGACGGTAGCCGCGGCGATGAGTTCTATGAATACCAGTA
CCAGCGTACCGTTGTGGAAACGCAGGTGGTGGAAAGCGATCCTGGCAAAATCCTCTCCGGCGGCCAGCTCATCATCAACA
GCGATAAGCTGAATAACTACGATAGCCAGATTATCGCCGGAGGAGCGCTCGGTGGCGTGATTGGCGAACTAAACAACGTC
GCCACGACCGGTAAACGGGTGACCACCGACGTCGGCACCCAGACCCGTTGGTATGAGAAAAAGACCAGCCGCCCGTTTGG
CGGCACCAAAACCAGCCAGGGGAAAAAAAGCAGCGAATATGAACCGACGCCGACGGTTCAGACCATCGATTTGCAAACCA
TGAAGTGGCAGGGCAATACCCAAATTGATGGCCATAGCGGTGTGATTAATCCGCGCGATCGCGCCGATGAAACAGGCGAA
CTACCCGCCGGACGGCTGGTGGAAGTGACGCCGGTTAACGCTGATGGAACGGTTATTCGCGTCGTCACGCCGGATACCCG
GCTTCCTGTCAGCAGCCTGTATCAGATCGATCCGCAGGCTAAAGCAGGCTATCTGGTGGAAACCGATCCGCGTTTTACCA
ACGGTAAAGCGTGGCTCGCCAGCGACTACATGCAAAATCAGCTGGGCATTGATCAGGCGATGAAGCGGCTTGGCGACGGC
TATTACGAACAGCGGCTGGTACGTGAGCAGATCGTTAAGCTGAGCGGCGGGCGCTACCTGCAGGGCTACAGTAACGATGA
AGAGCAGTATCGGGCACTGATGGACGCGGGCGTCGCCTTCGCCAAACAGTACAACCTGACGGTCGGGGTGGCGCTGACGC
CAGCGCAGATGGCGCTGTTGACCAGCGATATGGTCTGGCTGGTCGCCAGAGAGGTCACGCTAACGGATGGCTCCGTGCAG
CAGGTGCTGGTGCCGCAGGTTTACGCGCGGGTTAAAGCGGGCGATCTCGATGGCAGCGGCGCGTTGCTCGGCGGTGAAAA
CGTGGCGTTCAGCGTGAGCCGTGATGTGATCAACAGCGGGCACATCCAAAGCCGCGGCGTGACGCAGCTAACCGCGGAAA
ACATTCATAACAGCGGCTATATCGGCGGCAACCAGCTCACGCTAAACGCGCGTACTGATATCAACAATATTGGCGGAACG
CTGCAGGGCGGCGATAGCCTGATCGCGCAAGCCGGGCGTGATATCAACAGCGCCAGCACCCTCAGCGGAGGCCCGGGTAA
CATCAGCCTCGATCGGCCTGCGGGGATTTATGTCCAGAATGAAAACGGCCAGCTGGGGCTGCAGGCGTTACATAACATTA
ATCTGACGGCAAGCATGGTGAGTAACAGCGCGGCGGGCAGCCAGACGCAGATTATCGCCGGCAACGATCTCAATCTGCAG
ACGCTCGCCACCACCCATAGCGAAAGCGGAAACTGGGGGAAAGGCAACGATCGTTCCTTGACCCAGCGTAGCGATCTCGG
CACGCAAATTAACGGCGGCGCGGTGGCGCTGTCCGCAGGCCACGATATTCATGCCCGCGCCGCCAGCGTGACGGCGACCA
GCAGCCTGACGGTTGCCGCCGGCAATGATATTAACCTCAGCAGCGGCGAGTCCTCATGGCACCTGACGGAAAACAGCCAT
CAAAGCAGCAGCGGCCTGCTGGCTCGCCGCTCGCTCACCACTCACGATGAAGTGTGGGCGCAGAACGCCATCGGCAGCAA
TTTTAGCGGTGACAGTATTGTGATGCAGGCCGGGCGCGACCTGCTGGTGTCCGGCAGCAGTGTTGCCGGCACCCAGGACG
TCAACCTTGCGGCGGGGCGTAACCTGACCATCACCACCGCCGAGGAGCGTCGCCAGGAAAACCATCTACGTAAAGAGAAA
CATAGCGGCTTTTCCGGCACCGGCGGCGTCGGCTTCAGCGTCGGCAGTTCGTCGCTGAAAGCCACCGATGTGACGACGGC
GTTGAGTAGCGCCGCCAGTACCGTCGGAAGCTCACAGGGCAACTTGAGCCTGAGCGCCGGTAATGTATTGACGGTTCAGG
GATCTGACCTGGTGGCCGGCAACAATATGGCGTTGACCGGTAAAACCGTGAATATTCTGGCAGCGGAAAACCAGAGCACC
CAGACCCATACCGTGGAGCAGAAAACCAGCGGCCTGACGCTGGCGTTATCCGGGATGGTGGGCAGCGCCATTAATACCGC
GGTTTCCAGCGCCAACCAGGCCAGCACCGAGAGCAATGGCCGTCTCGCCGCGCTGAGCGGGCTGCAGTCGGCATTATCGG
GCGTACAGGCGTATCAGGCATCGCAGATGCAGACTGCGGATTCCAGTCCGGAGAGCATGATAGGGGTCAACCTTTCCTGG
GGTAGCCAGTCCTCGAAATCCACTCAGCGACAAACGCAGAACACCAGCCGCGGCAGCAGCCTGATGGCCGGCAACAATCT
CAGCATTATCGCCACGGAGACGGATATCAACGTTGAGGGCAGCCAGCTGCAGGCCGGCGGCAGCGCGTTGCTCAATGCCG
CGCGCGATGTGAATTTATTCTCTGCGGAGAACGCCAGCACCCTGAGCGGTAAAAACGAGAGCCATGGGAGTTCGTTTGGC
GTCGGTATTAACTTCGGTCAGGGCGCGAACGGGCTGACGGTCAGCGCCAGCGCCAACGCCGGGAAGGGCCACGAGAAGGG
TAACAGCCTGACGCATAACGAAACGACCCTCAGCGCCGGTGAGCGGGTGACTATCGTCAGCGGTCGCGATACGACGCTGA
CCGGGGCGCAGGTGAGCGGCCATCAGGTGACAATGGACGTCGGGCGTAACCTGACGCTGAGCAGCGAGCAGGACAGCGAT
AACTACGATTCGAAGCAGCGTAGCGGCAGCGTGGGGGCGAGCGGCAGCATGGGCGGCGGTTCGGGGTCTCTTAACCTGAG
CCAGAGCAAAATGCACAGCACCTGGGCGTCGGTGGAGGCGCAGACGGGGATTTTTGCCGGCGAGGGCGGTTTTGATGTGA
AGGTGGGCGGGCATACGCAGCTGAACGGCAGCGTGCTGGCGAGCACGGCGGCGGCGGAGCTGAACCGGCTGGATACCGGG
ACGCTGGGTTTCCGGGATATTAAAAACTATGCCGAGTACAGCGTGGAGCAGCAGAGCGAGGGAGTGAGCACGAGCGGTAG
CGTGGCGGGGCAGTTCCTCGGGAATGCAGCCAGCGGTCTGCTGATGGGGGCGAACGGCAGCGGCAGCGACAGTTCGCTGA
CGCGGTCGGCGGTGAGCGAGGGCAGTATCGTTATCCGCGACGGCGCGAATCAGCAGCAGGACGTGACGGGGTTAAGCCGG
GATGCGGCGCATGCGAACCAGACGCTGAGTCCGATATTTGATAAGGAGAAAGAGCAGAACCGTCTGGCGACAGCGCAGAA
GACAGGTGAAATCGGGCGCCAGGTGAGTGATGTGCTGGTGACGCAGGGCAAGCTGAACGCGCAGGCGGCGCAGAGCGACC
CGGCGGCGCGGGCAGCGGCGCGGGCGAAGCTGGTAGCGGGAGGGAATGGCAGTCCGAGCGAGGAGCAAATCAACGCGCAG
GTGAGCCGAACGGCGACGGCGGACTACGATACGGGCGGAAAGTACCAGAAGGTGGCGCAGGCGGTAACGGCGGCGATGCA
GGGTCTGGCGGGGGGCGACCTGGCGCAGGCGGCGAGTGGCGCAGTGAGCCCGTATGTGGCGGAAATCATTCACAGCCAGA
CGACGGACAGCGCGACGGGCAAGGTGAATGTGGAAGCCAATGCGATGGCGCACGCGGTGTGGGGAGCGATAGCGGCGGCA
TCGGGAAATAACAGCGCGCTGGCGGGAGCGGCAGGCGCAGTGAGTGGCGAGCTGCTGGGGCGCTGGATAGCGGCGGAATA
TTATCCGGGTGTTAAGACAGAAGAGCTGTCGGATGAGCAGAAGTCGACGATAAGCGCGCTGAGTACGCTGGCGGCGGGAC
TGATGGGGGGGCTTAGCGGAGGCAGCAGCGCGGATGCGGTGGCGGGTGCGCAGGCCGGGAAGAATGCGGTGGAGAATAAC
TTACTGAGCGGCAGTGAAGATGCGCAGGCGGCATGGTTGCGCCAGCACGGCATCGATATGGCGAGTTGTTCCGATAACCC
AGGCGGGTCGGCATGTCAGAAAGCGATAAATGAGCGTAATGCGGTAGGACTTGCGCTGGCATCGGGTAGTGTGGCCCTGT
TACCAGGTGGTGCTCAGGCAATGTGGGGACTGGGAGCAAGTGCAAATGCCGGTATCAGTTATCTGGCAGATGGCACGATA
GATCCGGCAAATGCGACTATTGCTGGCTGGGTAAACGTGCTCAGTATGGGGAATGGTCTGGCAGGTACGGTAGGCTGGAA
TGCCGCAGGTGGTGCGCTGGGCAACTGGATAGATGACAAAGATCCACTGTCTGGTGCGCTTATCAATGGTGCCGGTTCGG
GTATTGGCTATGGCATTGGTAAAGGACTCTCATGGGGTGTTAATGCCGGGGCAAACTGGTGGAAAGGAGGCTGGGATCCG
AAGTTTAACGCTGAATTGAGGCAGTTTACTGAAATTAAGGGGGATTTTGGTATTTCGAAGGAAATGACTCCGAGCCGCGT
TCCTGGGGCTTTTGGTGATTTTGGTGGTTCATTCTTCTCGGAAATAACAGGTAAAGGTATAGAGAAGCGCGCGGATTCTA
TGGGGGAGAGAAAAAATGATTAA

Protein sequence :
MLMVVAETTRSHRAGVSPQSGADARTGSMLTSTLAPLAFSFLLAFSCLTPAQAAIVADNHAPGGQQPQIANSANGTPQVN
IQTPSGAGVSRNVYSQFDVDGRGVVLNNSRANTSTQLAGMVAGNPNLAKGEARVILNEVNTRDPSRLNGYIEVAGQKAQV
VIANPAGISCDGCGFINANRATLTTGQVQYGNGQISGYDVNRGEILVQGGGLDASSVDSTDLLARAVKINAGVWAQELKV
TAGRNQIDAAHSRTTAKSADGSALPAVAIDVSALGGMYAHKIRLVGTERGVGVHNAGNIGAAAGDVAISADGALSNSGVI
QSAQNLQLSVKGDLHNQGQLYAGKDSVVTASATLTNDGMIAAQGDTQIAANALRSTQNSTLAAGLNSDGSTASSGALTLN
SQSSLALNGRNLAAGTLTAQGRTVDFDNSRTSGARIVVSAASGDITTRDAVVVASEKLQLAASGKLNNHSGLLTASQLEL
KAMALDNQQGVIQQTGEDDLRLDFRAGLDNRGGEIASNSHALTLSTSQLLNQNGTLLHTGSGGMSITIDGALDNGEGTIA
ANGNIVLYSDNINNRSGKISTTQGNAQLTTRHELENSQGNIVAGGSLSLQVASLRNQQGQLIAAQGDLAMSAEGGLDNGE
GVLAANGNIKLDADNLTNHGGKISAAQGDIQLTARHGVDNSQGNIIASGDIQLQAENLNNRHGQIGTAQRGSVNLTTSGL
LDNQQGTITAFDALGIQSATVDNRQGELQSGGNLNITIHNRGLDNRQGQIVSATALEIAGVNLALANTGGTLLAASKLSL
DADSLSGDGEVLSQGDMSLTLRQAFHNAGRVIANGNLQWNLSGLGLINQGVISAGRALNIYAAKLDNRQEGEISAGENHL
TVSGELVNRGLIDGGLTHIVATTLTNIGSGRLYGDAVALQAATLTNAAENGVAATIAARASLAMGVGTLNNQDHALIYSD
GTLAIGGQLAEDGSLSGWAGVFNNHSATLESAGDMALDIQQINNYNDHLVTKDVMVEQSWRHEAALKGSVQRFDWSLVDT
SYKNKYGVHDAIMPDGSRGDEFYEYQYQRTVVETQVVESDPGKILSGGQLIINSDKLNNYDSQIIAGGALGGVIGELNNV
ATTGKRVTTDVGTQTRWYEKKTSRPFGGTKTSQGKKSSEYEPTPTVQTIDLQTMKWQGNTQIDGHSGVINPRDRADETGE
LPAGRLVEVTPVNADGTVIRVVTPDTRLPVSSLYQIDPQAKAGYLVETDPRFTNGKAWLASDYMQNQLGIDQAMKRLGDG
YYEQRLVREQIVKLSGGRYLQGYSNDEEQYRALMDAGVAFAKQYNLTVGVALTPAQMALLTSDMVWLVAREVTLTDGSVQ
QVLVPQVYARVKAGDLDGSGALLGGENVAFSVSRDVINSGHIQSRGVTQLTAENIHNSGYIGGNQLTLNARTDINNIGGT
LQGGDSLIAQAGRDINSASTLSGGPGNISLDRPAGIYVQNENGQLGLQALHNINLTASMVSNSAAGSQTQIIAGNDLNLQ
TLATTHSESGNWGKGNDRSLTQRSDLGTQINGGAVALSAGHDIHARAASVTATSSLTVAAGNDINLSSGESSWHLTENSH
QSSSGLLARRSLTTHDEVWAQNAIGSNFSGDSIVMQAGRDLLVSGSSVAGTQDVNLAAGRNLTITTAEERRQENHLRKEK
HSGFSGTGGVGFSVGSSSLKATDVTTALSSAASTVGSSQGNLSLSAGNVLTVQGSDLVAGNNMALTGKTVNILAAENQST
QTHTVEQKTSGLTLALSGMVGSAINTAVSSANQASTESNGRLAALSGLQSALSGVQAYQASQMQTADSSPESMIGVNLSW
GSQSSKSTQRQTQNTSRGSSLMAGNNLSIIATETDINVEGSQLQAGGSALLNAARDVNLFSAENASTLSGKNESHGSSFG
VGINFGQGANGLTVSASANAGKGHEKGNSLTHNETTLSAGERVTIVSGRDTTLTGAQVSGHQVTMDVGRNLTLSSEQDSD
NYDSKQRSGSVGASGSMGGGSGSLNLSQSKMHSTWASVEAQTGIFAGEGGFDVKVGGHTQLNGSVLASTAAAELNRLDTG
TLGFRDIKNYAEYSVEQQSEGVSTSGSVAGQFLGNAASGLLMGANGSGSDSSLTRSAVSEGSIVIRDGANQQQDVTGLSR
DAAHANQTLSPIFDKEKEQNRLATAQKTGEIGRQVSDVLVTQGKLNAQAAQSDPAARAAARAKLVAGGNGSPSEEQINAQ
VSRTATADYDTGGKYQKVAQAVTAAMQGLAGGDLAQAASGAVSPYVAEIIHSQTTDSATGKVNVEANAMAHAVWGAIAAA
SGNNSALAGAAGAVSGELLGRWIAAEYYPGVKTEELSDEQKSTISALSTLAAGLMGGLSGGSSADAVAGAQAGKNAVENN
LLSGSEDAQAAWLRQHGIDMASCSDNPGGSACQKAINERNAVGLALASGSVALLPGGAQAMWGLGASANAGISYLADGTI
DPANATIAGWVNVLSMGNGLAGTVGWNAAGGALGNWIDDKDPLSGALINGAGSGIGYGIGKGLSWGVNAGANWWKGGWDP
KFNAELRQFTEIKGDFGISKEMTPSRVPGAFGDFGGSFFSEITGKGIEKRADSMGERKND

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
S4 AAQ19127.1 putative adhesin/hemagglutinin/hemolysin Not tested PAI I CL3 Protein 0.0 45