Gene Information

Name : asa1 (EFA0047)
Accession : NP_816972.1
Strain :
Genome accession: NC_004669
Putative virulence/resistance : Virulence
Product : aggregation substance Asa1
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 35117 - 39007 bp
Length : 3891 bp
Strand : -
Note : similar to GB:J03600, GB:J03571, SP:P09917, PID:187167, PID:187169, PID:187193, PID:388525, GB:J03600, GB:J03571, SP:P09917, PID:187167, PID:187169, PID:187193, and PID:388525; identified by sequence similarity; putative

DNA sequence :
ATGAAGCAACAAACAGAAGTAAAGAAACGTTTTAAAATGTATAAGGCAAAGAAGCATTGGGTGGTAGCCCCTATTCTTTT
TATAGGTGTGTTAGGAGTTGTAGGATTAGCTACTGATGATGTACAAGCTGCGGAATTAGATACGCAACCAGGAACAACGA
CGGTGCAACCCGATAACCCCGATCCGCAGGTAGGTAGTACAACACCTAAGACAGCAGTAACTGAAGAAGCAACAGTACAA
AAAGACACTACTTCTCAACCGACCAAAGTAGAAGAAGTAGCGTCTGAAAAAAATGGAGCTGAACAGAGTTCAGCTACTCC
AAATGATACCACAAACGCGCAACAACCAACAGTAGGAGCAGAAAAATCAGCACAAGAACAACCAGTAGTAAGCCCTGAAA
CAACCAATGAACCTCTAGGGCAGCCAACAGAAGTTGCACCAGCTGAAAATGAAGCGAATAAATCAACGTCCATTCCTAAA
GAATTTGAAACACCAGACGTTGACAAAGCAGTTGATGAAGCAAAAAAAGATCCAAACATTACCGTTGTTGAAAAACCAGC
AGAAGACTTAGGCAACGTTTCTTCTAAAGATTTAGCTGCAAAAGAAAAAGAAGTAGACCAACTACAAAAAGAACAAGCGA
AAAAGATTGCGCAACAAGCAGCTGAATTAAAAGCCAAAAATGAAAAAATTGCCAAAGAAAATGCAGAAATTGCGGCAAAA
AACAAAGCGGAAAAAGAACGCTACGAGAAAGAAGTCGCGGAATACAACAAACATAAAAATGAAAATGGCTATGTAGCAAA
ACCAGTAAATAAAACGCTAATTTTCGATCGTGAAGCAACAAAAAATTCCAAAGTTGTTTCTGTAAAAGCTGCAGAATATA
TAGACGCTAAAAAACTAACTGATAAACATAAAGATAAAAAATTACTTATCAGTATGCTTAGTGTAGATTCAAGCGGGTTA
ACAACTAAAGACTCGAAAAAAGCACATTTTTATTATAATAACGGTGCAGGAGGAACATTGGTTGTTCTTCACAAAAATCA
ACCAGTAACTATTACCTATGGCAATTTGAATGCTAGTTATTTGGGTAAAAAAATTGCTAGTGCTGAATTCCAATATACAG
TGAAGGCCACACCTGATTCAAAAGGTCGATTGAATGCTTTCTTACATGATGATCCAGTGGCCACAATTGTCTATGGAATT
AACATTGACCCTCGTACAAAGAAGGCTGGTGCTGAGATTGAAATGCTCGTTCGCTTCTTTGGAGAAGATGGCAAAGAAAT
CTTGCCAACGAAAGAGAATCCATTTGTATTTTCAGGTGCTTCATTAAATTCACGTGGTGAAAACATTACGTATGAGTTCG
TAAAAGTAGGAAACACGGATACTGTTCATGAAATTAATGGATCAAAAGTAGCTCGTCATGGAAATAAAGTTTATTCTAAA
ACGGATATTGATGTAGGGACGAATGGGATTTCAATAAGTGACTGGGAAGCAGTTCAAGGCAAAGAATATATTGGCGCAAC
TGTTATTTCAACACCAAATAGAATTAAATTCACTTTCGGGAATGAAATTGTTAACAATCCAGGGTATGACGGAAATTCGA
TGTGGTTCGCATTTAATACGGATTTAAAAGCAAAATCAATTACGCCTTATCAAGAAAAAGGACGTCCAAAGCAACCAGAA
AAAGCAACGATTGAATTCAATCGATACAAAGCCAATGTGGTTCCTGTTCTTGTTCCGAATAAAGAAGTCACTGATGGCCA
GAAAAATATCAATGATTTAAATGTGAAACGTGGCGATTCTTTACAATACATTGTGACAGGGGATACGACAGAACTTGCCA
AAGTAGATCCAAAAACAGTAACAAAACAAGGGATTCGAGATACCTTTGATGCAGAAAAAGTGACGATTGATTTATCCAAA
GTGAAAGTTTATCAAGCAGACGCAAGTCTAAACGAAAAAGACTTAAAAGCTGTTGCTGCAGCAATTAATTCAGGAAAAGC
TAAAGACGTGACCGCTTCTTATGACCTTCATTTAGACCAAAACACCGTTACAGCAATGATGAAAACCAACGCAGACGACT
CTGTTGTTTTAGCAATGGGGTATAAATATTTACTTGTCTTGCCATTTGTAGTGAAAAATGTAGAAGGCGATTTTGAAAAT
ACAGCTGTTCAATTAACAAACGATGGGGAAACGGTAACAAATACAGTGATTAACCATGTGCCAGGTAGTAATCCTTCCAA
AGATGTAAAAGCAGATAAAAACGGTACAGTTGGCAGTGTTTCTCTACATGATAAAGATATTCCGTTACAAACAAAAATTT
ATTATGAAGTGAAATCTTCCGAACGTCCAGCCAACTATGGCGGAATCACAGAAGAATGGGGCATGAATGATGTCTTGGAC
ACGACCCATGATCGTTTCACAGGAAAATGGCACGCTATTACGAACTATGACCTTAAAGTAGGGGATAAAACGTTAAAAGC
AGGAACAGATATTTCTGCCTACATTCTTTTAGAAAACAAAGACAATAAAGACTTGACGTTTACAATGAATCAAGCATTAT
TGGCAGCGTTAAATGAAGGAAGCAATAAAGTAGGCAAACAAGCTTGGTCTGTGTATCTGGAAGTCGAACGGATCAAAACA
GGTGACGTAGAAAACACGCAAACAGAAAACTACAACAAAGAGCTTGTTCGTTCTAATACGGTGGTGACACATACGCCTGA
TGATCCAAAACCAACCAAAGCCGTTCATAACAAAAAAGGGGAAGACATTAATCATGGAAAAGTGGCTCGTGGTGATGTTC
TTTCTTATGAAATGACTTGGGACTTAAAAGGGTACGATAAGGACTTTGCCTTTGATACAGTCGATCTTGCGACAGGCGTT
TCTTTCTTCGATGATTACGATGAAACGAAGGTGACACCAATCAAAGACTTACTTCGTGTCAAAGATTCTAAAGGGGAAGA
CATTACGAACCAGTTCACGATCTCTTGGGATGATGCCAAAGGCACGGTGACGATTTCTGCCAAAGACCCACAAGCCTTTA
TTTTGGCGCATGGTGGGCAAGAATTACGTGTAACTTTACCAACAAAAGTTAAAGCCAATGTTTCTGGTGATGTGTATAAT
TTAGCGGAACAAAATACATTTGGTCAACGAATTAAAACCAATACCGTTGTCAACCATATTCCAAAAGTGAACCCTAAAAA
AGACGTGGTTATTAAAGTCGGTGATAAACAAAGTCAAAATGGTGCCACAATCAAATTAGGGGAGAAATTCTTCTATGAAT
TTACAAGTAGTGACATTCCTGCAGAATACGCTGGTATTGTGGAAGAATGGTCGATTAGCGATAAACTAGACGTCAAACAT
GACAAATTTAGTGGCCAATGGTCTGTGTTTGCCAATTCTACGTTTGTCTTAGCAGACGGAACCAAAGTGAATAAAGGGGA
CGACATTTCGAAACTATTCACGATGACCTTTGAACAAGGGGTAGTGAAAATCACAGCCAGTCAAGCCTTTTTAGATGCGA
TGAATCTAAAAGAAAACAAAAACGTTGCACACTCATGGAAAGCGTTCATTGGTGTAGAACGAATTGCGGCAGGAGACGTT
TACAACACAATCGAAGAATCTTTCAACAATGAGAAGATTAAAACTAATACGGTAGTGACGCATACGCCAGAAAAACCACA
AACGCCACCAGAAAAAACAGTGATTGTACCACCAACACCAAAAACACCGCAAGCACCAGTAGAGCCATTAGTGGTAGAAA
AGGCAAGTGTGGTGCCAGAATTGCCGCAAACAGGCGAAAAACAAAATGTCTTATTAACGGTAGCTGGTAGTTTAGCCGCA
ATGCTTGGCTTAGCAGGCTTAGGCTTTAAACGTAGAAAAGAAACAAAATAA

Protein sequence :
MKQQTEVKKRFKMYKAKKHWVVAPILFIGVLGVVGLATDDVQAAELDTQPGTTTVQPDNPDPQVGSTTPKTAVTEEATVQ
KDTTSQPTKVEEVASEKNGAEQSSATPNDTTNAQQPTVGAEKSAQEQPVVSPETTNEPLGQPTEVAPAENEANKSTSIPK
EFETPDVDKAVDEAKKDPNITVVEKPAEDLGNVSSKDLAAKEKEVDQLQKEQAKKIAQQAAELKAKNEKIAKENAEIAAK
NKAEKERYEKEVAEYNKHKNENGYVAKPVNKTLIFDREATKNSKVVSVKAAEYIDAKKLTDKHKDKKLLISMLSVDSSGL
TTKDSKKAHFYYNNGAGGTLVVLHKNQPVTITYGNLNASYLGKKIASAEFQYTVKATPDSKGRLNAFLHDDPVATIVYGI
NIDPRTKKAGAEIEMLVRFFGEDGKEILPTKENPFVFSGASLNSRGENITYEFVKVGNTDTVHEINGSKVARHGNKVYSK
TDIDVGTNGISISDWEAVQGKEYIGATVISTPNRIKFTFGNEIVNNPGYDGNSMWFAFNTDLKAKSITPYQEKGRPKQPE
KATIEFNRYKANVVPVLVPNKEVTDGQKNINDLNVKRGDSLQYIVTGDTTELAKVDPKTVTKQGIRDTFDAEKVTIDLSK
VKVYQADASLNEKDLKAVAAAINSGKAKDVTASYDLHLDQNTVTAMMKTNADDSVVLAMGYKYLLVLPFVVKNVEGDFEN
TAVQLTNDGETVTNTVINHVPGSNPSKDVKADKNGTVGSVSLHDKDIPLQTKIYYEVKSSERPANYGGITEEWGMNDVLD
TTHDRFTGKWHAITNYDLKVGDKTLKAGTDISAYILLENKDNKDLTFTMNQALLAALNEGSNKVGKQAWSVYLEVERIKT
GDVENTQTENYNKELVRSNTVVTHTPDDPKPTKAVHNKKGEDINHGKVARGDVLSYEMTWDLKGYDKDFAFDTVDLATGV
SFFDDYDETKVTPIKDLLRVKDSKGEDITNQFTISWDDAKGTVTISAKDPQAFILAHGGQELRVTLPTKVKANVSGDVYN
LAEQNTFGQRIKTNTVVNHIPKVNPKKDVVIKVGDKQSQNGATIKLGEKFFYEFTSSDIPAEYAGIVEEWSISDKLDVKH
DKFSGQWSVFANSTFVLADGTKVNKGDDISKLFTMTFEQGVVKITASQAFLDAMNLKENKNVAHSWKAFIGVERIAAGDV
YNTIEESFNNEKIKTNTVVTHTPEKPQTPPEKTVIVPPTPKTPQAPVEPLVVEKASVVPELPQTGEKQNVLLTVAGSLAA
MLGLAGLGFKRRKETK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
EF0485 NP_814266.1 aggregation substance Virulence Not named Protein 0.0 82
ef0005 AAM75211.1 EF0005 Not tested Not named Protein 0.0 82

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
asa1 NP_816972.1 aggregation substance Asa1 VFG2173 Protein 0.0 100
asa1 NP_816972.1 aggregation substance Asa1 VFG2164 Protein 0.0 83
asa1 NP_816972.1 aggregation substance Asa1 VFG2172 Protein 0.0 82
asa1 NP_816972.1 aggregation substance Asa1 VFG2171 Protein 0.0 81