Gene Information

Name : AHA_1845 (AHA_1845)
Accession : YP_856379.1
Strain : Aeromonas hydrophila ATCC 7966
Genome accession: NC_008570
Putative virulence/resistance : Virulence
Product : ImcF family protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3523
EC number : -
Position : 2014151 - 2017636 bp
Length : 3486 bp
Strand : +
Note : identified by match to protein family HMM PF06744; match to protein family HMM PF06761
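
The Position and Length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the coding sequence includes its terminal stop codon, as the DNA sequence in this record does (it ends in TGA). The helper names `span_length` and `residues_encoded` are illustrative only and are not part of any database tool.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def residues_encoded(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of the given length.

    Assumes the sequence is an uninterrupted open reading frame; the stop
    codon, if present, does not code for a residue.
    """
    if length_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Coordinates taken from the Position field of this record.
length = span_length(2014151, 2017636)
print(length)                    # 3486, matching the Length field
print(residues_encoded(length))  # 1161 residues expected in the protein
```

Under these assumptions the coordinates reproduce the stated length of 3486 bp, which corresponds to 1162 codons, the last of which is the stop.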

DNA sequence :
ATGTTCAAAACCATTTTTACTTTTCTGCGGCAACAGCTGCCGAAATTGAAACCCTCCTGGCCCCTGCTCGGTGCTGTGCT
CTGGGTGCTGGCTCTCATCCTGGTGTGGTGGCTGGGCCCTCGTCTCGAGGTGCGCGGCGCCAAACCGTTCGAGCCGTTGT
GGGGGCGGGTGGTGTTCACCCTGCTCTGGCTCTGGTTGCTGCTGGGAGTGGTCTCCTGGCGGGTGTGGCGCAAGATGCAA
CAGCTCAAGGCCGAACGCCAGCACGAGGTGGTGCTGGAGCAGGACCCGGTCAAGGGGCTCATCGACAGGCAGGCGCTGTT
CCTCGATCGCTGGCTGCAGGCGTTGAATACCCATCTGGGCAAGGGGGCGCTCTACGCCATGCCCTGGTATCTGGTGCTGG
GGCTGCCCGGCAGCGGCAAGAGCAGCCTCATTCACCGCGCCAACCCGGCCAACAAGCTCAACCCGAGACTCGATACCGAA
CTGCGCGACGTGGCGCAGGATCAGCTGGTGGATTGCTGGCTGGGGGAACAGGCGGTGATGCTGGATCCCGCCGGGGTGCT
GCTCTCCCAGAGCGAAGCCGAACTGGATCCGCAGGCCCGCAAGCATGAGCGGCTCTGGCTGCACCTGCTTGGCTGGCTCA
ATGAACACCGCCGCCGCCAGCCCCTCAACGGCCTGGTGCTGACCGTGGATCTGGCCTGGCTCTCCCACGCCAGCGTGGCC
GAGCGCAAGGCCTATGCCCAGCTGATGCGCTCGCGCCTGCAGGAGGTGTCGGCCACCATGAATACCCGGTTGCCGCTCTA
TGTCACCTTCACCAAGCTCGATCTGCTGCGCGGCTTTGACGTTATCTACCAGCAGCTCGACAAGGAGGCCCGGGATGCCG
TGCTGGGGGTGACCTTCAAGCCAGGCGCCGACTGGCAGCAGGATCTGGCCCTGTTCTGGGATCAGTGGGTGGACAACCTC
AACCAGAATCTGCCCGAACTGATGCTGAGCCGGCTCGATGCCGCCCAGCGCAACGCGCTGTTCTCGTTCGTGCGCCAGCT
GGCGGGCCTCAAGGATTACGTGACCAGCCTGCTGGCCGAGACCCTGGCCATCGAGGAGAGCAAGCCGCTGCTGGTGCGCG
GGGTTTATGTCAGCTCCGTCTATCAGCAAGGGGTGCCGTTCGATGCGTTTGCCCAGGCGGCCTCCCGCCGCTACAACTTG
CCAGAGCCCATCCACTCGGCCCTGCGTGGGGAGTCCAACACCTACTTCGTGCGCCAGCTGTTCTCCTCCATCATCTTCCC
GGAGGCCCATCTGGCCGGCGAAAACCGGCTGCACACCCTCTATCGCCGGCGCCGCATGGCCATCGGCCTCAGCTGCCTGT
CGCTCTTCTCGGCCGCCCTCATCGGTGGCTGGCACTACTTCTACCGGGTCAACGAGGAGGCGGGCCGCAACGTATTGACC
AAGGCTCAGGCCTTCATGGAGACCAACGAGGTGGCCGACGCCCACGCCTTCGGCGTGAGCCAGCTGCCGCGCCTCAATCT
TATTCGCGAGGCGACCCTCTCCTTTGGCAACTACCGCGAGCGGATGCCGCTGGTGGCGGATCTCGGCCTCTACCAAGGGG
ACGAGATTGGTCCTTATGTGGAGGGCTCCTACCTGCAACTGCTGAGCCTGCGTTTCCTGCCGGCCCAGATGCAGGGATTG
CTGGCGGATCTCAACCAGGCGCCTGCCGGCAGCGAGGAGAAGCTCGCCATCCTGCGGGTGATGCGGATGCTGGACGATGC
GTCCGGTCGCAACAAGGAGCTGGTGGCGCAGTACATGGCGAGCCGTTGGCAGAAGGCCTTCCCGGGGCAGGGGGCGGTAC
AGGAGCAGCTGATGGGGCACCTGGACTATGCCCTCGACCACACCAACTGGTATGGCGCCAGGGCCGAGCGGGATCAGGCC
GCCATCACCGCCTTCGTGCCGTTCAAGGAGCCGGTCTACGGTGCCCAGCGCGAGCTCGGCAAGCTGCCCATGTATCAGCG
GGTCTACCAGAACCTGGTGGTGAAGGCGGGCGATGTATTGCCGCCGGATCTCAACGTCCGTGACGAGGTGGGCCCCACCT
TTGACACCGTGTTTGCCCTGCGCAGCGACAACGCCGGCCAGGTGCCGCGGCTGCTCACCTGGCCCGGTTTCAACGACTTC
TTCCTGAAACAAGACAAGGCGCTCATCGATCTCACCGCGATGGATGCCTGGGTGCTGGGCCAGCGCAAGCTGAGCCAGTT
GAGCGAGGCAGATCGCAAGGAGATCACCCGTCAGGTCAACGACCGTTACGTCACCGATTACGTCAACCAGTGGCAAAAAC
TGCTGACCAATCTGGACGTGCAGACCCTGGAGAGCCCGGAGCAGGCGCTGGACGTGCTGGCCGCCATTACTGGCAACGAT
CAGCCGTTCCAACGGGTACTGGCCTCGCTGGATGACAACACCCGCATTCGCAAGATCTCCGATGTGGAAGGGGACCCGGC
CCAGGCCATCAGCGCCCGCATCGGTCGTCCCTTTATGGCCACTAATGGAGTATTGGCTGGACGTGGCGAGCAGGGACCGC
TCATTCAGGAGGTCAACCAGAAGCTGGTGGAGCTGCAGCACTACCTGGAGCTCATCGTCAACGCCACCGAGCCGGGCCAG
TCCGCCTTGAAAGCGGTGCAGCTGCGCATGACCAACAAGTACGCCGATCCGGTGTTCGCCCTGCAGCAATATGCCCGCAG
TCTGCCGGCCCCGCTCGATCGCTGGGTGGGTCAGCTCTCCGAGCAGAGCTCGCGGCTGGTGATCGACCTGGCCATGTCCT
CCCTCAATCAGGAGTGGCAGGACAAGGTGCTGACCCCCTTCAACAGCCAGCTGGCCGGTCGTTATCCGTTTGATCCGAGC
TCGAACAAGGATGTGCCCCTCTCCGAGATGGAGCGCTTCTTTGCGCCTAACGGGACGCTGGACAGCTTCTATCAGGTCAA
CCTCAAACCCATGGTGGAGAGCGGCCTGATGGAAGGGGAGTTCAGCTCGCCCATCCAGGCCGAGCTGGTCAAGCAGCTGG
ACCGTGCGGCCCGCATCCGTCAGATCTTCTTCAGCCAGCAGGGCAACCTGGAGGTGCAGTTCGCCCTCGAGCCCATCGAG
CTCACCGCCAACAAGCGGCGCAGTGTGCTGAACCTGGACGGCCAGCTACTGGAATATGCCCATGGCCGTCGTACCAAGAT
CCCGCTGGTGTGGCCCAATACCATGCGCGACGGGGCGGAGAGCAAGATCACTCTGGTGCCGGCCGCCCGCGAGCGCTCAC
CACGCAGCGAAGGCTTCGTCGGTCCCTGGGCAATGTTCCGTCTGATGGACAAGGGGGAGCTGACCCAGGTCAATGACGCC
ACCTTCGATGTGCGCTTCCCGGTGGATCAGGGGGCCATGACCTATCGCGTCTACACCGACAGCGCGCAAAACCCCTTCAC
CGGCGGCCTGTTCAGCCAGTTCAGACTGCCTGAATCCCTCTATTGA

Protein sequence :
MFKTIFTFLRQQLPKLKPSWPLLGAVLWVLALILVWWLGPRLEVRGAKPFEPLWGRVVFTLLWLWLLLGVVSWRVWRKMQ
QLKAERQHEVVLEQDPVKGLIDRQALFLDRWLQALNTHLGKGALYAMPWYLVLGLPGSGKSSLIHRANPANKLNPRLDTE
LRDVAQDQLVDCWLGEQAVMLDPAGVLLSQSEAELDPQARKHERLWLHLLGWLNEHRRRQPLNGLVLTVDLAWLSHASVA
ERKAYAQLMRSRLQEVSATMNTRLPLYVTFTKLDLLRGFDVIYQQLDKEARDAVLGVTFKPGADWQQDLALFWDQWVDNL
NQNLPELMLSRLDAAQRNALFSFVRQLAGLKDYVTSLLAETLAIEESKPLLVRGVYVSSVYQQGVPFDAFAQAASRRYNL
PEPIHSALRGESNTYFVRQLFSSIIFPEAHLAGENRLHTLYRRRRMAIGLSCLSLFSAALIGGWHYFYRVNEEAGRNVLT
KAQAFMETNEVADAHAFGVSQLPRLNLIREATLSFGNYRERMPLVADLGLYQGDEIGPYVEGSYLQLLSLRFLPAQMQGL
LADLNQAPAGSEEKLAILRVMRMLDDASGRNKELVAQYMASRWQKAFPGQGAVQEQLMGHLDYALDHTNWYGARAERDQA
AITAFVPFKEPVYGAQRELGKLPMYQRVYQNLVVKAGDVLPPDLNVRDEVGPTFDTVFALRSDNAGQVPRLLTWPGFNDF
FLKQDKALIDLTAMDAWVLGQRKLSQLSEADRKEITRQVNDRYVTDYVNQWQKLLTNLDVQTLESPEQALDVLAAITGND
QPFQRVLASLDDNTRIRKISDVEGDPAQAISARIGRPFMATNGVLAGRGEQGPLIQEVNQKLVELQHYLELIVNATEPGQ
SALKAVQLRMTNKYADPVFALQQYARSLPAPLDRWVGQLSEQSSRLVIDLAMSSLNQEWQDKVLTPFNSQLAGRYPFDPS
SNKDVPLSEMERFFAPNGTLDSFYQVNLKPMVESGLMEGEFSSPIQAELVKQLDRAARIRQIFFSQQGNLEVQFALEPIE
LTANKRRSVLNLDGQLLEYAHGRRTKIPLVWPNTMRDGAESKITLVPAARERSPRSEGFVGPWAMFRLMDKGELTQVNDA
TFDVRFPVDQGAMTYRVYTDSAQNPFTGGLFSQFRLPESLY

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
pmt1 | AAN64194.1 | Pmt1 | Not tested | macrophage toxin pathogenicity island | Protein | 0.0 | 47
aec30 | AAQ96724.1 | Aec30 | Not tested | AGI-1 | Protein | 0.0 | 45
aec30 | YP_851415.1 | hypothetical protein | Not tested | PAI II APEC-O1 | Protein | 0.0 | 45

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
AHA_1845 | YP_856379.1 | ImcF family protein | VFG2088 | Protein | 0.0 | 42