Gene Information

Name : TM1040_0631 (TM1040_0631)
Accession : YP_612626.1
Strain : Ruegeria sp. TM1040
Genome accession: NC_008044
Putative virulence/resistance : Resistance
Product : acriflavin resistance protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0841
EC number : -
Position : 668659 - 671709 bp
Length : 3051 bp
Strand : +
Note : PFAM: acriflavin resistance protein: (9.5e-69); KEGG: sil:SPO0921 transporter, AcrB/AcrD/AcrF family, ev=0.0, 72% identity

DNA sequence :
ATGGACATTGCACGCGGATCAATTCAACGGCCGCTTTATACGTGGCTGATCATGCTGTTTGCCCTCTTTGGGGGCATCTG
GGGCTTTCTCTCGCTGGGTCGATTGGAAGACCCTGCCTTCACCATCAAACAGGCGGTGATCCAGACGACTTATGCCGGCG
CCAGCGCCGAGCAGGTGGCCCTGGAGGTGTCCGAACCTCTGGAATCCGCCATCCAGAAGATGGAGGAAGTCAAGGAAATC
ACCTCCATGAACCGCCCCGGCGTATCGATCATCGAAGTGGAGATGCGCGACACCTATGACGGCTCGGAGTTGCCACAGAT
CTGGACCAAACTGCGCGCAAAGGTGCGTGATGCGGCCCGTGGCCTGCCGGATGGCGTCAGCGAGCCGCTGGTGAACGACA
GCTTTGGGGACGTGTTCGGTATCTTCTATGCGGTAACGGCTGAAGGGTTTTCCGACCGCGAGAAATACGAGCTATCGACC
TTCCTGCGCCGCGAGTTGTTGACCGTGGACGGTGTGGCGGATGTCGATATCTCGGGGCTGCGCAAAGAGACAATCTATGT
CGAACCCGACATGCCGATTGCCACCAACCTTGGGGTTTCGGTCAATGCGATCGCCAACGCCGTTGCCACTTCGAACTCGG
TGACACCGGCGGGAGAGATCAAGGCCGGACCGATGAACACCATTCTGCAAACTCCCGAAGGCTCGGATTCCGTTTCGGAG
ATTGCGGGTCTGACGGTGGGCGTCGGGGGTGAGACCATCAACATCATCGACATCGCAAATGTTTGGCGCGGACGAGAGAT
CGACCCGTCGCTGATGATCCGCTTTGACGGGGTTGAGGCCTTTACCCTTGGGGTGTCCGGGATCGCCAATGAAAACATTG
TGGATGTTGGTCATCGCGTGGACGCCAAGCTGGCGGAGCTGGACAGCGACATTCCCTTTGGCGTGGAACTGCATCCGATC
TATCAGCAGCATATCGTTGTGGAGCAGGCGTCCAATGATTTCCTCGTGAACCTTGCCATGTCGGTGGCCATTGTGATCGT
GGTGCTGGCGCTCTTCATGGGGTGGCGCGCGGCCATTGTGGTCGGGACAACCCTGCTGCTGACGGTGGTGGGGACGCTCT
TCTTCATGGCGCTGTTTTCAATCGAGATGGAACGCATCTCGCTTGGGGCGTTGATCATCGCCATGGGCATGCTCGTCGAT
AACGCCATCGTGGTGGCGGAGGGGATGCAAATCTCCATGCAAAAAGGCAGATCCTCGCGTGAAGCCGCGCAGGAGGCGGC
CTCGAAGACCCAGATCCCTCTGTTGGGCGCGACGGTGATTGGCATCATGGCCTTTGCCGGGATCGGGCTCAGCCCGGACG
CCACGGGCGAGTTCATGTTCTCGCTCTTTGCCGTGATCGGAATTTCGCTTCTGCTGAGCTGGATTCTTGCGCTGACGGTC
ACGCCGCTCTTGGGGCATTACTTCTTTAAACGCGGTTCGGGAGACGATCAGGACAGCTACAACGGGCTGTTGTTTCGCAG
CTACGGCGCTCTGCTGCGCGGCGCACTCAAGGCGCGCTGGCTTGTGGTGGTCGGCCTGATTGGCGTGACTGTGATGTGTT
TCATGGGGTTTGGCCTCATCAAGCAGCAGTTCTTTCCGAACTCCAACACGCCGTTGTTCTATGTGCACTACAAGTTGCCA
CAGGGCACCGCGATCGAAGCAACCTCTGCGCATATGGCTGAAATGGAATCCTGGCTGGCTGCGCGCGATGAGGTGGTATC
GACCGCAACCTTCGTGGGGCAGGGGGCGACCCGCTTCTTGTTGACCTATGCCGGTCAGAAGGCCAATCCAAGCTACGGGC
ATCTGATCATCCGCACGGAAAATCTCGAACAGATCCCGGGCTTGCAGGCGGATCTCGAGGCCTGGGGCAAGGCGCATTTT
CCCGAAGGTGAGTTCCGCACAGAGCGGCTGGTCTTTGGCCCCGGCGGCGGCGCCCCAGTTGAGGTGCGTTTCTCTGGTCC
GGATCCGGCTGTGCTGCGCCGCCTTGCCAATGAAGCCAAGGAACTGCTGGAGACAGAGAGCGATCTGACGCGCGATGTAC
GTCACAATTGGCGCGAGCAGGAGTTGGTGCTCAAACCCGTCTACGCCACGGACCGGGCGCAAACGGCTGGGGTGACGCGA
GAGGCGATCTCGAGCGCGCTGCAGTTCTCGACCAACGGGGTGACAACGGGGGTGTTCCGCGAACGCGAACGCCTGATCCC
GATTGTCATGCGCCAGCCGCAGGAAGACCCTTATAACGTGATGAACCAGGTGGTGTTTTCGGAGACCTCTGGCAGCTGGA
TCCCGCTGGAACAGATGATCGATGGCATCTCCTACGAGGTTCAGAACACGCTCGTGCATCGGCGCGACCGCGTCTACACT
ATCACCGTGGGGGCGGATGTATTGCCGGATGTGACGGCTGCCACCGCCTTTGGCGAGGTGCAGGCCGCGATCGAGGCCAT
CGAGGTGCCGGTCGGCTACAAGATGGAATGGGGCGGGGAGCATGAGAACTCCTCGGAGGCCAATGCAAGTCTCGGCAAGC
AATTGCCGCTCTCGCTGCTGATCATGGTGCTGATCTCCGTTCTGCTCTTCAATGCGATCCGTCAGCCGATCATTATCTGG
CTGCTGGTCCCGATGAGCGTGAACGGGGTTGTGATCGGTCTCTTGGGCACGGGGCTTCCCTTTACGTTCACAGCACTTCT
TGGTCTGCTGAGCCTGTCCGGCATGCTGATCAAGAACGGGATCGTTCTGGTGGAAGAAATCGACATTGTCCGCCGCGAAG
AGGGGCTGCCGCTGCGCGAGAGCATCGTGAAGGCCTCTGTGTCGCGTTTGCGTCCCGTGATGCTTGCGGCGATTACGACC
ATCCTCGGTATGGCCCCGCTTATGAGCGATGCCTTCTTCGTGTCGATGGCGGTGACGATCATGGGCGGTCTGGCCTTTGC
AACAGTGCTGACGCTTGTTGCGGCGCCGGTGTTCTACCTGATCTTTTTCAAAGGGGTGGAACGGCAGGAGCAGAAGGCAG
CCGCCGCCTAG

Protein sequence :
MDIARGSIQRPLYTWLIMLFALFGGIWGFLSLGRLEDPAFTIKQAVIQTTYAGASAEQVALEVSEPLESAIQKMEEVKEI
TSMNRPGVSIIEVEMRDTYDGSELPQIWTKLRAKVRDAARGLPDGVSEPLVNDSFGDVFGIFYAVTAEGFSDREKYELST
FLRRELLTVDGVADVDISGLRKETIYVEPDMPIATNLGVSVNAIANAVATSNSVTPAGEIKAGPMNTILQTPEGSDSVSE
IAGLTVGVGGETINIIDIANVWRGREIDPSLMIRFDGVEAFTLGVSGIANENIVDVGHRVDAKLAELDSDIPFGVELHPI
YQQHIVVEQASNDFLVNLAMSVAIVIVVLALFMGWRAAIVVGTTLLLTVVGTLFFMALFSIEMERISLGALIIAMGMLVD
NAIVVAEGMQISMQKGRSSREAAQEAASKTQIPLLGATVIGIMAFAGIGLSPDATGEFMFSLFAVIGISLLLSWILALTV
TPLLGHYFFKRGSGDDQDSYNGLLFRSYGALLRGALKARWLVVVGLIGVTVMCFMGFGLIKQQFFPNSNTPLFYVHYKLP
QGTAIEATSAHMAEMESWLAARDEVVSTATFVGQGATRFLLTYAGQKANPSYGHLIIRTENLEQIPGLQADLEAWGKAHF
PEGEFRTERLVFGPGGGAPVEVRFSGPDPAVLRRLANEAKELLETESDLTRDVRHNWREQELVLKPVYATDRAQTAGVTR
EAISSALQFSTNGVTTGVFRERERLIPIVMRQPQEDPYNVMNQVVFSETSGSWIPLEQMIDGISYEVQNTLVHRRDRVYT
ITVGADVLPDVTAATAFGEVQAAIEAIEVPVGYKMEWGGEHENSSEANASLGKQLPLSLLIMVLISVLLFNAIRQPIIIW
LLVPMSVNGVVIGLLGTGLPFTFTALLGLLSLSGMLIKNGIVLVEEIDIVRREEGLPLRESIVKASVSRLRPVMLAAITT
ILGMAPLMSDAFFVSMAVTIMGGLAFATVLTLVAAPVFYLIFFKGVERQEQKAAAA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
VC0395_A1355 YP_001217299.1 AcrB/AcrD/AcrF family transporter Not tested VPI-2 Protein 2e-158 41

• Homologs from CARD and BacMet (resistance genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
TM1040_0631 YP_612626.1 acriflavin resistance protein BAC0519 Protein 1e-174 44
TM1040_0631 YP_612626.1 acriflavin resistance protein BAC0428 Protein 1e-170 44
TM1040_0631 YP_612626.1 acriflavin resistance protein BAC0526 Protein 9e-167 42
TM1040_0631 YP_612626.1 acriflavin resistance protein BAC0425 Protein 8e-159 41