Gene Information

Name : O3I_022475 (O3I_022475)
Accession : YP_006809424.1
Strain : Nocardia brasiliensis HUJEG-1
Genome accession: NC_018681
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 5107791 - 5110973 bp
Length : 3183 bp
Strand : -
Note : COG2409 Predicted drug exporters of the RND superfamily

DNA sequence :
GTGTCCGTATACCTCTACAAATGGGGAAAGTTCGCGTTCCGCCGCAAGTGGATCGTGCTGCCGCTATGGCTGGTGCTCCT
CGGTGCGCTCGGCGCCGGTTCGCACCTGCTGAGCAAGTCGATGAGCGACGAGTTCAGCATGCCCGCGCTGCCGTCGGAGC
GCGCGACGGAAATCCTGGACAAGCAGTTTCCGGGCATGTCCGCGCAGTTCGGCATCGACGCCGTCAGCGGTACCTACGTG
ATTCAGGCGCCCGACGGTACGAAGCTGACCGACAAGGACAACAGTGCCGCCGTAGACGCGCTGATCACCGATCTGCGGGC
ACTGGTGGCCGGGGACGACCGGCATCGGCTCGTCGCCGAAAAGGCCGGGGCGGCTTTGCAGAACCCGGTCGCCGCGACCC
AGGCGATGGGCTGCCTGACCACCGCCGATCCCGCGACGTGCGCGGGCGCTCCGCTCAACGTGCTGAACAAGGAAGCCCCG
GCGACCGTCGCCGTGCTCACCGTGCCGTTCGACATCGCCTCGGCGATGGACATCACCGACGAGCAGCGGCAGGTCGCCTA
CGCCGTGGCGGACCCGGCCCGTGCGCGCGGACTGGTGGTCGAACTCGGCGGGTCCATCGCCCGCGAGCAGGAACAGCCCA
GCGGGCAGGCCGAGCTGATCGGCATGGGAGTGGCGCTGGTCGTCATGGTGATCGCGTTCGGCGCCATCGTGGCGGCTTTC
GTGCCGATCGTCACCGCCATCGTCGGCCTGGGCGCCGCGATGCTGGTGATCTCGCTGAGCACCGCGGTCGTCGAGGTGCC
GAGTTTCACGACGTTCCTGGCCTCGATGATCGGTATCGCGCTGTCGATCGACTATGCGCTGTTCATCGTCTCCCGGTACA
AGCACGAATTGCGCGTCGCGGCGAATCCCGAAGAAGCGGCGGGTATTTCGGTGGGTACGGCCGGTTCGGCGGTGGTGTTC
GCCGGGCTCACCGTCATCGTCGCGCTGGGCGCGCTGAGTATCGTCGGGGTGAACTTCCTGACCTTCATGGGTCTCGGCGG
TGCCGTGGCCGCGTTCTTCGCCGTCCTCACCGCGATCACGCTGATGCCCGCGCTGCTCGGCGCTTTCGGCCGTTTCCTGT
TCAAGCCGAAGCTGCCGCTGGTCGCGCGGCACGACCCGGAGGACGACACCTCCGTCACCAACGGCATGCGGGTCGGCCGA
CTGATCGGCAAGCGGCCGTGGTTCGCGCTGATCATCGCGGTGGCCGCATTGGCTGCGCTCGCGACGCCCGCGCTTCAGTT
GCAGTTGGGCCTGCCCGGTGAGGACAGTCTGCCCACCGAGTCCACGGCCCGGCAGGCCTACGACATCCGGACGAACGGCT
TCGGCGAAGGGAGCAACGGCGTTCTCACCGTGGCCGCCGACCTGGAGCAGGTGCCAGAAGGTGAGCGCAAGGCCGCGGTC
ACCGCGCTGCGCGACCGGCTCGCGGAGTTCCCGGAGATGGATTACGTCACCACAGCGCAATTCAGCGCGAACGGGCTGGG
CGCGATGCTCAGCGGGGTGCCGAAATCGGGACCGAACGATCAGGACACCAAAGATCTGGTGCGCGACGCGCGTGCCGCCG
AAGACGAACTGACCGAGCGTTACGGCATCCGATACGGCATCACCGGCACCACCGCCATCTACGCGGACATGGACCACGTC
CTGCTCGGCAAGATCGTGCCCTACCTGGCGATCGTCGCGGGCGCGGCCTTCGTGCTGTTGATCCTGGTGTTCCGGTCCAT
TCTGGTCCCGCTCACCGCGGCGCTGGGGTTCCTGCTCTCGATGGCGGCCACCTTCGGCGCCACGGTGCTGATCTTCCAGG
AGGGCACCTTCGGCCTGATCGCCGATCCGCAGCCGATCGTCAGCTTCCTGCCGATCATGTTGATCGGCCTGGTATTCGGG
CTCGCGATGGACTACCAGGTGTTCCTGGTGACCCGCATGCGCGAGGAATTCGTGCACGGCAAGTCGCCGCGCGACGCCAT
GATCGCCGGGTACCACCACGGCGCGCGCGTGGTCACCTCGGCGGCGATCATCATGATCTCGGTGTTCGGTTCCTTCCTGC
TGGAAAAGGACGTGACCGCGAAGTCCATGGGGTTCGCGCTGGCGGCCGGCGTGGCCATCGATGCCTTCGTGGTGCGCATG
GTGCTGGTGCCGGCCCTGCTGGCGATCATGGGCAGAGCGTCGTGGTGGATGCCGAAGTGGCTCGACCGCATCCTGCCGGA
CATCGATGTCGAAGGCGCGAAGCTGCGCGCGTTGCAGCGCAAGCGGTTCGGTGCGGCCGAGCCCGAGCCGGTCGGGGGCG
AACCGGCTCCGGCGCTCGACGCGGCCGTGGATACCGTTGCGTTGCAGCAGTTGCCGGACGTGGAAGGTGCACAGTTCGTC
GCATCCAACGGCAGGTCGATCTTCGGGCGGGTGTGCCGCGACGACGGCTATCCGGTGCCGGACGCGGCGCTGACTTTGAT
CGACCAACGTGGACAACAGGTTTCGCGGGCGGCAGCGGACGATGACGGCAACTACGCGATCGAGCCGCCCACGCCGGGCG
GCTATGTCCTGATCGTCTCCGCGAACCGGCATCAACCGGCCGCGGTGAACGTCACTGTCGCCGAAGGAGGCGCACACCAC
GTCGACGTGACCCTGCAGGGATCCGGCGAGCTGTCCGGTGTGGTGCGCACCGCAGCCCGCGAGCCGGTGGCGGAGGCCAC
GATCACCGTGACCGACCTGCGCGGCGAGGTGGTCGGGGTAGCGGTGAGCGCGGCGGACGGCGGCTACGCCTGCAAGGGCG
TGCTGGCCGGAACCTATACGCTGGTGGCGGTCGCCGCGCGGATGCGGCCCACCGCGACGACGCTGACCGTACCCGACAGT
GGCCGTTTGCGTTTCGACGTCGAACTCGCGCCCATGGCGATGCTGTGGGGCACGGTGCGCGCGGGCGGCCGCGCGGTTCA
CGACGCCAGGATCACCGTGCTCGACTGGTCCGGCACGATCGTAGGCACCGCGCACACCGACGAGGACGGCCGCTACGCGG
TGGCCGACCTGCCAGAGGGCGAGTACACCGTCGAGGCCCGCGGCTACCCCCTGGTGACCGGCCGGGTCACCATCACCGGC
AGCCAGGTCGACCACGATGTCCGCTTGGGCTTCGACATCGACGAACACGCGGAAATGTCATGA

Protein sequence :
MSVYLYKWGKFAFRRKWIVLPLWLVLLGALGAGSHLLSKSMSDEFSMPALPSERATEILDKQFPGMSAQFGIDAVSGTYV
IQAPDGTKLTDKDNSAAVDALITDLRALVAGDDRHRLVAEKAGAALQNPVAATQAMGCLTTADPATCAGAPLNVLNKEAP
ATVAVLTVPFDIASAMDITDEQRQVAYAVADPARARGLVVELGGSIAREQEQPSGQAELIGMGVALVVMVIAFGAIVAAF
VPIVTAIVGLGAAMLVISLSTAVVEVPSFTTFLASMIGIALSIDYALFIVSRYKHELRVAANPEEAAGISVGTAGSAVVF
AGLTVIVALGALSIVGVNFLTFMGLGGAVAAFFAVLTAITLMPALLGAFGRFLFKPKLPLVARHDPEDDTSVTNGMRVGR
LIGKRPWFALIIAVAALAALATPALQLQLGLPGEDSLPTESTARQAYDIRTNGFGEGSNGVLTVAADLEQVPEGERKAAV
TALRDRLAEFPEMDYVTTAQFSANGLGAMLSGVPKSGPNDQDTKDLVRDARAAEDELTERYGIRYGITGTTAIYADMDHV
LLGKIVPYLAIVAGAAFVLLILVFRSILVPLTAALGFLLSMAATFGATVLIFQEGTFGLIADPQPIVSFLPIMLIGLVFG
LAMDYQVFLVTRMREEFVHGKSPRDAMIAGYHHGARVVTSAAIIMISVFGSFLLEKDVTAKSMGFALAAGVAIDAFVVRM
VLVPALLAIMGRASWWMPKWLDRILPDIDVEGAKLRALQRKRFGAAEPEPVGGEPAPALDAAVDTVALQQLPDVEGAQFV
ASNGRSIFGRVCRDDGYPVPDAALTLIDQRGQQVSRAAADDDGNYAIEPPTPGGYVLIVSANRHQPAAVNVTVAEGGAHH
VDVTLQGSGELSGVVRTAAREPVAEATITVTDLRGEVVGVAVSAADGGYACKGVLAGTYTLVAVAARMRPTATTLTVPDS
GRLRFDVELAPMAMLWGTVRAGGRAVHDARITVLDWSGTIVGTAHTDEDGRYAVADLPEGEYTVEARGYPLVTGRVTITG
SQVDHDVRLGFDIDEHAEMS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
cur_1489 YP_001800883.1 RND superfamily drug exporter Not tested Not named Protein 5e-109 42