Gene Information

Name : MSMEG_0690 (MSMEG_0690)
Accession : YP_885098.1
Strain : Mycobacterium smegmatis MC2 155
Genome accession: NC_008596
Putative virulence/resistance : Unknown
Product : (Fe-S)-binding protein
Function : -
COG functional category : C : Energy production and conversion
COG ID : COG0247
EC number : -
Position : 773427 - 776555 bp
Length : 3129 bp
Strand : -
Note : identified by match to protein family HMM PF00037; match to protein family HMM PF02754

DNA sequence :
GTGGCACACACCCTCGAAGTGAGCAGGCTCATCATCGGGCTGCTCATGACGGCCATCGTCTTGGTGTTCGCCGCCAAGCG
AGTGCTCTGGCTGACGAAGCTGATTCGCTCGGGCCAGAAGACGCTCGACGAGAACGGCCGCAAGAACGATCTGCAAAAGC
GCATCACCACGCAGATCACCGAGGTCTTCGGGCAAACGCGCCTGCTGCGCTGGTCGGTTCCGGGCATCGCGCACTTCTTC
ACGATGTGGGGCTTCTTCGTCCTCGCCTCGGTGTACCTCGAGGCCTACGGCGTGCTGTTCGATCCCGAGTTCCACATCCC
GTTCATCGGCCGCTGGCCGGTCCTGGGCTTTCTGCAGGACTTCTTCGCCGTCGCCGTGCTGCTGGGCATCATCGTCTTCG
CGATCATCCGCGTCGTCCGTGAGCCCAAGAAGATCGGGCGCGACTCGCGCTTCTACGGTTCGCACACCGGCGGCGCGTGG
GAGATCCTGTTCATGATCTTCCTGGTCATCGCGACCTACGCGCTGTTCCGCGGTGCCGCGGTCAACACCCTCGGCGAGCG
TTTCCCCTACCAGAGCGGCGCCTTCTTCTCCGACTTCATGGCGTGGATCCTGCGCCCGCTCGGCGCGACCGCCAACATGT
GGATCGAGACCGTCGCCCTGATGGGCCACATCGGCGTCATGCTGGTGTTCCTGCTGATCGTGCTGCACTCCAAGCACCTG
CACATCGGCCTTGCGCCCATCAACGTCACGTTCAAGCGCCTGCCCAACGGCCTCGGCCCGCTGCTGCCGATGGAGTCCAA
CGGCGAGTACATCGACTTCGAGGATCCCGCCGAGGACGCGGTGTTCGGTAAAGGCAAGATCGAGGACTTCACCTGGAAGG
GTTACCTGGACTTCACGACCTGTACCGAGTGTGGCCGCTGCCAGTCGCAGTGCCCGGCGTGGAACACCGGCAAACCGCTG
TCGCCCAAGCTCGTGATCATGAACCTGCGCGACCACATGTTCGCCAAGGCTCCCTACATCCTGGGCGAGAAGCCGTCGCC
GCTGGAGAGCACGCCCGAGGGCGGCCTGGGTGAGAAGGCCCGCGGCGAGAAGCACGAGCAGAAGCACGCGCACGACCACG
TCCCCGAGTCCGGCTTCGAGCGGATCCTCGGCAGCGGCCCCGAGCAGGCCCTGCGCCCGCTGGTCGGCACCGAGGAACAG
GGCGGCGTGATCGATCCCGACGTGCTGTGGTCCTGCACCAACTGCGGTGCGTGCGTCGAGCAGTGCCCCGTGGACATCGA
GCACATCGACCACATCGTCGACATGCGCCGCTACCAGGTCATGGTCGAGTCGGAGTTCCCCGGTGAGCTGGGCGTGCTGT
TCAAGAACCTGGAGACCAAGGGCAACCCCTGGGGCCAGAACGCCAAGGACCGCACCAACTGGATCGACGAGGTCGACTTC
GACGTGCCGGTCTACGGCGAGGACGTCGACTCGTTCGACGGGTTCGAGTACCTGTTCTGGGTCGGCTGCGCCGGCGCCTA
CGAGGACCGCGCCAAGAAGACCACAAAGGCCGTCGCCGAACTGCTCGCCACCGCGGGCGTGAAGTTCCTGGTGCTGGGCA
CCGGCGAGACCTGCACGGGCGACTCGGCGCGCCGCTCGGGCAACGAGTTCCTGTTCCAGCAGCTGGCCGCGCAGAACGTC
GAGACCATCAACGAACTGTTCGAAGGCGTCGAGACGGTCGACCGCAAGATCGTGGTGACGTGCCCGCACTGCTTCAACAC
GATCGGCCGTGAATACCCGCAGCTGGGTGCCAACTACAGCGTCGTGCACCACACGCAGCTGCTCAACCGGCTGGTCCGCG
ACAAGAAGCTGGTTCCGGTCAAGTCGGTCAGCGAGCAGAACGGCCAGCCCGTCACCTACCACGACCCGTGCTTCCTGGGC
CGCCACAACAAGGTCTACGAGGCCCCGCGTGAGCTGGTCGAGGCCTCCGGCGTGACGCTGAAGGAAATGCCGCGCCACGC
CGACCGCGGCCTGTGCTGTGGCGCCGGCGGTGCGCGTATGTGGATGGAAGAGCACATCGGCAAGCGCGTCAACGTCGAAC
GCACCGAAGAGGCCATGGACACGGCCTCGACCATCGCGACCGGCTGCCCGTTCTGCCGCGTGATGATCACCGACGGTGTC
GACGACGTGGCGGCCAGCCGCAACGTCGAGAAGGCCGAGGTGCTCGACGTCGCCCAGCTGCTGCTGAACTCGCTGGACAC
CAGCAAGGTCACGCTGCCCGAAAAGGGCACGGCGGCAAAGGAATCCGAGAAGCGCGCCGCTGCCCGGGCAGAGGCCGAAG
CGAAGGCGGAAGCGGCCGCTCCCCCGGTCGAGGAAGCCGCACCCGAAGCAGAGGCCCCGGCCGCCCCGGCAGCCGGTGGG
GCCGAGGCCAAGCCGGTCACCGGCCTGGGCATGGCGGGTGCCGCGAAGCGTCCGGGCGCCAAGAAAGCCGCACCTGCCGC
GGAAGCGTCTGCGGCACCGGCCGCCGCGCCCGCCCCGGCCAAGGGTCTGGGCCTGGCCGGCGGGGCCAAGCGTCCCGGCG
CGAAGAAGGCCGCCGCACCGGCCGCCGAGGCACCCGCTGCTCCGGCTTCGGACGCTCCGCCGGTCAAGGGGCTCGGTCTC
GCGGGTGGTGCCAAGCGACCCGGCGCCAAGAAGACCGCGGCAGCTGCTCCGGCCGAAAAGCCCGCAGCCACAGAGGCTCC
GGAAGCGTCGGCGACCCCGGCAGCCCCGGCGGCGCCCGTGAAGGGGCTGGGTCTCGCGGCAGGCGCCAAGCGTCCCGGCG
CCAAGAAGACCGCAGCCGCACCGGCCGAAAAGCCCGCAGCCGCAGAGACCGAGGCACCGGCGCCGGCAGAAACCGCGGCT
CCGGCCGAGCCGGCCAAGCCCGAACCGCCCGTCGTGGGCCTCGGCATCGCCGCGGGCGCTCGCCGTCCGGGTGCCAAGAA
GGCGGCCGCCAAGCCTGCCGCTGCGCCGGCGCCCGCCGCCGAGAAGCCGGCCGAGCAGGCCGCGGAGCCCGAAAAGCCCG
CGGAGAAGCCGGCTGAGCCCGAGAAGCCGGAACCGCCCGTGGTGGGGCTCGGCATCAAGCCGGGCGCCAAGCGGCCCGGT
AAGCGCTGA

Protein sequence :
MAHTLEVSRLIIGLLMTAIVLVFAAKRVLWLTKLIRSGQKTLDENGRKNDLQKRITTQITEVFGQTRLLRWSVPGIAHFF
TMWGFFVLASVYLEAYGVLFDPEFHIPFIGRWPVLGFLQDFFAVAVLLGIIVFAIIRVVREPKKIGRDSRFYGSHTGGAW
EILFMIFLVIATYALFRGAAVNTLGERFPYQSGAFFSDFMAWILRPLGATANMWIETVALMGHIGVMLVFLLIVLHSKHL
HIGLAPINVTFKRLPNGLGPLLPMESNGEYIDFEDPAEDAVFGKGKIEDFTWKGYLDFTTCTECGRCQSQCPAWNTGKPL
SPKLVIMNLRDHMFAKAPYILGEKPSPLESTPEGGLGEKARGEKHEQKHAHDHVPESGFERILGSGPEQALRPLVGTEEQ
GGVIDPDVLWSCTNCGACVEQCPVDIEHIDHIVDMRRYQVMVESEFPGELGVLFKNLETKGNPWGQNAKDRTNWIDEVDF
DVPVYGEDVDSFDGFEYLFWVGCAGAYEDRAKKTTKAVAELLATAGVKFLVLGTGETCTGDSARRSGNEFLFQQLAAQNV
ETINELFEGVETVDRKIVVTCPHCFNTIGREYPQLGANYSVVHHTQLLNRLVRDKKLVPVKSVSEQNGQPVTYHDPCFLG
RHNKVYEAPRELVEASGVTLKEMPRHADRGLCCGAGGARMWMEEHIGKRVNVERTEEAMDTASTIATGCPFCRVMITDGV
DDVAASRNVEKAEVLDVAQLLLNSLDTSKVTLPEKGTAAKESEKRAAARAEAEAKAEAAAPPVEEAAPEAEAPAAPAAGG
AEAKPVTGLGMAGAAKRPGAKKAAPAAEASAAPAAAPAPAKGLGLAGGAKRPGAKKAAAPAAEAPAAPASDAPPVKGLGL
AGGAKRPGAKKTAAAAPAEKPAATEAPEASATPAAPAAPVKGLGLAAGAKRPGAKKTAAAPAEKPAAAETEAPAPAETAA
PAEPAKPEPPVVGLGIAAGARRPGAKKAAAKPAAAPAPAAEKPAEQAAEPEKPAEKPAEPEKPEPPVVGLGIKPGAKRPG
KR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
cpfrc_01915 YP_003784315.1 hypothetical protein Not tested PiCp 7 Protein 1e-139 44
fadF YP_005682164.1 Protein fadF Not tested PiCp 7 Protein 1e-139 44
fadF YP_005684256.1 Protein fadF Not tested PiCp 7 Protein 1e-139 44
fadF YP_005686348.1 Protein fadF Not tested PiCp 7 Protein 1e-139 44