Gene Information

Name : thiC (HMPREF0424_0892)
Accession : YP_003374107.1
Strain : Gardnerella vaginalis 409-05
Genome accession: NC_013721
Putative virulence/resistance : Unknown
Product : thiamine biosynthesis protein ThiC
Function : -
COG functional category : H : Coenzyme transport and metabolism
COG ID : COG0422
EC number : -
Position : 1012629 - 1015286 bp
Length : 2658 bp
Strand : +
Note : identified by match to protein family HMM PF01964; match to protein family HMM PF02581; match to protein family HMM TIGR00190

DNA sequence :
ATGACATCGAATTATCCGTACGCATCAATGCGTAATCAATTTAATCTAAGCGCCTGCTTTATTGCAGACCCACAAGCTTG
TAATAATCGTCCGCTTACCGATATTGTCGATGATGCTTTGCGCGCTGGAGCGACTTTTATTCGTCTTCATTGCAACAATG
AAAACGCTAAAGAAATCACTACTATTGCACGCGATATTGCACAAATCATCGAAGACAATAACAAATGCGATTCTGTAACT
TTTGTTATCGACGAGCGTGTTGATGTCGTTTGGCAAGCTCGCAATCAAAGCATCAAAGTTGACGGAGTGCATTTAGCACA
AAGCGACATGGAGCCTAGAGAGGCTCGCGCTCTTCTTGGCGAAGATGCTGTAATCGGCTTATCTGTTGAAACCGAAAGCT
TAGTTAAAGTAATAAACGAACTTCCAGATGGATGCATTGATTACATTTGCGTAACAGCAATGCGCAATCCTGAAGAAGGA
TGCGAAAGCACTACTGCCGCTTACGAATTGGAAGCAAATCACACAACGCTGGACGAAGCGAAAATTAATACGATTTGTTC
CGCAAGTGATTTTCCAGTTCTAGTTGGCGGAAGAACTGCACTCGACGATATCGATACAATCGCTCACACCAAGGCTGCAG
GATGGTTCGTTTCTGAAGCATTGTATTCTTCAGAAACACCAGAATCAACTATGCGTGAATTTGTTGAACATTGGAAAGCT
GTGCGAGGTGAAGAAAAGCACGGCTACGCTAAGCGAGTGATAGTTGCAGAAAATTCTGAATCAAAATCTTCAGAAACTCA
AGAAAAGAAGCCAACTTTTATTAATGCGAAAGAAGCAAAGGATGCTGCGAAATTAGCTAAACAACAGCGAGTTGACATTG
CAGCTCGCGGATGCACTCAGCGCGATAAAGCTCATATTCGCAAAACAACTCCAATTCATTTTGAGTATGAATATGGTTCT
TATGATTTGGAAGTTCCTTATACGGAAATTAAGCTTTCGGATACTCCTGGCGTAGGTCCTAACCCGCCTTTTAAGGATTA
CAATACAGAAGGTCCAAAGTGCGATCCGAAGGAAGGTTTGGCTCCGCTTCGCCTTGACTGGATTCGCGACCGCGGTGACG
TTGTGGAATATGAGGGTCGCAGGCGCAATCTTCAAGACGACGGCAAGCGCGCAATTAAGCGAGGCAAAGCTTCTAAAGAA
TGGCGTGGACGCACGCATAAGCCAATGAAGGGCGCGGATCATCCGATTACACAAATGTGGTACGCTCGCCACGGAATCAC
TACTCCGGAAATGCAATATGTTGCAACGCGCGAAAATTGCGATGTAGAGCTGGTTCGCGAAGAAGTTGCAGCCGGACGTG
CTGTAATTCCTTGCAATATTAACCATCCTGAAGCTGAACCTATGATTATTGGCTCGCGCTTCTTGACTAAGCTCAACGCA
AACATGGGTAATTCTGCTGTTACGTCATCTATCGACGAAGAAGTAGAAAAGCTTACGTGGGCCACGAAGTGGGGTGCGGA
TACCGTTATGGATCTTTCCACCGGTAACGATATTCACACAACGCGCGAATGGATTTTGCGCAACTCCCCTGTGCCAATTG
GAACAGTGCCAATGTATCAGGCTTTGGAAAAGGTTGAGGATGATGCTTCTAAGCTCAGCTGGGAGCTTTTCCGCGACACT
GTTATTGAGCAGTGCGAGCAGGGCGTTGACTACATGACTATTCACGCTGGCGTGCTTCTTCGCTACGTGCCGCTTACTGC
AAACCGCGTAACCGGTATTGTTTCTCGTGGTGGCTCAATTATGGCTGAATGGTGCTTGCAACATCATCAAGAGAGCTTCT
TGTATACGCACTTTGAAGAATTATGCGAGATTTTCGCAAAGTACGATGTTGCATTCTCTTTGGGTGATGGTTTGCGTCCA
GGTAGCTTGGCTGATGCTAACGATGCGGCTCAGCTTTCCGAGCTTATGACGCTTGGCGAGCTTACGAAGATCGCTTGGAA
GCATGACGTACAGGTGATGATTGAAGGTCCTGGTCACGTGCCATTCGACACTGTGCGTATGAATATTGAGATGGAAAAGG
CAATTTGCCAGAATGCTCCATTCTATACGCTTGGTCCTTTGACTACGGATACCGCACCTGGCTATGACCACATTACTTCC
GCAATTGGTGGCGTGGAGATTGCGCGATACGGCACCGCAATGCTTTGCTATGTGACTCCTAAGGAACATTTGGGGTTGCC
TAACAAGGATGACGTGAAGCAAGGCGTGATTGCGTATAAGATCGCTTGCCACGCAGCTGATCTTGCTAAGCATCATCCAC
ATGCTATGGATCGCGACAACGCAATCAGTAAGGCTCGCTTTGAGTTCCGCTGGTTGGATCAGTTCAACTTAAGCTATGAT
CCAGATACCGCAATCGCCTTCCACGACGAAACACTTCCTGCAGAACCAGCAAAAATGGCGCACTTCTGCTCGATGTGCGG
ACCAAAGTTCTGCTCGATGGCTATTTCGCAAAATATTCGTAAGCGTTTTGGCGGAGCAGCTCAGCAGGAGCAGCTCGTTG
AAGAAGCACGCAGTCAGGCAATTGCCGATGGTATGAAAGAGATGAGCAAAAAGTTCCAAGAATCCGGCTCATCGTTGTAT
CAAAGCGTGAAAGCATAA

Protein sequence :
MTSNYPYASMRNQFNLSACFIADPQACNNRPLTDIVDDALRAGATFIRLHCNNENAKEITTIARDIAQIIEDNNKCDSVT
FVIDERVDVVWQARNQSIKVDGVHLAQSDMEPREARALLGEDAVIGLSVETESLVKVINELPDGCIDYICVTAMRNPEEG
CESTTAAYELEANHTTLDEAKINTICSASDFPVLVGGRTALDDIDTIAHTKAAGWFVSEALYSSETPESTMREFVEHWKA
VRGEEKHGYAKRVIVAENSESKSSETQEKKPTFINAKEAKDAAKLAKQQRVDIAARGCTQRDKAHIRKTTPIHFEYEYGS
YDLEVPYTEIKLSDTPGVGPNPPFKDYNTEGPKCDPKEGLAPLRLDWIRDRGDVVEYEGRRRNLQDDGKRAIKRGKASKE
WRGRTHKPMKGADHPITQMWYARHGITTPEMQYVATRENCDVELVREEVAAGRAVIPCNINHPEAEPMIIGSRFLTKLNA
NMGNSAVTSSIDEEVEKLTWATKWGADTVMDLSTGNDIHTTREWILRNSPVPIGTVPMYQALEKVEDDASKLSWELFRDT
VIEQCEQGVDYMTIHAGVLLRYVPLTANRVTGIVSRGGSIMAEWCLQHHQESFLYTHFEELCEIFAKYDVAFSLGDGLRP
GSLADANDAAQLSELMTLGELTKIAWKHDVQVMIEGPGHVPFDTVRMNIEMEKAICQNAPFYTLGPLTTDTAPGYDHITS
AIGGVEIARYGTAMLCYVTPKEHLGLPNKDDVKQGVIAYKIACHAADLAKHHPHAMDRDNAISKARFEFRWLDQFNLSYD
PDTAIAFHDETLPAEPAKMAHFCSMCGPKFCSMAISQNIRKRFGGAAQQEQLVEEARSQAIADGMKEMSKKFQESGSSLY
QSVKA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
thiC YP_001598318.1 thiamine biosynthesis protein ThiC Not tested MDA Protein 6e-155 55