Skip to main navigation Skip to search Skip to main content

Multi-omics data driven analysis establishes reference codon biases for synthetic gene design in microbial and mammalian cells

  • National University of Singapore
  • Agency for Science, Technology and Research, Singapore
  • Sangon Biotech (Shanghai) Co. Ltd.

Research output: Contribution to journalArticlepeer-review

Abstract

In this study, we analyzed multi-omics data and subsets thereof to establish reference codon usage biases for codon optimization in synthetic gene design. Specifically, publicly available genomic, transcriptomic, proteomic and translatomic data for microbial and mammalian expression hosts, Escherichia coli, Saccharomyces cerevisiae, Pichia pastoris and Chinese hamster ovary (CHO) cells, were compiled to derive their individual codon and codon pair frequencies. Then, host dependent and -omics specific codon biases were generated and compared by principal component analysis and hierarchical clustering. Interestingly, our results indicated the similar codon bias patterns of the highly expressed transcripts, highly abundant proteins, and efficiently translated mRNA in microbial cells, despite the general lack of correlation between mRNA and protein expression levels. However, for CHO cells, the codon bias patterns among various -omics subsets are not distinguishable, forming one cluster. Thus, we further investigated the effect of different input codon biases on codon optimized sequences using the codon context (CC) and individual codon usage (ICU) design parameters, via in silico case study on the expression of human IFNγ sequence in CHO cells. The results supported that CC is more robust design parameter than ICU for improved heterologous gene design.

Original languageEnglish
Pages (from-to)26-35
Number of pages10
JournalMethods
Volume102
DOIs
StatePublished - 1 Jun 2016
Externally publishedYes

Keywords

  • Codon optimization
  • Microbial and mammalian hosts
  • Multi-omics data
  • Reference codon bias
  • Synthetic gene design

Fingerprint

Dive into the research topics of 'Multi-omics data driven analysis establishes reference codon biases for synthetic gene design in microbial and mammalian cells'. Together they form a unique fingerprint.

Cite this