Controlled molecule generator for optimizing multiple chemical properties

Bonggun Shin, Sungsoo Park, Jin Yeong Bak, Joyce C. Ho

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

12 Scopus citations

Abstract

Generating novel, optimized molecules with desired chemical properties is an essential part of the drug discovery process. Failure to meet even one of the required properties frequently leads to failure in clinical testing, which is costly. Optimizing multiple properties at once is also challenging, because optimizing one property tends to change the others. In this paper, we pose this multi-property optimization problem as a sequence translation process and propose a new optimized molecule generator model based on the Transformer with two constraint networks: property prediction and similarity prediction. We further improve the model by incorporating score predictions from these constraint networks into a modified beam search algorithm. Experiments demonstrate that our proposed model, Controlled Molecule Generator (CMG), outperforms state-of-the-art models by a significant margin when optimizing multiple properties simultaneously.
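
The abstract does not spell out how the beam search is modified, so the following is only a minimal sketch of one plausible reading: candidates are ranked by the decoder's log-probability plus a weighted sum of the two constraint scores. Every name here (`constrained_beam_search`, `toy_step`, `toy_property`, `toy_similarity`, `weight`) is hypothetical, the additive scoring rule is an assumption, and the toy functions stand in for the Transformer decoder and the two constraint networks.

```python
import math
from typing import Callable, Dict, List, Tuple

# Hypothetical interfaces; none of these names come from the paper.
StepFn = Callable[[str], Dict[str, float]]  # prefix -> {next token: probability}
ScoreFn = Callable[[str], float]            # sequence -> score in [0, 1]

EOS = "$"  # assumed end-of-sequence token


def constrained_beam_search(step_fn: StepFn, property_fn: ScoreFn,
                            similarity_fn: ScoreFn, beam_width: int = 2,
                            max_len: int = 10,
                            weight: float = 1.0) -> List[Tuple[str, float]]:
    """Beam search whose ranking adds constraint scores to the log-probability."""

    def rank(candidate: Tuple[str, float]) -> float:
        seq, logp = candidate
        body = seq[:-len(EOS)] if seq.endswith(EOS) else seq
        # Assumed scoring rule: log-prob + weight * (property + similarity).
        return logp + weight * (property_fn(body) + similarity_fn(body))

    beams: List[Tuple[str, float]] = [("", 0.0)]
    finished: List[Tuple[str, float]] = []
    for _ in range(max_len):
        candidates = [(seq + tok, logp + math.log(p))
                      for seq, logp in beams
                      for tok, p in step_fn(seq).items() if p > 0]
        if not candidates:
            break
        candidates.sort(key=rank, reverse=True)
        beams = []
        for cand in candidates[:beam_width]:
            if cand[0].endswith(EOS):
                finished.append((cand[0][:-len(EOS)], rank(cand)))
            else:
                beams.append(cand)
        if not beams:
            break
    finished.sort(key=lambda item: item[1], reverse=True)
    return finished


# Toy stand-ins for the decoder and the two constraint networks.
def toy_step(prefix: str) -> Dict[str, float]:
    return {"C": 0.6, "O": 0.4} if len(prefix) < 2 else {EOS: 1.0}


def toy_property(seq: str) -> float:
    # Pretend the desired property is the fraction of "O" tokens.
    return seq.count("O") / len(seq) if seq else 0.0


def toy_similarity(seq: str, reference: str = "CO") -> float:
    # Position-wise match against a reference (source) sequence.
    matches = sum(a == b for a, b in zip(seq, reference))
    return matches / max(len(seq), len(reference))


baseline = constrained_beam_search(toy_step, toy_property, toy_similarity, weight=0.0)
guided = constrained_beam_search(toy_step, toy_property, toy_similarity, weight=2.0)
print(baseline[0][0], guided[0][0])
```

In this toy setup, likelihood alone (`weight=0.0`) ranks `CC` first, while adding the constraint scores (`weight=2.0`) promotes `CO`, which scores higher on both the stand-in property and the stand-in similarity. Scoring partial sequences with the constraint functions is a simplification made for the sketch.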

Original language: English
Title of host publication: ACM CHIL 2021 - Proceedings of the 2021 ACM Conference on Health, Inference, and Learning
Publisher: Association for Computing Machinery, Inc
Pages: 146-153
Number of pages: 8
ISBN (Electronic): 9781450383592
State: Published - 8 Apr 2021
Event: 2021 ACM Conference on Health, Inference, and Learning, CHIL 2021 - Virtual, Online, United States
Duration: 8 Apr 2021 to 9 Apr 2021

Publication series

Name: ACM CHIL 2021 - Proceedings of the 2021 ACM Conference on Health, Inference, and Learning

Conference

Conference: 2021 ACM Conference on Health, Inference, and Learning, CHIL 2021
Country/Territory: United States
City: Virtual, Online
Period: 8/04/21 to 9/04/21

Keywords

  • drug discovery
  • molecule optimization
  • neural networks
  • self-attention
  • sequence to sequence
