Holistic Molecular Representation Learning via Multi-view Fragmentation

  • Seojin Kim
  • , Jaehyun Nam
  • , Junsu Kim
  • , Hankook Lee
  • , Sungsoo Ahn
  • , Jinwoo Shin

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Learning chemically meaningful representations from unlabeled molecules plays a vital role in AI-based drug design and discovery. In response to this, several self-supervised learning methods have been developed, focusing either on global (e.g., graph-level) or local (e.g., motif-level) information of molecular graphs. However, it is still unclear which approach is more effective for learning better molecular representations. In this paper, we propose a novel holistic self-supervised molecular representation learning framework that effectively learns both global and local molecular information. Our key idea is to utilize fragmentation, which decomposes a molecule into a set of chemically meaningful fragments (e.g., functional groups), to associate a global graph structure to a set of local substructures, thereby preserving chemical properties and learn both information via contrastive learning between them. Additionally, we also consider the 3D geometry of molecules as another view for contrastive learning. We demonstrate that our framework outperforms prior molecular representation learning methods across various molecular property prediction tasks.

Original languageEnglish
JournalTransactions on Machine Learning Research
Volume2024
StatePublished - 2024

Fingerprint

Dive into the research topics of 'Holistic Molecular Representation Learning via Multi-view Fragmentation'. Together they form a unique fingerprint.

Cite this