GLDM: hit molecule generation with constrained graph latent diffusion model

Conghao Wang, Hiok Hian Ong, Shunsuke Chiba, Jagath C. Rajapakse*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)

Abstract

Discovering hit molecules with desired biological activity in a directed manner is a promising but profound task in computer-aided drug discovery. Inspired by recent generative AI approaches, particularly Diffusion Models (DM), we propose Graph Latent Diffusion Model (GLDM)-a latent DM that preserves both the effectiveness of autoencoders of compressing complex chemical data and the DM's capabilities of generating novel molecules. Specifically, we first develop an autoencoder to encode the molecular data into low-dimensionallatentrepresentationsandthentraintheDMonthelatentspacetogeneratemoleculesinducingtargetedbiologicalactivity defined by gene expression profiles. Manipulating DM in the latent space rather than the input space avoids complicated operations to map molecule decomposition and reconstruction to diffusion processes, and thus improves training efficiency. Experiments show that GLDM not only achieves outstanding performances on molecular generation benchmarks, but also generates samples with optimal chemical properties and potentials to induce desired biological activity.

Original languageEnglish
Article numberbbae142
JournalBriefings in Bioinformatics
Volume25
Issue number3
DOIs
Publication statusPublished - May 1 2024
Externally publishedYes

Bibliographical note

Publisher Copyright:
© The Author(s) 2024.

ASJC Scopus Subject Areas

  • Information Systems
  • Molecular Biology

Keywords

  • deep generative models
  • Diffusion Models
  • drug design
  • hit molecule discovery

Fingerprint

Dive into the research topics of 'GLDM: hit molecule generation with constrained graph latent diffusion model'. Together they form a unique fingerprint.

Cite this