Abstract
Discovering hit molecules with desired biological activity in a directed manner is a promising but profound task in computer-aided drug discovery. Inspired by recent generative AI approaches, particularly Diffusion Models (DM), we propose Graph Latent Diffusion Model (GLDM)-a latent DM that preserves both the effectiveness of autoencoders of compressing complex chemical data and the DM's capabilities of generating novel molecules. Specifically, we first develop an autoencoder to encode the molecular data into low-dimensionallatentrepresentationsandthentraintheDMonthelatentspacetogeneratemoleculesinducingtargetedbiologicalactivity defined by gene expression profiles. Manipulating DM in the latent space rather than the input space avoids complicated operations to map molecule decomposition and reconstruction to diffusion processes, and thus improves training efficiency. Experiments show that GLDM not only achieves outstanding performances on molecular generation benchmarks, but also generates samples with optimal chemical properties and potentials to induce desired biological activity.
Original language | English |
---|---|
Article number | bbae142 |
Journal | Briefings in Bioinformatics |
Volume | 25 |
Issue number | 3 |
DOIs | |
Publication status | Published - May 1 2024 |
Externally published | Yes |
Bibliographical note
Publisher Copyright:© The Author(s) 2024.
ASJC Scopus Subject Areas
- Information Systems
- Molecular Biology
Keywords
- deep generative models
- Diffusion Models
- drug design
- hit molecule discovery
Fingerprint
Dive into the research topics of 'GLDM: hit molecule generation with constrained graph latent diffusion model'. Together they form a unique fingerprint.Press/Media
-
Recent Research from Nanyang Technological University Highlight Findings in Bioinformatics (Gldm: Hit Molecule Generation With Constrained Graph Latent Diffusion Model)
8/16/24
1 item of Media coverage
Press/Media: Research