Optimized Drug Design using Multi-Objective Evolutionary Algorithms with SELFIES

Read original: arXiv:2405.00401 - Published 5/2/2024 by Tomoya Homberg, Sanaz Mostaghim, Satoru Hiwa, Tomoyuki Hiroyasu
Total Score

0

Optimized Drug Design using Multi-Objective Evolutionary Algorithms with SELFIES

Sign in to get full access

or

If you already have an account, we'll log you in

Overview

  • This paper explores the use of multi-objective evolutionary algorithms (MOEAs) and the SELFIES molecular representation for optimized drug design.
  • The researchers developed a MOEA framework that aims to balance multiple objectives, such as biological activity and drug-like properties, in the search for novel drug candidates.
  • The SELFIES representation allows for the generation of chemically valid molecular structures, which is a key challenge in computational drug design.

Plain English Explanation

The paper discusses a new approach for designing better drug molecules using computer algorithms. Developing new drugs is incredibly difficult and expensive, often taking over a decade and costing hundreds of millions of dollars. One key challenge is figuring out the right molecular structure that has the desired effects on the body while also being safe and easy to manufacture.

The researchers in this paper use a special type of computer algorithm called a "multi-objective evolutionary algorithm" to search for promising drug molecule designs. These algorithms are inspired by the process of evolution, where traits that are most fit for their environment tend to get passed on to future generations. In this case, the algorithm is trying to "evolve" new molecular structures that balance multiple objectives, like being effective at their intended biological target and also having good drug-like properties.

To represent the molecular structures, the researchers use a system called SELFIES, which allows the computer to generate chemically valid molecules. This is important because many molecule designs proposed by computers end up being impossible or very difficult to actually synthesize in a lab.

By combining the power of multi-objective evolutionary algorithms with the SELFIES molecular representation, the researchers were able to efficiently search through the vast space of possible drug molecules and identify promising new candidates. This could help speed up the expensive and time-consuming process of drug discovery and development.

Technical Explanation

The researchers developed a multi-objective evolutionary algorithm (MOEA) framework for optimized drug design using the SELFIES molecular representation. MOEAs are well-suited for this task as they can balance multiple, often conflicting objectives, such as biological activity and drug-like properties.

The SELFIES (Self-Referencing Embedded Strings) representation ensures that the generated molecular structures are chemically valid, addressing a key challenge in computational drug design. SELFIES uses a context-free grammar to encode molecules in a way that avoids invalid structures.

The MOEA framework consists of several components:

  • Initialization: The algorithm starts with a population of randomly generated SELFIES strings representing molecular structures.
  • Fitness Evaluation: The molecules are evaluated based on multiple objective functions, such as predicted biological activity and drug-likeness.
  • Selection and Variation: Parent molecules are selected from the population, and new offspring are generated through crossover and mutation operators.
  • Replacement: The new offspring replace the least fit individuals in the population, maintaining a constant population size.

This iterative process allows the algorithm to gradually explore the chemical space and converge towards optimal solutions that balance the desired objectives.

The researchers tested their approach on several benchmark drug design problems and found that it outperformed single-objective optimization methods and other MOEA approaches. The SELFIES representation was crucial in generating chemically valid molecules throughout the optimization process.

Critical Analysis

The paper presents a promising approach for optimized drug design using multi-objective evolutionary algorithms and the SELFIES molecular representation. However, it's important to recognize some limitations and areas for further research:

  • The paper focuses on a limited set of objective functions, such as predicted biological activity and drug-likeness. In practice, drug design involves balancing many more complex factors, including safety, stability, and manufacturability.
  • The evaluation of the generated molecules is based on computational models and simulations, which may not fully capture the real-world behavior of the compounds. Experimental validation would be necessary to assess the true potential of the identified drug candidates.
  • The paper does not provide a detailed analysis of the computational efficiency and scalability of the MOEA framework, which could be crucial for its practical application in large-scale drug discovery projects.
  • Further research could explore ways to incorporate additional domain knowledge, such as structural information about known drug targets, to guide the optimization process and improve the relevance of the generated molecules.

Conclusion

This paper presents a novel approach for optimized drug design using multi-objective evolutionary algorithms and the SELFIES molecular representation. By balancing multiple objectives, the framework can efficiently explore the vast chemical space and identify promising drug candidates that are both biologically active and possess desirable drug-like properties.

The use of SELFIES ensures the generation of chemically valid molecules, overcoming a key challenge in computational drug design. The results demonstrate the effectiveness of this approach compared to single-objective optimization and other MOEA methods.

While there are still some limitations to address, this research opens up exciting possibilities for accelerating the drug discovery process and ultimately leading to the development of more effective and safer medications for patients in need.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Optimized Drug Design using Multi-Objective Evolutionary Algorithms with SELFIES
Total Score

0

Optimized Drug Design using Multi-Objective Evolutionary Algorithms with SELFIES

Tomoya Homberg, Sanaz Mostaghim, Satoru Hiwa, Tomoyuki Hiroyasu

Computer aided drug design is a promising approach to reduce the tremendous costs, i.e. time and resources, for developing new medicinal drugs. It finds application in aiding the traversal of the vast chemical space of potentially useful compounds. In this paper, we deploy multi-objective evolutionary algorithms, namely NSGA-II, NSGA-III, and MOEA/D, for this purpose. At the same time, we used the SELFIES string representation method. In addition to the QED and SA score, we optimize compounds using the GuacaMol benchmark multi-objective task sets. Our results indicate that all three algorithms show converging behavior and successfully optimize the defined criteria whilst differing mainly in the number of potential solutions found. We observe that novel and promising candidates for synthesis are discovered among obtained compounds in the Pareto-sets.

Read more

5/2/2024

🛠️

Total Score

0

Human-level molecular optimization driven by mol-gene evolution

Jiebin Fang (Hainan Institute of Zhejiang University, Institute of Marine Biology and Pharmacology, Ocean College, Zhejiang University), Churu Mao (Institute of Marine Biology and Pharmacology, Ocean College, Zhejiang University), Yuchen Zhu (College of Pharmaceutical Sciences and Cancer Center, Zhejiang University), Xiaoming Chen (Institute of Marine Biology and Pharmacology, Ocean College, Zhejiang University), Chang-Yu Hsieh (College of Pharmaceutical Sciences and Cancer Center, Zhejiang University), Zhongjun Ma (Hainan Institute of Zhejiang University, Institute of Marine Biology and Pharmacology, Ocean College, Zhejiang University)

De novo molecule generation allows the search for more drug-like hits across a vast chemical space. However, lead optimization is still required, and the process of optimizing molecular structures faces the challenge of balancing structural novelty with pharmacological properties. This study introduces the Deep Genetic Molecular Modification Algorithm (DGMM), which brings structure modification to the level of medicinal chemists. A discrete variational autoencoder (D-VAE) is used in DGMM to encode molecules as quantization code, mol-gene, which incorporates deep learning into genetic algorithms for flexible structural optimization. The mol-gene allows for the discovery of pharmacologically similar but structurally distinct compounds, and reveals the trade-offs of structural optimization in drug discovery. We demonstrate the effectiveness of the DGMM in several applications.

Read more

6/21/2024

🛸

Total Score

0

Latent Chemical Space Searching for Plug-in Multi-objective Molecule Generation

Ningfeng Liu (State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University, Peking-Tsinghua Center for Life Science), Jie Yu (State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University), Siyu Xiu (State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University), Xinfang Zhao (State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University), Siyu Lin (State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University), Bo Qiang (State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University), Ruqiu Zheng (State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University), Hongwei Jin (State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University), Liangren Zhang (State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University), Zhenming Liu (State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University, State Key Laboratory of Pharmaceutical Biotechnology, Nanjing University)

Molecular generation, an essential method for identifying new drug structures, has been supported by advancements in machine learning and computational technology. However, challenges remain in multi-objective generation, model adaptability, and practical application in drug discovery. In this study, we developed a versatile 'plug-in' molecular generation model that incorporates multiple objectives related to target affinity, drug-likeness, and synthesizability, facilitating its application in various drug development contexts. We improved the Particle Swarm Optimization (PSO) in the context of drug discoveries, and identified PSO-ENP as the optimal variant for multi-objective molecular generation and optimization through comparative experiments. The model also incorporates a novel target-ligand affinity predictor, enhancing the model's utility by supporting three-dimensional information and improving synthetic feasibility. Case studies focused on generating and optimizing drug-like big marine natural products were performed, underscoring PSO-ENP's effectiveness and demonstrating its considerable potential for practical drug discovery applications.

Read more

4/11/2024

Peptide Vaccine Design by Evolutionary Multi-Objective Optimization
Total Score

0

Peptide Vaccine Design by Evolutionary Multi-Objective Optimization

Dan-Xuan Liu, Yi-Heng Xu, Chao Qian

Peptide vaccines are growing in significance for fighting diverse diseases. Machine learning has improved the identification of peptides that can trigger immune responses, and the main challenge of peptide vaccine design now lies in selecting an effective subset of peptides due to the allelic diversity among individuals. Previous works mainly formulated this task as a constrained optimization problem, aiming to maximize the expected number of peptide-Major Histocompatibility Complex (peptide-MHC) bindings across a broad range of populations by selecting a subset of diverse peptides with limited size; and employed a greedy algorithm, whose performance, however, may be limited due to the greedy nature. In this paper, we propose a new framework PVD-EMO based on Evolutionary Multi-objective Optimization, which reformulates Peptide Vaccine Design as a bi-objective optimization problem that maximizes the expected number of peptide-MHC bindings and minimizes the number of selected peptides simultaneously, and employs a Multi-Objective Evolutionary Algorithm (MOEA) to solve it. We also incorporate warm-start and repair strategies into MOEAs to improve efficiency and performance. We prove that the warm-start strategy ensures that PVD-EMO maintains the same worst-case approximation guarantee as the previous greedy algorithm, and meanwhile, the EMO framework can help avoid local optima. Experiments on a peptide vaccine design for COVID-19, caused by the SARS-CoV-2 virus, demonstrate the superiority of PVD-EMO.

Read more

6/11/2024