FusionDTI: Fine-grained Binding Discovery with Token-level Fusion for Drug-Target Interaction

Read original: arXiv:2406.01651 - Published 6/5/2024 by Zhaohan Meng, Zaiqiao Meng, Iadh Ounis

FusionDTI: Fine-grained Binding Discovery with Token-level Fusion for Drug-Target Interaction

Overview

This paper introduces FusionDTI, a novel deep learning model for fine-grained drug-target interaction (DTI) prediction.
FusionDTI uses a token-level fusion strategy to effectively combine drug and target information, enabling it to make more accurate and interpretable DTI predictions.
The model is evaluated on several benchmark datasets and shown to outperform state-of-the-art DTI prediction methods.

Plain English Explanation

FusionDTI is a machine learning model that can predict how well a drug will interact with a specific target, such as a protein in the body. This is important for drug discovery, as it can help researchers identify promising drug candidates more efficiently.

The key innovation in FusionDTI is its "token-level fusion" strategy. This means the model looks at the individual words or "tokens" in the drug and target descriptions, and combines this information in a smart way to make its predictions. This allows FusionDTI to capture subtle details about the drug-target interaction that other models might miss.

For example, if a drug is known to interact with a certain protein domain, FusionDTI can pick up on that signal and use it to make a more accurate prediction, rather than just looking at the overall drug and protein structures.

By using this more fine-grained approach, FusionDTI was able to outperform other state-of-the-art DTI models on several standard benchmark datasets. This suggests it could be a valuable tool for researchers working on developing new drugs.

Technical Explanation

FusionDTI uses a token-level fusion strategy to effectively combine drug and target information for DTI prediction. Specifically, the model takes in the textual descriptions of drugs and targets, and learns to attend to the most relevant token-level features for predicting their interaction.

This is in contrast to previous DTI models, which typically encode the drug and target as fixed-length vectors and then use techniques like HierGraphDTI or Diff-DTI to model their relationship.

FusionDTI's architecture includes a token-level fusion module that learns to dynamically combine the drug and target representations, allowing the model to capture more fine-grained and interpretable patterns in the data. This is complemented by a task-specific prediction head that generates the final DTI score.

The model is evaluated on several standard DTI benchmark datasets, including ChEMBL and BindingDB. FusionDTI is shown to outperform state-of-the-art DTI prediction methods, demonstrating the effectiveness of its token-level fusion strategy.

Critical Analysis

The authors note that FusionDTI, like other DTI prediction models, relies on the availability of high-quality training data. In practice, drug-target interaction data can be sparse, noisy, and imbalanced, which can limit the model's performance.

Additionally, the paper does not explore the model's generalization to novel drugs or targets, or its robustness to data shifts. Further research would be needed to assess the model's real-world applicability and deployment challenges.

That said, the token-level fusion approach is a promising direction for improving the interpretability and fine-grained nature of DTI predictions. Incorporating additional multimodal data sources, such as chemical structures or biological annotations, could also help enhance the model's performance and usability for drug discovery.

Conclusion

The FusionDTI model introduces an innovative token-level fusion strategy for drug-target interaction prediction, which allows it to outperform state-of-the-art methods on several benchmark datasets. This fine-grained approach to combining drug and target information holds promise for improving the accuracy and interpretability of DTI models, potentially accelerating the drug discovery process. However, further research is needed to address the model's limitations and explore its real-world applicability.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

FusionDTI: Fine-grained Binding Discovery with Token-level Fusion for Drug-Target Interaction

Zhaohan Meng, Zaiqiao Meng, Iadh Ounis

Predicting drug-target interaction (DTI) is critical in the drug discovery process. Despite remarkable advances in recent DTI models through the integration of representations from diverse drug and target encoders, such models often struggle to capture the fine-grained interactions between drugs and protein, i.e. the binding of specific drug atoms (or substructures) and key amino acids of proteins, which is crucial for understanding the binding mechanisms and optimising drug design. To address this issue, this paper introduces a novel model, called FusionDTI, which uses a token-level Fusion module to effectively learn fine-grained information for Drug-Target Interaction. In particular, our FusionDTI model uses the SELFIES representation of drugs to mitigate sequence fragment invalidation and incorporates the structure-aware (SA) vocabulary of target proteins to address the limitation of amino acid sequences in structural information, additionally leveraging pre-trained language models extensively trained on large-scale biomedical datasets as encoders to capture the complex information of drugs and targets. Experiments on three well-known benchmark datasets show that our proposed FusionDTI model achieves the best performance in DTI prediction compared with seven existing state-of-the-art baselines. Furthermore, our case study indicates that FusionDTI could highlight the potential binding sites, enhancing the explainability of the DTI prediction.

6/5/2024

🔮

A Cross-Field Fusion Strategy for Drug-Target Interaction Prediction

Hongzhi Zhang, Xiuwen Gong, Shirui Pan, Jia Wu, Bo Du, Wenbin Hu

Drug-target interaction (DTI) prediction is a critical component of the drug discovery process. In the drug development engineering field, predicting novel drug-target interactions is extremely crucial.However, although existing methods have achieved high accuracy levels in predicting known drugs and drug targets, they fail to utilize global protein information during DTI prediction. This leads to an inability to effectively predict interaction the interactions between novel drugs and their targets. As a result, the cross-field information fusion strategy is employed to acquire local and global protein information. Thus, we propose the siamese drug-target interaction SiamDTI prediction method, which utilizes a double channel network structure for cross-field supervised learning.Experimental results on three benchmark datasets demonstrate that SiamDTI achieves higher accuracy levels than other state-of-the-art (SOTA) methods on novel drugs and targets.Additionally, SiamDTI's performance with known drugs and targets is comparable to that of SOTA approachs. The code is available at https://anonymous.4open.science/r/DDDTI-434D.

5/24/2024

🔮

HiGraphDTI: Hierarchical Graph Representation Learning for Drug-Target Interaction Prediction

Bin Liu, Siqi Wu, Jin Wang, Xin Deng, Ao Zhou

The discovery of drug-target interactions (DTIs) plays a crucial role in pharmaceutical development. The deep learning model achieves more accurate results in DTI prediction due to its ability to extract robust and expressive features from drug and target chemical structures. However, existing deep learning methods typically generate drug features via aggregating molecular atom representations, ignoring the chemical properties carried by motifs, i.e., substructures of the molecular graph. The atom-drug double-level molecular representation learning can not fully exploit structure information and fails to interpret the DTI mechanism from the motif perspective. In addition, sequential model-based target feature extraction either fuses limited contextual information or requires expensive computational resources. To tackle the above issues, we propose a hierarchical graph representation learning-based DTI prediction method (HiGraphDTI). Specifically, HiGraphDTI learns hierarchical drug representations from triple-level molecular graphs to thoroughly exploit chemical information embedded in atoms, motifs, and molecules. Then, an attentional feature fusion module incorporates information from different receptive fields to extract expressive target features.Last, the hierarchical attention mechanism identifies crucial molecular segments, which offers complementary views for interpreting interaction mechanisms. The experiment results not only demonstrate the superiority of HiGraphDTI to the state-of-the-art methods, but also confirm the practical ability of our model in interaction interpretation and new DTI discovery.

4/17/2024

Accurate Prediction of Ligand-Protein Interaction Affinities with Fine-Tuned Small Language Models

Ben Fauber

We describe the accurate prediction of ligand-protein interaction (LPI) affinities, also known as drug-target interactions (DTI), with instruction fine-tuned pretrained generative small language models (SLMs). We achieved accurate predictions for a range of affinity values associated with ligand-protein interactions on out-of-sample data in a zero-shot setting. Only the SMILES string of the ligand and the amino acid sequence of the protein were used as the model inputs. Our results demonstrate a clear improvement over machine learning (ML) and free-energy perturbation (FEP+) based methods in accurately predicting a range of ligand-protein interaction affinities, which can be leveraged to further accelerate drug discovery campaigns against challenging therapeutic targets.

7/2/2024