LncRNA-disease association prediction method based on heterogeneous information completion and convolutional neural network

Read original: arXiv:2406.03406 - Published 6/6/2024 by Wen-Yu Xi, Juan Wang, Yu-Lin Zhang, Jin-Xing Liu, Yin-Lian Gao
Total Score

0

🔮

Sign in to get full access

or

If you already have an account, we'll log you in

Overview

  • The paper proposes a deep learning model called HCNNLDA for accurately predicting associations between long non-coding RNAs (lncRNAs) and diseases.
  • The model uses a heterogeneous network that includes lncRNAs, diseases, and microRNAs (miRNAs), and a convolutional neural network (CNN) to learn low-dimensional feature representations.
  • The model achieves high performance on a benchmark dataset and demonstrates effectiveness in identifying novel lncRNA-disease associations through case studies.

Plain English Explanation

Long non-coding RNAs (lncRNAs) are a class of RNA molecules that do not code for proteins but play crucial roles in various human diseases. Accurately identifying the relationships between lncRNAs and diseases is essential for disease prevention and treatment. However, existing methods have limitations in detecting complex, nonlinear associations between lncRNAs and diseases.

The researchers developed a deep learning model called HCNNLDA to address this challenge. The model first constructs a heterogeneous network that includes lncRNAs, diseases, and microRNAs (miRNAs), which are small RNA molecules that interact with lncRNAs. The model then uses a convolutional neural network (CNN) to learn low-dimensional feature representations of the lncRNA-disease pairs based on various biological information about these entities.

Finally, the model trains an XGBoost classifier to predict potential associations between lncRNAs and diseases. The researchers demonstrate that HCNNLDA outperforms several state-of-the-art methods in predicting lncRNA-disease associations and is effective in identifying novel associations through case studies of three diseases.

Technical Explanation

The researchers first construct a heterogeneous network that includes nodes representing lncRNAs, diseases, and miRNAs, as well as edges representing various biological relationships between these entities. They then build an embedding matrix for each lncRNA-disease pair based on features derived from the network structure and known biological information about the lncRNAs, diseases, and miRNAs.

The researchers use a convolutional neural network (CNN) to learn low-dimensional feature representations from the embedding matrices. The CNN architecture consists of convolutional, pooling, and fully connected layers, which capture the complex nonlinear relationships between lncRNAs and diseases.

Finally, the researchers train an XGBoost classifier on the learned feature representations to predict potential lncRNA-disease associations. The model is evaluated using a 5-fold cross-validation on a benchmark dataset, and it achieves an AUC (area under the receiver operating characteristic curve) of 0.9752 and an AUPR (area under the precision-recall curve) of 0.9740, outperforming several state-of-the-art methods.

The researchers further demonstrate the effectiveness of HCNNLDA in identifying novel lncRNA-disease associations through case studies of three diseases: colon cancer, gastric cancer, and lung cancer. The model is able to identify previously unknown associations that are supported by recent research findings.

Critical Analysis

The researchers have proposed a promising deep learning model for predicting lncRNA-disease associations. The use of a heterogeneous network and a CNN-based feature learning approach allows the model to capture complex, nonlinear relationships between lncRNAs and diseases, which is a key strength.

However, the paper does not provide much detail on the specific biological features used to construct the embedding matrices, nor does it discuss the interpretability of the learned feature representations. Enhancing the interpretability of the model could be an important area for future research, as it would help researchers better understand the mechanisms underlying lncRNA-disease associations.

Additionally, the researchers only evaluate the model on a single benchmark dataset. Validating the model's performance on a more diverse set of datasets would help establish its robustness and generalizability.

Overall, the paper presents a valuable contribution to the field of lncRNA-disease association prediction, and the HCNNLDA model has the potential to facilitate the development of new disease biomarkers and therapeutic targets. However, further research is needed to address the limitations and enhance the practical applicability of the approach.

Conclusion

The paper introduces a deep learning model called HCNNLDA for accurately predicting associations between long non-coding RNAs (lncRNAs) and diseases. The model leverages a heterogeneous network and a convolutional neural network to learn low-dimensional feature representations, which are then used to train a classifier to identify potential lncRNA-disease associations.

The researchers demonstrate that HCNNLDA outperforms several state-of-the-art methods on a benchmark dataset and is effective in identifying novel associations through case studies. This work highlights the potential of deep learning techniques, combined with the integration of diverse biological data, to advance our understanding of the complex roles of lncRNAs in human diseases. Further research in this area could have significant implications for disease prevention, diagnosis, and treatment.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🔮

Total Score

0

LncRNA-disease association prediction method based on heterogeneous information completion and convolutional neural network

Wen-Yu Xi, Juan Wang, Yu-Lin Zhang, Jin-Xing Liu, Yin-Lian Gao

The emerging research shows that lncRNA has crucial research value in a series of complex human diseases. Therefore, the accurate identification of lncRNA-disease associations (LDAs) is very important for the warning and treatment of diseases. However, most of the existing methods have limitations in identifying nonlinear LDAs, and it remains a huge challenge to predict new LDAs. In this paper, a deep learning model based on a heterogeneous network and convolutional neural network (CNN) is proposed for lncRNA-disease association prediction, named HCNNLDA. The heterogeneous network containing the lncRNA, disease, and miRNA nodes, is constructed firstly. The embedding matrix of a lncRNA-disease node pair is constructed according to various biological premises about lncRNAs, diseases, and miRNAs. Then, the low-dimensional feature representation is fully learned by the convolutional neural network. In the end, the XGBoot classifier model is trained to predict the potential LDAs. HCNNLDA obtains a high AUC value of 0.9752 and AUPR of 0.9740 under the 5-fold cross-validation. The experimental results show that the proposed model has better performance than that of several latest prediction models. Meanwhile, the effectiveness of HCNNLDA in identifying novel LDAs is further demonstrated by case studies of three diseases. To sum up, HCNNLDA is a feasible calculation model to predict LDAs.

Read more

6/6/2024

🌐

Total Score

0

Heterogeneous network and graph attention auto-encoder for LncRNA-disease association prediction

Jin-Xing Liu, Wen-Yu Xi, Ling-Yun Dai, Chun-Hou Zheng, Ying-Lian Gao

The emerging research shows that lncRNAs are associated with a series of complex human diseases. However, most of the existing methods have limitations in identifying nonlinear lncRNA-disease associations (LDAs), and it remains a huge challenge to predict new LDAs. Therefore, the accurate identification of LDAs is very important for the warning and treatment of diseases. In this work, multiple sources of biomedical data are fully utilized to construct characteristics of lncRNAs and diseases, and linear and nonlinear characteristics are effectively integrated. Furthermore, a novel deep learning model based on graph attention automatic encoder is proposed, called HGATELDA. To begin with, the linear characteristics of lncRNAs and diseases are created by the miRNA-lncRNA interaction matrix and miRNA-disease interaction matrix. Following this, the nonlinear features of diseases and lncRNAs are extracted using a graph attention auto-encoder, which largely retains the critical information and effectively aggregates the neighborhood information of nodes. In the end, LDAs can be predicted by fusing the linear and nonlinear characteristics of diseases and lncRNA. The HGATELDA model achieves an impressive AUC value of 0.9692 when evaluated using a 5-fold cross-validation indicating its superior performance in comparison to several recent prediction models. Meanwhile, the effectiveness of HGATELDA in identifying novel LDAs is further demonstrated by case studies. the HGATELDA model appears to be a viable computational model for predicting LDAs.

Read more

5/7/2024

🧠

Total Score

0

Convolutional Neural Networks for Predictive Modeling of Lung Disease

Yingbin Liang, Xiqing Liu, Haohao Xia, Yiru Cang, Zitao Zheng, Yuanfang Yang

In this paper, Pro-HRnet-CNN, an innovative model combining HRNet and void-convolution techniques, is proposed for disease prediction under lung imaging. Through the experimental comparison on the authoritative LIDC-IDRI dataset, we found that compared with the traditional ResNet-50, Pro-HRnet-CNN showed better performance in the feature extraction and recognition of small-size nodules, significantly improving the detection accuracy. Particularly within the domain of detecting smaller targets, the model has exhibited a remarkable enhancement in accuracy, thereby pioneering an innovative avenue for the early identification and prognostication of pulmonary conditions.

Read more

8/26/2024

Heterogeneous Causal Metapath Graph Neural Network for Gene-Microbe-Disease Association Prediction
Total Score

0

Heterogeneous Causal Metapath Graph Neural Network for Gene-Microbe-Disease Association Prediction

Kexin Zhang, Feng Huang, Luotao Liu, Zhankun Xiong, Hongyu Zhang, Yuan Quan, Wen Zhang

The recent focus on microbes in human medicine highlights their potential role in the genetic framework of diseases. To decode the complex interactions among genes, microbes, and diseases, computational predictions of gene-microbe-disease (GMD) associations are crucial. Existing methods primarily address gene-disease and microbe-disease associations, but the more intricate triple-wise GMD associations remain less explored. In this paper, we propose a Heterogeneous Causal Metapath Graph Neural Network (HCMGNN) to predict GMD associations. HCMGNN constructs a heterogeneous graph linking genes, microbes, and diseases through their pairwise associations, and utilizes six predefined causal metapaths to extract directed causal subgraphs, which facilitate the multi-view analysis of causal relations among three entity types. Within each subgraph, we employ a causal semantic sharing message passing network for node representation learning, coupled with an attentive fusion method to integrate these representations for predicting GMD associations. Our extensive experiments show that HCMGNN effectively predicts GMD associations and addresses association sparsity issue by enhancing the graph's semantics and structure.

Read more

6/28/2024