Neural Topic Models with Survival Supervision: Jointly Predicting Time-to-Event Outcomes and Learning How Clinical Features Relate

Read original: arXiv:2007.07796 - Published 6/6/2024 by George H. Chen, Linhong Li, Ren Zuo, Amanda Coston, Jeremy C. Weiss

🧠

Overview

The researchers present a neural network framework for learning a survival model to predict time-to-event outcomes while simultaneously learning a topic model that reveals feature relationships.
The framework models each subject as a distribution over topics, where a topic could represent factors like age, disorder, or disease.
The presence of a topic in a subject means certain clinical features are more likely to appear, and topics are learned in a supervised manner to predict time-to-event outcomes.
The framework can combine various topic and survival models, and training scales to large datasets using standard neural network optimizers and minibatch gradient descent.
The researchers address practical implementation issues and visualize results to assist clinical interpretation.
They evaluate the framework on clinical datasets for predicting time until death and hospital ICU length of stay, finding it achieves competitive accuracy while providing interpretable clinical topics.

Plain English Explanation

The researchers have developed a machine learning system that can simultaneously learn two important things from medical data:

Survival Model: A model that can predict how long it will take for a certain event to happen, such as a patient's death or length of hospital stay.
Topic Model: A way of identifying related "topics" or factors in the data, such as age groups, disorders, or diseases. These topics help explain how different clinical features are connected.

The key innovation is that the system learns these two models together, using the topics to improve the survival predictions. For example, the system might find that certain age groups or medical conditions are associated with a higher risk of mortality.

The researchers show that this joint learning approach can be applied flexibly, combining different types of topic models (e.g. LDA) and survival models (e.g. Cox model). This allows the system to be tailored to different medical domains and datasets.

The researchers also address practical challenges in applying these models to real-world clinical data, such as how to visualize the results to help doctors interpret the findings. They demonstrate the effectiveness of their framework on several healthcare datasets, showing it can achieve accurate survival predictions while also providing interpretable insights.

Technical Explanation

The key components of the researchers' neural network framework are:

Topic Model: Each patient is represented as a distribution over "topics", where a topic could correspond to factors like age, disorder, or disease. The presence of a topic indicates certain clinical features are more likely for that patient.
Survival Model: The topic distributions are used as input features to predict a time-to-event outcome, such as time until death or hospital discharge.
Joint Learning: The topic and survival models are learned together in a supervised manner, with the topic model optimized to improve the survival predictions.

This joint modeling approach allows the framework to discover relationships between clinical features and outcomes, encoded in the learned topics. For example, the system might find a "diabetes" topic that is predictive of shorter survival times.

The researchers show how this framework can incorporate different types of topic models, such as Latent Dirichlet Allocation (LDA), and survival models, such as the Cox proportional hazards model. This flexibility enables the framework to be adapted to various medical domains and datasets.

To address practical challenges, the researchers describe techniques for handling missing data, visualizing topic-feature relationships, and interpreting the clinical relevance of the learned topics. They evaluate their framework on several healthcare datasets, including predicting time until death and hospital ICU length of stay, demonstrating competitive predictive performance and interpretable clinical insights.

Critical Analysis

The researchers provide a thorough discussion of the limitations and caveats of their approach. One key issue is the potential for topic models to capture spurious correlations in the data, leading to topics that are not clinically meaningful. The researchers suggest addressing this by incorporating domain knowledge and expert input when interpreting the learned topics.

Another limitation is the reliance on the Cox proportional hazards model, which assumes a linear relationship between the covariates and the log-hazard. This may not always be appropriate, and the researchers acknowledge the need for more flexible survival models within their framework.

The researchers also note that their framework, like many machine learning models, may be susceptible to issues like dataset shift and lack of generalizability. Careful evaluation on diverse datasets and populations would be necessary to assess the broader applicability of their approach.

Overall, the researchers have made a valuable contribution by proposing a flexible and interpretable framework for jointly learning survival and topic models. However, as with any complex machine learning system, there are potential limitations and areas for further research and refinement.

Conclusion

The researchers have presented a novel neural network framework that can simultaneously learn a survival model to predict time-to-event outcomes and a topic model to reveal relationships between clinical features. This joint learning approach allows the framework to discover interpretable topics that are predictive of patient outcomes, such as time until death or hospital discharge.

The flexibility of the framework, which supports combining different topic and survival models, enables it to be adapted to various medical domains and datasets. The researchers have also addressed practical implementation challenges, demonstrating the effectiveness of their approach on several real-world healthcare datasets.

While the framework has limitations and caveats, as discussed in the critical analysis, it represents an important step forward in developing machine learning systems that can provide both accurate predictions and clinically meaningful insights. As the field of healthcare AI continues to evolve, approaches like this that balance predictive power and interpretability will be increasingly valuable for supporting clinical decision-making and improving patient outcomes.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🧠

Neural Topic Models with Survival Supervision: Jointly Predicting Time-to-Event Outcomes and Learning How Clinical Features Relate

George H. Chen, Linhong Li, Ren Zuo, Amanda Coston, Jeremy C. Weiss

We present a neural network framework for learning a survival model to predict a time-to-event outcome while simultaneously learning a topic model that reveals feature relationships. In particular, we model each subject as a distribution over topics, where a topic could, for instance, correspond to an age group, a disorder, or a disease. The presence of a topic in a subject means that specific clinical features are more likely to appear for the subject. Topics encode information about related features and are learned in a supervised manner to predict a time-to-event outcome. Our framework supports combining many different topic and survival models; training the resulting joint survival-topic model readily scales to large datasets using standard neural net optimizers with minibatch gradient descent. For example, a special case is to combine LDA with a Cox model, in which case a subject's distribution over topics serves as the input feature vector to the Cox model. We explain how to address practical implementation issues that arise when applying these neural survival-supervised topic models to clinical data, including how to visualize results to assist clinical interpretation. We study the effectiveness of our proposed framework on seven clinical datasets on predicting time until death as well as hospital ICU length of stay, where we find that neural survival-supervised topic models achieve competitive accuracy with existing approaches while yielding interpretable clinical topics that explain feature relationships. Our code is available at: https://github.com/georgehc/survival-topics

6/6/2024

TransformerLSR: Attentive Joint Model of Longitudinal Data, Survival, and Recurrent Events with Concurrent Latent Structure

Zhiyue Zhang, Yao Zhao, Yanxun Xu

In applications such as biomedical studies, epidemiology, and social sciences, recurrent events often co-occur with longitudinal measurements and a terminal event, such as death. Therefore, jointly modeling longitudinal measurements, recurrent events, and survival data while accounting for their dependencies is critical. While joint models for the three components exist in statistical literature, many of these approaches are limited by heavy parametric assumptions and scalability issues. Recently, incorporating deep learning techniques into joint modeling has shown promising results. However, current methods only address joint modeling of longitudinal measurements at regularly-spaced observation times and survival events, neglecting recurrent events. In this paper, we develop TransformerLSR, a flexible transformer-based deep modeling and inference framework to jointly model all three components simultaneously. TransformerLSR integrates deep temporal point processes into the joint modeling framework, treating recurrent and terminal events as two competing processes dependent on past longitudinal measurements and recurrent event times. Additionally, TransformerLSR introduces a novel trajectory representation and model architecture to potentially incorporate a priori knowledge of known latent structures among concurrent longitudinal variables. We demonstrate the effectiveness and necessity of TransformerLSR through simulation studies and analyzing a real-world medical dataset on patients after kidney transplantation.

4/8/2024

Bayesian Survival Analysis by Approximate Inference of Neural Networks

Christian Marius Lillelund, Martin Magris, Christian Fischer Pedersen

Variational Inference (VI) is a commonly used technique for approximate Bayesian inference and uncertainty estimation in deep learning models, yet it comes at a computational cost, as it doubles the number of trainable parameters to represent uncertainty. This rapidly becomes challenging in high-dimensional settings and motivates the use of alternative techniques for inference, such as Monte Carlo Dropout (MCD) or Spectral-normalized Neural Gaussian Process (SNGP). However, such methods have seen little adoption in survival analysis, and VI remains the prevalent approach for training probabilistic neural networks. In this paper, we investigate how to train deep probabilistic survival models in large datasets without introducing additional overhead in model complexity. To achieve this, we adopt three probabilistic approaches, namely VI, MCD, and SNGP, and evaluate them in terms of their prediction performance, calibration performance, and model complexity. In the context of probabilistic survival analysis, we investigate whether non-VI techniques can offer comparable or possibly improved prediction performance and uncertainty calibration compared to VI. In the MIMIC-IV dataset, we find that MCD aligns with VI in terms of the concordance index (0.748 vs. 0.743) and mean absolute error (254.9 vs. 254.7) using hinge loss, while providing C-calibrated uncertainty estimates. Moreover, our SNGP implementation provides D-calibrated survival functions in all datasets compared to VI (4/4 vs. 2/4, respectively). Our work encourages the use of techniques alternative to VI for survival analysis in high-dimensional datasets, where computational efficiency and overhead are of concern.

6/21/2024

🔮

Advancing Head and Neck Cancer Survival Prediction via Multi-Label Learning and Deep Model Interpretation

Meixu Chen, Kai Wang, Jing Wang

A comprehensive and reliable survival prediction model is of great importance to assist in the personalized management of Head and Neck Cancer (HNC) patients treated with curative Radiation Therapy (RT). In this work, we propose IMLSP, an Interpretable Multi-Label multi-modal deep Survival Prediction framework for predicting multiple HNC survival outcomes simultaneously and provide time-event specific visual explanation of the deep prediction process. We adopt Multi-Task Logistic Regression (MTLR) layers to convert survival prediction from a regression problem to a multi-time point classification task, and to enable predicting of multiple relevant survival outcomes at the same time. We also present Grad-TEAM, a Gradient-weighted Time-Event Activation Mapping approach specifically developed for deep survival model visual explanation, to generate patient-specific time-to-event activation maps. We evaluate our method with the publicly available RADCURE HNC dataset, where it outperforms the corresponding single-modal models and single-label models on all survival outcomes. The generated activation maps show that the model focuses primarily on the tumor and nodal volumes when making the decision and the volume of interest varies for high- and low-risk patients. We demonstrate that the multi-label learning strategy can improve the learning efficiency and prognostic performance, while the interpretable survival prediction model is promising to help understand the decision-making process of AI and facilitate personalized treatment.

5/10/2024