Bayesian Survival Analysis by Approximate Inference of Neural Networks

2404.06421

Published 6/21/2024 by Christian Marius Lillelund, Martin Magris, Christian Fischer Pedersen

Abstract

Variational Inference (VI) is a commonly used technique for approximate Bayesian inference and uncertainty estimation in deep learning models, yet it comes at a computational cost, as it doubles the number of trainable parameters to represent uncertainty. This rapidly becomes challenging in high-dimensional settings and motivates the use of alternative techniques for inference, such as Monte Carlo Dropout (MCD) or Spectral-normalized Neural Gaussian Process (SNGP). However, such methods have seen little adoption in survival analysis, and VI remains the prevalent approach for training probabilistic neural networks. In this paper, we investigate how to train deep probabilistic survival models in large datasets without introducing additional overhead in model complexity. To achieve this, we adopt three probabilistic approaches, namely VI, MCD, and SNGP, and evaluate them in terms of their prediction performance, calibration performance, and model complexity. In the context of probabilistic survival analysis, we investigate whether non-VI techniques can offer comparable or possibly improved prediction performance and uncertainty calibration compared to VI. In the MIMIC-IV dataset, we find that MCD aligns with VI in terms of the concordance index (0.748 vs. 0.743) and mean absolute error (254.9 vs. 254.7) using hinge loss, while providing C-calibrated uncertainty estimates. Moreover, our SNGP implementation provides D-calibrated survival functions in all datasets compared to VI (4/4 vs. 2/4, respectively). Our work encourages the use of techniques alternative to VI for survival analysis in high-dimensional datasets, where computational efficiency and overhead are of concern.

Overview

Presents a Bayesian approach to survival analysis using neural networks
Focuses on estimating uncertainty in survival predictions
Demonstrates how this approach can outperform traditional survival analysis methods

Plain English Explanation

This research paper introduces a new way to analyze survival data using neural networks. Survival analysis is the study of how long it takes for certain events to happen, like how long a patient lives after a medical treatment. Traditional survival analysis methods can struggle to capture the complexity of real-world data, but the authors show that a Bayesian neural network approach can provide more accurate and reliable predictions.

The key idea is to use a neural network to model the underlying survival distribution, rather than making assumptions about the shape of the distribution. This allows the model to learn patterns in the data more flexibly. The Bayesian approach also provides a way to quantify the uncertainty in the predictions, which is important for making informed decisions in high-stakes applications like healthcare.

The paper demonstrates the effectiveness of this method on several real-world datasets, showing that it can outperform standard survival analysis techniques. By combining the power of neural networks with the principled uncertainty quantification of Bayesian inference, this research offers a promising new tool for researchers and practitioners working with survival data.

Technical Explanation

The paper presents a Bayesian approach to survival analysis using neural networks, referred to in this summary as Bayesian Neural Survival Analysis (BNSA). Rather than making restrictive assumptions about the underlying survival distribution, the method uses a neural network to model the survival function directly from the data.

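The authors employ a Bayesian neural network to estimate the survival probabilities. This allows the model to capture complex, nonlinear relationships in the data while also providing principled uncertainty estimates through the Bayesian framework. The survival function is parameterized by the network, and the posterior is approximated rather than computed exactly. Per the abstract, the paper adopts three techniques for this, Variational Inference (VI), Monte Carlo Dropout (MCD), and Spectral-normalized Neural Gaussian Process (SNGP), and compares them on prediction performance, calibration performance, and model complexity.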
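To make the idea of approximate inference concrete, the following is a minimal sketch of Monte Carlo Dropout, one of the techniques named in the abstract. It is not the authors' implementation: the tiny network, its hand-picked weights, the dropout rate, and the function names `forward` and `mc_dropout_predict` are all hypothetical, and the output is a toy "probability of surviving past a fixed horizon" rather than a full survival function. The point it illustrates is that dropout stays switched on at prediction time, so repeated forward passes give a spread of predictions whose standard deviation serves as an uncertainty estimate.

```python
import math
import random
import statistics


def forward(x, w_hidden, w_out, drop_rate, rng):
    """One stochastic forward pass with dropout left on at prediction time."""
    keep = 1.0 - drop_rate
    hidden = []
    for weights in w_hidden:
        act = max(0.0, sum(w * xi for w, xi in zip(weights, x)))  # ReLU unit
        # Inverted dropout: zero the unit with probability drop_rate,
        # otherwise rescale so the expected activation is unchanged.
        mask = 1.0 if rng.random() < keep else 0.0
        hidden.append(act * mask / keep)
    logit = sum(w * h for w, h in zip(w_out, hidden))
    return 1.0 / (1.0 + math.exp(-logit))  # toy survival probability


def mc_dropout_predict(x, w_hidden, w_out, drop_rate=0.25, n_samples=200, seed=0):
    """Mean and standard deviation over repeated stochastic forward passes."""
    rng = random.Random(seed)
    draws = [forward(x, w_hidden, w_out, drop_rate, rng) for _ in range(n_samples)]
    return statistics.mean(draws), statistics.stdev(draws)


# Hypothetical weights for illustration only: 3 inputs, 4 hidden units.
W_HIDDEN = [[0.5, -0.2, 0.1], [0.3, 0.8, -0.5], [-0.6, 0.4, 0.9], [0.2, 0.2, 0.2]]
W_OUT = [0.7, -1.1, 0.5, 0.3]

mean_p, std_p = mc_dropout_predict([1.0, 0.5, -0.3], W_HIDDEN, W_OUT)
print(f"predicted survival probability: {mean_p:.3f} +/- {std_p:.3f}")
```

Setting `drop_rate=0.0` in this sketch collapses every pass to the same deterministic prediction, which shows that the uncertainty estimate comes entirely from the dropout noise and requires no extra trainable parameters.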

Experiments on several real-world survival analysis datasets demonstrate that BNSA can outperform traditional methods like Cox proportional hazards and parametric survival models. The improved performance is attributed to the flexible modeling capacity of neural networks and the ability to quantify uncertainty.
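The abstract reports prediction performance with the concordance index. As a rough illustration of what that metric measures, here is a minimal sketch of Harrell's concordance index for right-censored data. The function name `concordance_index` and the four-subject example are hypothetical, and the paper's own evaluation code may handle ties and censoring differently.

```python
def concordance_index(times, events, risks):
    """Fraction of comparable pairs that the risk scores order correctly.

    A pair (i, j) is comparable when subject i has the shorter time and an
    observed event (events[i] is truthy). The pair is concordant when the
    subject with the shorter time also has the higher predicted risk.
    Ties in risk count as half a concordant pair.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a censored subject cannot be the earlier member of a pair
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1.0
                elif risks[i] == risks[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical data: follow-up times, event indicators (0 = censored), risk scores.
times = [5, 8, 12, 20]
events = [1, 0, 1, 1]
risks = [0.9, 0.4, 0.1, 0.6]
print(concordance_index(times, events, risks))  # 0.75
```

In this example there are four comparable pairs, and the risk scores order three of them correctly. A value of 0.5 corresponds to random ordering and 1.0 to a perfect ranking.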

Critical Analysis

The authors acknowledge several limitations of the proposed approach. First, the variational inference technique used for training the Bayesian neural network may not scale well to very large datasets, because, as the abstract notes, it doubles the number of trainable parameters in order to represent uncertainty. Alternative inference methods such as MCD or SNGP may be required in such cases. Additionally, the paper does not explore the interpretability of the trained models, which is an important consideration for many real-world applications.
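The scaling concern can be made concrete with a small parameter count. The sketch below assumes a mean-field Gaussian form of VI, in which every weight and bias gets its own mean and scale parameter. The layer sizes and function names are hypothetical and are not taken from the paper.

```python
def dense_param_count(layer_sizes):
    """Weights plus biases of a fully connected network."""
    return sum(
        n_in * n_out + n_out
        for n_in, n_out in zip(layer_sizes, layer_sizes[1:])
    )


def mean_field_vi_param_count(layer_sizes):
    """Assumed mean-field Gaussian VI: one mean and one scale per parameter."""
    return 2 * dense_param_count(layer_sizes)


sizes = [100, 64, 32, 1]  # hypothetical: 100 input features, two hidden layers
base = dense_param_count(sizes)
vi = mean_field_vi_param_count(sizes)
print(f"deterministic network: {base} parameters")  # 8577
print(f"mean-field VI network: {vi} parameters")    # 17154
```

A dropout-based approach reuses the deterministic network's weights and adds only random masks, so under these assumptions its trainable parameter count stays at the first figure.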

While the results show promising improvements over traditional survival analysis methods, further research is needed to fully understand the strengths and weaknesses of the BNSA approach. For example, it would be valuable to investigate the stability and robustness of the predictions, as well as the model's ability to generalize to new, unseen data. Bayesian methods for causal inference may also provide insights into how BNSA can be applied to study treatment effects and other causal questions in survival analysis.

Conclusion

The Bayesian Neural Survival Analysis (BNSA) framework presented in this paper offers a promising new approach to survival analysis that can outperform traditional methods. By combining the flexibility of neural networks with the principled uncertainty quantification of Bayesian inference, BNSA provides a powerful tool for modeling complex survival data. While the method has some limitations, the results suggest that this research represents an important step forward in the field of survival analysis, with potential applications in healthcare, finance, and other domains where accurate predictions of time-to-event outcomes are crucial.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Confidence Intervals and Simultaneous Confidence Bands Based on Deep Learning

Asaf Ben Arie, Malka Gorfine

Deep learning models have significantly improved prediction accuracy in various fields, gaining recognition across numerous disciplines. Yet, an aspect of deep learning that remains insufficiently addressed is the assessment of prediction uncertainty. Producing reliable uncertainty estimators could be crucial in practical terms. For instance, predictions associated with a high degree of uncertainty could be sent for further evaluation. Recent works in uncertainty quantification of deep learning predictions, including Bayesian posterior credible intervals and a frequentist confidence-interval estimation, have proven to yield either invalid or overly conservative intervals. Furthermore, there is currently no method for quantifying uncertainty that can accommodate deep neural networks for survival (time-to-event) data that involves right-censored outcomes. In this work, we provide a valid non-parametric bootstrap method that correctly disentangles data uncertainty from the noise inherent in the adopted optimization algorithm, ensuring that the resulting point-wise confidence intervals or the simultaneous confidence bands are accurate (i.e., valid and not overly conservative). The proposed ad-hoc method can be easily integrated into any deep neural network without interfering with the training process. The utility of the proposed approach is illustrated by constructing simultaneous confidence bands for survival curves derived from deep neural networks for survival data with right censoring.

6/21/2024

stat.ML cs.LG

Sparse Bayesian Networks: Efficient Uncertainty Quantification in Medical Image Analysis

Zeinab Abboud, Herve Lombaert, Samuel Kadoury

Efficiently quantifying predictive uncertainty in medical images remains a challenge. While Bayesian neural networks (BNN) offer predictive uncertainty, they require substantial computational resources to train. Although Bayesian approximations such as ensembles have shown promise, they still suffer from high training and inference costs. Existing approaches mainly address the costs of BNN inference post-training, with little focus on improving training efficiency and reducing parameter complexity. This study introduces a training procedure for a sparse (partial) Bayesian network. Our method selectively assigns a subset of parameters as Bayesian by assessing their deterministic saliency through gradient sensitivity analysis. The resulting network combines deterministic and Bayesian parameters, exploiting the advantages of both representations to achieve high task-specific performance and minimize predictive uncertainty. Demonstrated on multi-label ChestMNIST for classification and ISIC, LIDC-IDRI for segmentation, our approach achieves competitive performance and predictive uncertainty estimation by reducing Bayesian parameters by over 95%, significantly reducing computational expenses compared to fully Bayesian and ensemble methods.

6/12/2024

cs.CV

A Study of Bayesian Neural Network Surrogates for Bayesian Optimization

Yucen Lily Li, Tim G. J. Rudner, Andrew Gordon Wilson

Bayesian optimization is a highly efficient approach to optimizing objective functions which are expensive to query. These objectives are typically represented by Gaussian process (GP) surrogate models which are easy to optimize and support exact inference. While standard GP surrogates have been well-established in Bayesian optimization, Bayesian neural networks (BNNs) have recently become practical function approximators, with many benefits over standard GPs such as the ability to naturally handle non-stationarity and learn representations for high-dimensional data. In this paper, we study BNNs as alternatives to standard GP surrogates for optimization. We consider a variety of approximate inference procedures for finite-width BNNs, including high-quality Hamiltonian Monte Carlo, low-cost stochastic MCMC, and heuristics such as deep ensembles. We also consider infinite-width BNNs, linearized Laplace approximations, and partially stochastic models such as deep kernel learning. We evaluate this collection of surrogate models on diverse problems with varying dimensionality, number of objectives, non-stationarity, and discrete and continuous inputs. We find: (i) the ranking of methods is highly problem dependent, suggesting the need for tailored inductive biases; (ii) HMC is the most successful approximate inference procedure for fully stochastic BNNs; (iii) full stochasticity may be unnecessary as deep kernel learning is relatively competitive; (iv) deep ensembles perform relatively poorly; (v) infinite-width BNNs are particularly promising, especially in high dimensions.

5/9/2024

cs.LG stat.ML

Model Free Prediction with Uncertainty Assessment

Yuling Jiao, Lican Kang, Jin Liu, Heng Peng, Heng Zuo

Deep nonparametric regression, characterized by the utilization of deep neural networks to learn target functions, has emerged as a focus of research attention in recent years. Despite considerable progress in understanding convergence rates, the absence of asymptotic properties hinders rigorous statistical inference. To address this gap, we propose a novel framework that transforms the deep estimation paradigm into a platform conducive to conditional mean estimation, leveraging the conditional diffusion model. Theoretically, we develop an end-to-end convergence rate for the conditional diffusion model and establish the asymptotic normality of the generated samples. Consequently, we are equipped to construct confidence regions, facilitating robust statistical inference. Furthermore, through numerical experiments, we empirically validate the efficacy of our proposed methodology.

6/18/2024

stat.ML cs.LG