Deep evidential fusion with uncertainty quantification and contextual discounting for multimodal medical image segmentation

Read original: arXiv:2309.05919 - Published 8/20/2024 by Ling Huang, Su Ruan, Pierre Decazes, Thierry Denoeux

🤿

Overview

Single-modality medical images often lack sufficient information for accurate diagnosis
Physicians rely on multimodal medical images, like PET/CT, to reach reliable diagnoses
Effective fusion of multimodal information is crucial for reliable decision-making and explanation

Plain English Explanation

Doctors often need to look at multiple types of medical images, such as PET scans and CT scans, to accurately diagnose diseases. This is because a single type of medical image, like a PET scan alone, may not contain enough information for the doctor to make a reliable diagnosis. The key is being able to effectively combine, or "fuse," the information from multiple types of medical images.

In this paper, the researchers propose a framework that uses deep learning and the Dempster-Shafer theory of evidence to fuse multimodal medical image data for more accurate and reliable disease segmentation. The framework takes into account the reliability of each individual image type when segmenting different objects. It then combines the evidence from each image type using Dempster's rule to reach a final decision.

The researchers tested their framework on two medical imaging datasets - one with PET-CT scans of lymphomas and another with multiple MRI scans of brain tumors. The results showed that their method outperformed existing state-of-the-art approaches in terms of accuracy and reliability.

Technical Explanation

The proposed framework leverages deep learning and the Dempster-Shafer theory of evidence to effectively fuse multimodal medical image data.

The key innovation is a "contextual discounting" operation that takes into account the reliability of each individual image modality when segmenting different objects. This discounted evidence from each modality is then combined using Dempster's rule to reach a final decision.

The researchers evaluated their framework on two datasets - a PET-CT dataset with lymphomas and a multi-MRI dataset with brain tumors. The results showed that their method outperformed state-of-the-art multimodal fusion approaches in terms of both accuracy and reliability.

Critical Analysis

The paper provides a thorough evaluation of the proposed framework, including comparisons to existing state-of-the-art methods. However, it does not address certain limitations or potential issues that could be explored in future research.

For example, the framework assumes that the reliability of each modality can be accurately estimated, which may not always be the case in practice. Additionally, the paper does not discuss how the framework might perform on more diverse or larger medical imaging datasets.

Further research could also explore ways to improve the confidence and interpretability of the fusion decisions, as well as investigate semi-supervised or unsupervised approaches to reduce the need for labeled training data.

Conclusion

This paper presents a novel framework for effectively fusing multimodal medical image data using deep learning and the Dempster-Shafer theory of evidence. The key contribution is a contextual discounting operation that accounts for the reliability of each modality when segmenting different objects, leading to improved accuracy and reliability compared to existing methods.

The findings of this research have important implications for the field of medical image analysis, as they demonstrate the potential of advanced data fusion techniques to enhance diagnostic capabilities and decision-making. Further development and refinement of these methods could lead to significant improvements in clinical practice and patient outcomes.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🤿

Deep evidential fusion with uncertainty quantification and contextual discounting for multimodal medical image segmentation

Ling Huang, Su Ruan, Pierre Decazes, Thierry Denoeux

Single-modality medical images generally do not contain enough information to reach an accurate and reliable diagnosis. For this reason, physicians generally diagnose diseases based on multimodal medical images such as, e.g., PET/CT. The effective fusion of multimodal information is essential to reach a reliable decision and explain how the decision is made as well. In this paper, we propose a fusion framework for multimodal medical image segmentation based on deep learning and the Dempster-Shafer theory of evidence. In this framework, the reliability of each single modality image when segmenting different objects is taken into account by a contextual discounting operation. The discounted pieces of evidence from each modality are then combined by Dempster's rule to reach a final decision. Experimental results with a PET-CT dataset with lymphomas and a multi-MRI dataset with brain tumors show that our method outperforms the state-of-the-art methods in accuracy and reliability.

8/20/2024

Uncertainty-aware Evidential Fusion-based Learning for Semi-supervised Medical Image Segmentation

Yuanpeng He, Lijian Li

Although the existing uncertainty-based semi-supervised medical segmentation methods have achieved excellent performance, they usually only consider a single uncertainty evaluation, which often fails to solve the problem related to credibility completely. Therefore, based on the framework of evidential deep learning, this paper integrates the evidential predictive results in the cross-region of mixed and original samples to reallocate the confidence degree and uncertainty measure of each voxel, which is realized by emphasizing uncertain information of probability assignments fusion rule of traditional evidence theory. Furthermore, we design a voxel-level asymptotic learning strategy by introducing information entropy to combine with the fused uncertainty measure to estimate voxel prediction more precisely. The model will gradually pay attention to the prediction results with high uncertainty in the learning process, to learn the features that are difficult to master. The experimental results on LA, Pancreas-CT, ACDC and TBAD datasets demonstrate the superior performance of our proposed method in comparison with the existing state of the arts.

4/12/2024

Multi-modal Evidential Fusion Network for Trusted PET/CT Tumor Segmentation

Yuxuan Qi, Li Lin, Jiajun Wang, Jingya Zhang, Bin Zhang

Accurate segmentation of tumors in PET/CT images is important in computer-aided diagnosis and treatment of cancer. The key issue of such a segmentation problem lies in the effective integration of complementary information from PET and CT images. However, the quality of PET and CT images varies widely in clinical settings, which leads to uncertainty in the modality information extracted by networks. To take the uncertainty into account in multi-modal information fusion, this paper proposes a novel Multi-modal Evidential Fusion Network (MEFN) comprising a Cross-Modal Feature Learning (CFL) module and a Multi-modal Trusted Fusion (MTF) module. The CFL module reduces the domain gap upon modality conversion and highlights common tumor features, thereby alleviating the needs of the segmentation module to handle modality specificity. The MTF module utilizes mutual attention mechanisms and an uncertainty calibrator to fuse modality features based on modality uncertainty and then fuse the segmentation results under the guidance of Dempster-Shafer Theory. Besides, a new uncertainty perceptual loss is introduced to force the model focusing on uncertain features and hence improve its ability to extract trusted modality information. Extensive comparative experiments are conducted on two publicly available PET/CT datasets to evaluate the performance of our proposed method whose results demonstrate that our MEFN significantly outperforms state-of-the-art methods with improvements of 2.15% and 3.23% in DSC scores on the AutoPET dataset and the Hecktor dataset, respectively. More importantly, our model can provide radiologists with credible uncertainty of the segmentation results for their decision in accepting or rejecting the automatic segmentation results, which is particularly important for clinical applications. Our code will be available at https://github.com/QPaws/MEFN.

6/27/2024

🤿

A review of deep learning-based information fusion techniques for multimodal medical image classification

Yihao Li, Mostafa El Habib Daho, Pierre-Henri Conze, Rachid Zeghlache, Hugo Le Boit'e, Ramin Tadayoni, B'eatrice Cochener, Mathieu Lamard, Gwenol'e Quellec

Multimodal medical imaging plays a pivotal role in clinical diagnosis and research, as it combines information from various imaging modalities to provide a more comprehensive understanding of the underlying pathology. Recently, deep learning-based multimodal fusion techniques have emerged as powerful tools for improving medical image classification. This review offers a thorough analysis of the developments in deep learning-based multimodal fusion for medical classification tasks. We explore the complementary relationships among prevalent clinical modalities and outline three main fusion schemes for multimodal classification networks: input fusion, intermediate fusion (encompassing single-level fusion, hierarchical fusion, and attention-based fusion), and output fusion. By evaluating the performance of these fusion techniques, we provide insight into the suitability of different network architectures for various multimodal fusion scenarios and application domains. Furthermore, we delve into challenges related to network architecture selection, handling incomplete multimodal data management, and the potential limitations of multimodal fusion. Finally, we spotlight the promising future of Transformer-based multimodal fusion techniques and give recommendations for future research in this rapidly evolving field.

4/24/2024