Detecting Domain Shift in Multiple Instance Learning for Digital Pathology Using Fr'echet Domain Distance

Read original: arXiv:2405.09934 - Published 5/17/2024 by Milda Poceviv{c}i=ut.e, Gabriel Eilertsen, Stina Garvin, Claes Lundstrom

Detecting Domain Shift in Multiple Instance Learning for Digital Pathology Using Fr'echet Domain Distance

Overview

This paper introduces a multiple instance learning (MIL) method for detecting domain shifts in digital pathology images.
The method uses an attention-based approach to identify key patches within whole slide images that are most indicative of a domain shift.
The researchers evaluate their approach on a dataset of digital pathology images and demonstrate its effectiveness in detecting domain shifts.

Plain English Explanation

The paper describes a new way to detect differences between medical images, such as those used in digital pathology. When doctors analyze medical images, they often need to compare images from different sources or taken at different times. However, these images can vary in subtle ways, making it challenging to identify meaningful differences.

The researchers developed a multiple instance learning (MIL) method that uses an attention-based approach to find the most important regions, or "patches," within the medical images. By focusing on these key patches, the method can more reliably detect differences between images, even when the overall images appear similar.

The researchers tested their approach on a dataset of digital pathology images and found that it was effective at identifying domain shifts - differences between images that could indicate important changes in the underlying medical condition. This type of domain shift detection is important in digital pathology and other medical imaging fields, where accurately identifying changes over time or between different data sources can have significant implications for patient care.

Technical Explanation

The paper presents a multiple instance learning (MIL) method for detecting domain shifts in digital pathology images. The method uses an attention-based approach to identify the most informative patches within whole slide images that are indicative of a domain shift.

The researchers first divide each whole slide image into a grid of smaller patches. They then train a neural network to classify each patch as either belonging to the source domain or the target domain. The network uses an attention mechanism to learn which patches are most important for making this classification.

During inference, the model takes a new whole slide image and outputs a domain shift score, which indicates the likelihood that the image belongs to a different domain than the source images. The score is computed as a weighted average of the patch-level domain predictions, with the weights determined by the attention mechanism.

The researchers evaluate their approach on a dataset of digital pathology images and show that it outperforms other domain shift detection methods, particularly when the domain shift is subtle or localized to specific regions of the image.

Critical Analysis

The researchers provide a thorough evaluation of their method, including comparisons to other domain shift detection approaches and an analysis of the attention-based mechanism's ability to identify the most informative image patches. However, the paper does not address some potential limitations of the approach.

For example, the method assumes that the domain shift is manifested in specific, localized regions of the image. In some cases, the domain shift may be more diffuse or distributed across the entire image, which could make it harder for the attention mechanism to identify the most relevant patches.

Additionally, the researchers only evaluate their approach on a single dataset of digital pathology images. It would be valuable to see how the method performs on a wider range of medical imaging data, such as remote sensing images or cross-modality medical scans, to assess its generalizability.

Further research could also explore ways to combine the attention-based patch selection with other domain adaptation or transfer learning techniques to improve the method's performance on challenging domain shift scenarios.

Conclusion

This paper presents a novel attention-based MIL method for detecting domain shifts in digital pathology images. The key innovation is the use of an attention mechanism to identify the most informative image patches, which allows the model to focus on the regions that are most indicative of a domain shift.

The researchers demonstrate the effectiveness of their approach on a dataset of digital pathology images, showing that it outperforms other domain shift detection methods. This work has important implications for the field of digital pathology, where accurately identifying changes over time or between different data sources is critical for patient care and clinical decision-making.

While the paper has some limitations, the attention-based MIL method represents a promising step forward in addressing the challenge of domain shift detection in medical imaging. Further research building on this work could lead to even more robust and generalizable solutions for this important problem.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Detecting Domain Shift in Multiple Instance Learning for Digital Pathology Using Fr'echet Domain Distance

Milda Poceviv{c}i=ut.e, Gabriel Eilertsen, Stina Garvin, Claes Lundstrom

Multiple-instance learning (MIL) is an attractive approach for digital pathology applications as it reduces the costs related to data collection and labelling. However, it is not clear how sensitive MIL is to clinically realistic domain shifts, i.e., differences in data distribution that could negatively affect performance, and if already existing metrics for detecting domain shifts work well with these algorithms. We trained an attention-based MIL algorithm to classify whether a whole-slide image of a lymph node contains breast tumour metastases. The algorithm was evaluated on data from a hospital in a different country and various subsets of this data that correspond to different levels of domain shift. Our contributions include showing that MIL for digital pathology is affected by clinically realistic differences in data, evaluating which features from a MIL model are most suitable for detecting changes in performance, and proposing an unsupervised metric named Fr'echet Domain Distance (FDD) for quantification of domain shifts. Shift measure performance was evaluated through the mean Pearson correlation to change in classification performance, where FDD achieved 0.70 on 10-fold cross-validation models. The baselines included Deep ensemble, Difference of Confidence, and Representation shift which resulted in 0.45, -0.29, and 0.56 mean Pearson correlation, respectively. FDD could be a valuable tool for care providers and vendors who need to verify if a MIL system is likely to perform reliably when implemented at a new site, without requiring any additional annotations from pathologists.

5/17/2024

cDP-MIL: Robust Multiple Instance Learning via Cascaded Dirichlet Process

Yihang Chen, Tsai Hor Chan, Guosheng Yin, Yuming Jiang, Lequan Yu

Multiple instance learning (MIL) has been extensively applied to whole slide histopathology image (WSI) analysis. The existing aggregation strategy in MIL, which primarily relies on the first-order distance (e.g., mean difference) between instances, fails to accurately approximate the true feature distribution of each instance, leading to biased slide-level representations. Moreover, the scarcity of WSI observations easily leads to model overfitting, resulting in unstable testing performance and limited generalizability. To tackle these challenges, we propose a new Bayesian nonparametric framework for multiple instance learning, which adopts a cascade of Dirichlet processes (cDP) to incorporate the instance-to-bag characteristic of the WSIs. We perform feature aggregation based on the latent clusters formed by the Dirichlet process, which incorporates the covariances of the patch features and forms more representative clusters. We then perform bag-level prediction with another Dirichlet process model on the bags, which imposes a natural regularization on learning to prevent overfitting and enhance generalizability. Moreover, as a Bayesian nonparametric method, the cDP model can accurately generate posterior uncertainty, which allows for the detection of outlier samples and tumor localization. Extensive experiments on five WSI benchmarks validate the superior performance of our method, as well as its generalizability and ability to estimate uncertainties. Codes are available at https://github.com/HKU-MedAI/cDPMIL.

7/22/2024

🖼️

Distilling High Diagnostic Value Patches for Whole Slide Image Classification Using Attention Mechanism

Tianhang Nan, Hao Quan, Yong Ding, Xingyu Li, Kai Yang, Xiaoyu Cui

Multiple Instance Learning (MIL) has garnered widespread attention in the field of Whole Slide Image (WSI) classification as it replaces pixel-level manual annotation with diagnostic reports as labels, significantly reducing labor costs. Recent research has shown that bag-level MIL methods often yield better results because they can consider all patches of the WSI as a whole. However, a drawback of such methods is the incorporation of more redundant patches, leading to interference. To extract patches with high diagnostic value while excluding interfering patches to address this issue, we developed an attention-based feature distillation multi-instance learning (AFD-MIL) approach. This approach proposed the exclusion of redundant patches as a preprocessing operation in weakly supervised learning, directly mitigating interference from extensive noise. It also pioneers the use of attention mechanisms to distill features with high diagnostic value, as opposed to the traditional practice of indiscriminately and forcibly integrating all patches. Additionally, we introduced global loss optimization to finely control the feature distillation module. AFD-MIL is orthogonal to many existing MIL methods, leading to consistent performance improvements. This approach has surpassed the current state-of-the-art method, achieving 91.47% ACC (accuracy) and 94.29% AUC (area under the curve) on the Camelyon16 (Camelyon Challenge 2016, breast cancer), while 93.33% ACC and 98.17% AUC on the TCGA-NSCLC (The Cancer Genome Atlas Program: non-small cell lung cancer). Different feature distillation methods were used for the two datasets, tailored to the specific diseases, thereby improving performance and interpretability.

8/19/2024

Domain Shift Analysis in Chest Radiographs Classification in a Veterans Healthcare Administration Population

Mayanka Chandrashekar, Ian Goethert, Md Inzamam Ul Haque, Benjamin McMahon, Sayera Dhaubhadel, Kathryn Knight, Joseph Erdos, Donna Reagan, Caroline Taylor, Peter Kuzmak, John Michael Gaziano, Eileen McAllister, Lauren Costa, Yuk-Lam Ho, Kelly Cho, Suzanne Tamang, Samah Fodeh-Jarad, Olga S. Ovchinnikova, Amy C. Justice, Jacob Hinkle, Ioana Danciu

Objectives: This study aims to assess the impact of domain shift on chest X-ray classification accuracy and to analyze the influence of ground truth label quality and demographic factors such as age group, sex, and study year. Materials and Methods: We used a DenseNet121 model pretrained MIMIC-CXR dataset for deep learning-based multilabel classification using ground truth labels from radiology reports extracted using the CheXpert and CheXbert Labeler. We compared the performance of the 14 chest X-ray labels on the MIMIC-CXR and Veterans Healthcare Administration chest X-ray dataset (VA-CXR). The VA-CXR dataset comprises over 259k chest X-ray images spanning between the years 2010 and 2022. Results: The validation of ground truth and the assessment of multi-label classification performance across various NLP extraction tools revealed that the VA-CXR dataset exhibited lower disagreement rates than the MIMIC-CXR datasets. Additionally, there were notable differences in AUC scores between models utilizing CheXpert and CheXbert. When evaluating multi-label classification performance across different datasets, minimal domain shift was observed in unseen datasets, except for the label Enlarged Cardiomediastinum. The study year's subgroup analyses exhibited the most significant variations in multi-label classification model performance. These findings underscore the importance of considering domain shifts in chest X-ray classification tasks, particularly concerning study years. Conclusion: Our study reveals the significant impact of domain shift and demographic factors on chest X-ray classification, emphasizing the need for improved transfer learning and equitable model development. Addressing these challenges is crucial for advancing medical imaging and enhancing patient care.

8/1/2024