MergeUp-augmented Semi-Weakly Supervised Learning for WSI Classification

Read original: arXiv:2408.12825 - Published 8/26/2024 by Mingxi Ouyang, Yuqiu Fu, Renao Yan, ShanShan Shi, Xitong Ling, Lianghui Zhu, Yonghong He, Tian Guan

MergeUp-augmented Semi-Weakly Supervised Learning for WSI Classification

Overview

This paper presents a new approach called "MergeUp" for semi-weakly supervised learning on whole slide images (WSI) for classification tasks.
The method combines multiple instance learning (MIL) and self-supervised learning to leverage both labeled and unlabeled data.
Experiments show MergeUp outperforms existing semi-weakly supervised and MIL methods on WSI classification benchmarks.

Plain English Explanation

The research tackles the challenge of classifying whole slide images (WSIs) - large digital scans of tissue samples used in medical diagnosis. This is a difficult task because WSIs contain a huge number of small image patches, and it's expensive to get detailed labels for each one.

The key idea behind the MergeUp approach is to combine two powerful machine learning techniques:

Multiple Instance Learning (MIL): This allows the model to learn from "bags" of image patches, where only the overall bag-level label is known, not the labels of individual patches.
Self-Supervised Learning: This leverages the structure of the unlabeled data itself to learn useful image representations, without needing human-provided labels.

By blending these techniques, MergeUp can effectively use both the limited labeled data and the abundant unlabeled WSI data to train accurate classification models. The experiments show this hybrid approach outperforms prior methods that use MIL or semi-supervised learning alone.

Technical Explanation

The core of the MergeUp approach is a neural network architecture that combines MIL and self-supervised learning. The network takes a bag of image patches from a WSI as input and outputs a classification prediction for the entire WSI.

The MIL component models the relationship between individual patches and the WSI-level label using an attention mechanism. This allows the network to identify the most informative patches for the classification task.

The self-supervised component learns general visual representations by training the network to predict relationships between randomly sampled image patches within each WSI. This unsupervised pretraining helps the network extract useful low-level features even without detailed labels.

During training, the network jointly optimizes both the MIL classification objective and the self-supervised pretext task, allowing the two components to reinforce each other. This "MergeUp" strategy is shown to outperform prior semi-weakly supervised and MIL methods on challenging WSI classification benchmarks.

Critical Analysis

The paper provides a thorough experimental evaluation, demonstrating the effectiveness of the MergeUp approach across multiple WSI classification datasets. However, a few potential limitations or areas for further research are worth considering:

The self-supervised pretraining relies on predicting relationships between randomly sampled image patches. Other self-supervised tasks, such as contrastive learning, may further improve the quality of the learned visual representations.
The method assumes the availability of some labeled WSI data, which may not always be the case in practice. Exploring ways to leverage only unlabeled data, perhaps through weakly supervised techniques, could further broaden the applicability of the approach.
The paper focuses on classification tasks, but MIL and semi-supervised learning have broader applications in areas like object detection or image segmentation on WSIs. Extending the MergeUp framework to these other tasks could be a valuable direction for future research.

Conclusion

The MergeUp approach presented in this paper offers a promising solution for leveraging both labeled and unlabeled whole slide images to train accurate classification models. By combining multiple instance learning and self-supervised learning, the method can effectively extract useful information from the complex WSI data, even when detailed annotations are scarce. The strong empirical results demonstrate the potential of this hybrid technique to advance the state of the art in WSI analysis and medical image understanding more broadly.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

MergeUp-augmented Semi-Weakly Supervised Learning for WSI Classification

Mingxi Ouyang, Yuqiu Fu, Renao Yan, ShanShan Shi, Xitong Ling, Lianghui Zhu, Yonghong He, Tian Guan

Recent advancements in computational pathology and artificial intelligence have significantly improved whole slide image (WSI) classification. However, the gigapixel resolution of WSIs and the scarcity of manual annotations present substantial challenges. Multiple instance learning (MIL) is a promising weakly supervised learning approach for WSI classification. Recently research revealed employing pseudo bag augmentation can encourage models to learn various data, thus bolstering models' performance. While directly inheriting the parents' labels can introduce more noise by mislabeling in training. To address this issue, we translate the WSI classification task from weakly supervised learning to semi-weakly supervised learning, termed SWS-MIL, where adaptive pseudo bag augmentation (AdaPse) is employed to assign labeled and unlabeled data based on a threshold strategy. Using the student-teacher pattern, we introduce a feature augmentation technique, MergeUp, which merges bags with low-priority bags to enhance inter-category information, increasing training data diversity. Experimental results on the CAMELYON-16, BRACS, and TCGA-LUNG datasets demonstrate the superiority of our method over existing state-of-the-art approaches, affirming its efficacy in WSI classification.

8/26/2024

Shapley Values-enabled Progressive Pseudo Bag Augmentation for Whole Slide Image Classification

Renao Yan, Qiehe Sun, Cheng Jin, Yiqing Liu, Yonghong He, Tian Guan, Hao Chen

In computational pathology, whole-slide image (WSI) classification presents a formidable challenge due to its gigapixel resolution and limited fine-grained annotations. Multiple-instance learning (MIL) offers a weakly supervised solution, yet refining instance-level information from bag-level labels remains challenging. While most of the conventional MIL methods use attention scores to estimate instance importance scores (IIS) which contribute to the prediction of the slide labels, these often lead to skewed attention distributions and inaccuracies in identifying crucial instances. To address these issues, we propose a new approach inspired by cooperative game theory: employing Shapley values to assess each instance's contribution, thereby improving IIS estimation. The computation of the Shapley value is then accelerated using attention, meanwhile retaining the enhanced instance identification and prioritization. We further introduce a framework for the progressive assignment of pseudo bags based on estimated IIS, encouraging more balanced attention distributions in MIL models. Our extensive experiments on CAMELYON-16, BRACS, TCGA-LUNG, and TCGA-BRCA datasets show our method's superiority over existing state-of-the-art approaches, offering enhanced interpretability and class-wise insights. Our source code is available at https://github.com/RenaoYan/PMIL.

9/6/2024

Advances in Multiple Instance Learning for Whole Slide Image Analysis: Techniques, Challenges, and Future Directions

Jun Wang, Yu Mao, Nan Guan, Chun Jason Xue

Whole slide images (WSIs) are gigapixel-scale digital images of H&E-stained tissue samples widely used in pathology. The substantial size and complexity of WSIs pose unique analytical challenges. Multiple Instance Learning (MIL) has emerged as a powerful approach for addressing these challenges, particularly in cancer classification and detection. This survey provides a comprehensive overview of the challenges and methodologies associated with applying MIL to WSI analysis, including attention mechanisms, pseudo-labeling, transformers, pooling functions, and graph neural networks. Additionally, it explores the potential of MIL in discovering cancer cell morphology, constructing interpretable machine learning models, and quantifying cancer grading. By summarizing the current challenges, methodologies, and potential applications of MIL in WSI analysis, this survey aims to inform researchers about the state of the field and inspire future research directions.

8/20/2024

Rethinking Multiple Instance Learning: Developing an Instance-Level Classifier via Weakly-Supervised Self-Training

Yingfan Ma, Xiaoyuan Luo, Mingzhi Yuan, Xinrong Chen, Manning Wang

Multiple instance learning (MIL) problem is currently solved from either bag-classification or instance-classification perspective, both of which ignore important information contained in some instances and result in limited performance. For example, existing methods often face difficulty in learning hard positive instances. In this paper, we formulate MIL as a semi-supervised instance classification problem, so that all the labeled and unlabeled instances can be fully utilized to train a better classifier. The difficulty in this formulation is that all the labeled instances are negative in MIL, and traditional self-training techniques used in semi-supervised learning tend to degenerate in generating pseudo labels for the unlabeled instances in this scenario. To resolve this problem, we propose a weakly-supervised self-training method, in which we utilize the positive bag labels to construct a global constraint and a local constraint on the pseudo labels to prevent them from degenerating and force the classifier to learn hard positive instances. It is worth noting that easy positive instances are instances are far from the decision boundary in the classification process, while hard positive instances are those close to the decision boundary. Through iterative optimization, the pseudo labels can gradually approach the true labels. Extensive experiments on two MNIST synthetic datasets, five traditional MIL benchmark datasets and two histopathology whole slide image datasets show that our method achieved new SOTA performance on all of them. The code will be publicly available.

8/12/2024