SC-MIL: Sparsely Coded Multiple Instance Learning for Whole Slide Image Classification

Read original: arXiv:2311.00048 - Published 8/2/2024 by Peijie Qiu, Pan Xiao, Wenhui Zhu, Yalin Wang, Aristeidis Sotiras

Overview

The paper proposes a sparse coding-based multiple instance learning (SC-MIL) method for weakly supervised whole slide image (WSI) classification.
Typical MIL methods have two main components: a feature embedding part and an MIL aggregator.
SC-MIL addresses both components at once by leveraging sparse dictionary learning.

Plain English Explanation

The paper presents a new approach for classifying whole slide images (WSIs) using multiple instance learning (MIL). MIL is a form of weakly supervised learning for cases where you have a collection, or "bag," of "instances" (e.g., image patches) and a label is available only for the whole collection (e.g., the whole slide image), not for the individual instances.

Typical MIL methods have two main parts: 1) a feature embedding part that converts the instances into numerical features, and 2) an aggregator that combines those instance features to make a prediction about the entire collection. This paper proposes a new way to handle both of these parts by using sparse dictionary learning.
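To make the two-part structure concrete, here is a minimal sketch of a generic MIL pipeline in NumPy. It is not the paper's implementation: the linear embedding, the attention-style pooling, and all names (embed_instances, attention_pool, classify_bag) and sizes are assumptions chosen for illustration, and the weights are random rather than trained.

```python
import numpy as np

def embed_instances(patches, W):
    """Hypothetical feature embedding: flatten each patch and project it linearly."""
    flat = patches.reshape(patches.shape[0], -1)
    return flat @ W

def attention_pool(features, v):
    """Aggregate instance features into one bag feature using softmax weights."""
    scores = features @ v              # one relevance score per instance
    scores = scores - scores.max()     # stabilise the softmax numerically
    weights = np.exp(scores) / np.exp(scores).sum()
    return weights @ features, weights

def classify_bag(bag_feature, w, b):
    """Logistic classifier applied to the bag-level feature."""
    return 1.0 / (1.0 + np.exp(-(bag_feature @ w + b)))

rng = np.random.default_rng(0)
patches = rng.normal(size=(50, 8, 8))  # toy bag: 50 "patches" of 8x8 pixels
W = rng.normal(size=(64, 16)) * 0.1    # untrained embedding weights (assumption)
v = rng.normal(size=16)                # untrained attention vector (assumption)
w = rng.normal(size=16)                # untrained classifier weights (assumption)

features = embed_instances(patches, W)           # part 1: feature embedding
bag_feature, weights = attention_pool(features, v)  # part 2: aggregation
prob = classify_bag(bag_feature, w, 0.0)         # one prediction for the whole bag
```

Only the bag-level output prob would be compared against a label during training, which is what makes the setting weakly supervised.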

The key idea is to represent each instance as a sparse linear combination of a set of "dictionary atoms." This sparse coding captures the similarities between instances and helps identify the most relevant ones for classification. The paper also shows how to integrate this sparse coding approach into a deep learning framework in a way that is efficient to compute.
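As an illustration of what "a sparse linear combination of dictionary atoms" means, the sketch below solves the standard sparse coding problem, minimising 0.5*||x - D a||^2 + lam*||a||_1, with the classic iterative soft-thresholding algorithm (ISTA). This is a textbook formulation assumed here for illustration; the summary does not state the paper's exact objective or solver, and the dictionary is random rather than learned.

```python
import numpy as np

def soft_threshold(z, theta):
    """Shrink every entry of z toward zero by theta; small entries become exactly zero."""
    return np.sign(z) * np.maximum(np.abs(z) - theta, 0.0)

def ista(x, D, lam=0.1, n_iter=200):
    """Minimise 0.5*||x - D a||^2 + lam*||a||_1 by iterative soft-thresholding."""
    L = np.linalg.norm(D, 2) ** 2      # squared spectral norm: Lipschitz constant of the gradient
    a = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ a - x)       # gradient of the reconstruction term
        a = soft_threshold(a - grad / L, lam / L)
    return a

rng = np.random.default_rng(0)
D = rng.normal(size=(16, 64))          # overcomplete: 64 atoms for 16-dimensional features
D /= np.linalg.norm(D, axis=0)         # unit-norm atoms
a_true = np.zeros(64)
a_true[[3, 20, 41]] = [1.5, -2.0, 1.0] # the toy instance uses only three atoms
x = D @ a_true

a_hat = ista(x, D, lam=0.05, n_iter=500)
x_hat = D @ a_hat                      # reconstruction from the sparse code
```

The L1 penalty is what drives most coefficients to zero, so each instance ends up described by a small number of atoms.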

Experiments on multiple datasets show that this sparse coding-based MIL method can significantly improve the performance of state-of-the-art MIL techniques for WSI classification.

Technical Explanation

The proposed SC-MIL method addresses two key aspects of typical MIL frameworks:

Feature Embedding: The paper leverages sparse dictionary learning to capture the similarities between instances and suppress irrelevant ones. Each instance is represented as a sparse linear combination of atoms in an overcomplete dictionary.
MIL Aggregation: The sparse coding approach is integrated into the MIL aggregator through a deep unrolling technique. This allows the sparse coding to be end-to-end trainable within the MIL framework.
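Deep unrolling can be sketched as follows: a fixed number of iterations of a sparse coding solver become the layers of a network, and the dictionary, step sizes, and thresholds become parameters. The version below unrolls iterative soft-thresholding (ISTA) in NumPy and shows only the forward pass. It is a generic, LISTA-style illustration under assumed names and sizes (UnrolledSparseCoder, n_layers=5), not the architecture from the paper; in practice the parameters would live in a deep learning framework so they can be trained by backpropagation.

```python
import numpy as np

def soft_threshold(z, theta):
    """Shrink every entry of z toward zero by theta."""
    return np.sign(z) * np.maximum(np.abs(z) - theta, 0.0)

class UnrolledSparseCoder:
    """A fixed stack of ISTA steps; D, steps and thetas play the role of trainable parameters."""

    def __init__(self, dim, n_atoms, n_layers=5, seed=0):
        rng = np.random.default_rng(seed)
        D = rng.normal(size=(dim, n_atoms))
        self.D = D / np.linalg.norm(D, axis=0)     # dictionary with unit-norm atoms
        L = np.linalg.norm(self.D, 2) ** 2
        self.steps = np.full(n_layers, 1.0 / L)    # one step size per layer
        self.thetas = np.full(n_layers, 0.1 / L)   # one threshold per layer

    def forward(self, X):
        """X: (n_instances, dim) -> sparse codes (n_instances, n_atoms) and refined features (n_instances, dim)."""
        A = np.zeros((X.shape[0], self.D.shape[1]))
        for step, theta in zip(self.steps, self.thetas):
            residual = A @ self.D.T - X            # reconstruction error for every instance
            A = soft_threshold(A - step * (residual @ self.D), theta)
        return A, A @ self.D.T

coder = UnrolledSparseCoder(dim=16, n_atoms=64, n_layers=5)
X = np.random.default_rng(1).normal(size=(50, 16))  # toy bag of 50 instance features
codes, refined = coder.forward(X)
```

Because the loop has a fixed length and every operation is differentiable almost everywhere, the whole module can sit inside a network and be trained end to end with the rest of the MIL model.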

The key benefits of the SC-MIL method are:

Improved Feature Embeddings: The sparse coding suppresses irrelevant instances while retaining the most salient ones, leading to better instance-level features.
Modeling Instance Correlations: The sparse representation captures the similarities between instances, which can help the MIL aggregator make more accurate predictions.
Plug-and-Play Integration: The SC module can be easily incorporated into existing MIL frameworks without significantly increasing the computational cost.
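The plug-and-play idea can be illustrated with a small sketch: a refinement step that keeps the feature shape unchanged can be placed in front of any aggregator without modifying the aggregator itself. Everything here is hypothetical. In particular, shrink is a simple stand-in for the SC module, not the module from the paper, and with_refinement, max_pool, and mean_pool are illustrative names.

```python
from typing import Callable
import numpy as np

Array = np.ndarray

def with_refinement(refine: Callable[[Array], Array],
                    aggregate: Callable[[Array], Array]) -> Callable[[Array], Array]:
    """Compose a shape-preserving refinement step with an existing aggregator."""
    def pipeline(features: Array) -> Array:
        refined = refine(features)
        if refined.shape != features.shape:
            raise ValueError("refinement must preserve the feature shape")
        return aggregate(refined)
    return pipeline

def shrink(features: Array, theta: float = 0.5) -> Array:
    """Stand-in for an SC module: soft-threshold the features (assumption, not the paper's module)."""
    return np.sign(features) * np.maximum(np.abs(features) - theta, 0.0)

def max_pool(features: Array) -> Array:
    return features.max(axis=0)

def mean_pool(features: Array) -> Array:
    return features.mean(axis=0)

features = np.array([[1.0, -0.2],
                     [0.3, 2.0],
                     [-1.5, 0.4]])

sc_max = with_refinement(shrink, max_pool)    # same refinement, two different aggregators
sc_mean = with_refinement(shrink, mean_pool)

bag_max = sc_max(features)     # approximately [0.5, 1.5]
bag_mean = sc_mean(features)   # approximately [-0.1667, 0.5]
```

The aggregators are left untouched, which is the property that lets such a module be added to an existing MIL framework.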

Experimental results on multiple WSI classification datasets demonstrate that the proposed SC-MIL method can substantially outperform state-of-the-art MIL approaches.

Critical Analysis

The paper provides a well-designed and thorough evaluation of the SC-MIL method, including comparisons to several baseline MIL techniques across different datasets. The authors also discuss potential limitations and areas for future work, such as:

Extending the sparse coding approach to other types of MIL problems beyond WSI classification.
Investigating more advanced sparse coding techniques that could further improve the instance feature embeddings.
Exploring ways to make the sparse coding module more efficient to enable real-time applications.

While the paper presents a compelling solution, one potential concern is the computational complexity of the sparse coding step, which could limit the scalability of the approach for very large WSIs. Additionally, the paper does not provide a thorough analysis of the learned dictionaries and sparse codes, which could yield additional insights into how the method is functioning.

Overall, the SC-MIL method represents a novel and promising direction for improving MIL frameworks, with the sparse coding approach serving as a flexible and effective way to enhance both the feature embedding and aggregation components.

Conclusion

This paper introduces a sparse coding-based multiple instance learning (SC-MIL) method for weakly supervised whole slide image classification. The key innovation is the use of sparse dictionary learning to capture the similarities between instances and suppress irrelevant ones, leading to better feature embeddings and MIL aggregation.

The experimental results demonstrate that the proposed SC-MIL approach can substantially outperform state-of-the-art MIL techniques on multiple datasets. This suggests that the sparse coding-based representation learning is a powerful tool for enhancing MIL frameworks, with potential applications beyond just WSI classification.

The paper also highlights areas for future research, such as exploring more advanced sparse coding techniques and investigating the scalability of the approach. Overall, the SC-MIL method represents an important contribution to the field of weakly supervised medical image analysis.

Related Papers

SC-MIL: Sparsely Coded Multiple Instance Learning for Whole Slide Image Classification

Peijie Qiu, Pan Xiao, Wenhui Zhu, Yalin Wang, Aristeidis Sotiras

Multiple Instance Learning (MIL) has been widely used in weakly supervised whole slide image (WSI) classification. Typical MIL methods include a feature embedding part, which embeds the instances into features via a pre-trained feature extractor, and an MIL aggregator that combines instance embeddings into predictions. Most efforts have typically focused on improving these parts. This involves refining the feature embeddings through self-supervised pre-training as well as modeling the correlations between instances separately. In this paper, we proposed a sparsely coded MIL (SC-MIL) method that addresses those two aspects at the same time by leveraging sparse dictionary learning. The sparse dictionary learning captures the similarities of instances by expressing them as sparse linear combinations of atoms in an over-complete dictionary. In addition, imposing sparsity improves instance feature embeddings by suppressing irrelevant instances while retaining the most relevant ones. To make the conventional sparse coding algorithm compatible with deep learning, we unrolled it into a sparsely coded module leveraging deep unrolling. The proposed SC module can be incorporated into any existing MIL framework in a plug-and-play manner with an acceptable computational cost. The experimental results on multiple datasets demonstrated that the proposed SC module could substantially boost the performance of state-of-the-art MIL methods. The code is available at https://github.com/sotiraslab/SCMIL.git.

8/2/2024

SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification

Heng Fang, Sheng Huang, Wenhao Tang, Luwen Huangfu, Bo Liu

Multiple Instance Learning (MIL) represents the predominant framework in Whole Slide Image (WSI) classification, covering aspects such as sub-typing, diagnosis, and beyond. Current MIL models predominantly rely on instance-level features derived from pretrained models such as ResNet. These models segment each WSI into independent patches and extract features from these local patches, leading to a significant loss of global spatial context and restricting the model's focus to merely local features. To address this issue, we propose a novel MIL framework, named SAM-MIL, that emphasizes spatial contextual awareness and explicitly incorporates spatial context by extracting comprehensive, image-level information. The Segment Anything Model (SAM) represents a pioneering visual segmentation foundational model that can capture segmentation features without the need for additional fine-tuning, rendering it an outstanding tool for extracting spatial context directly from raw WSIs. Our approach includes the design of group feature extraction based on spatial context and a SAM-Guided Group Masking strategy to mitigate class imbalance issues. We implement a dynamic mask ratio for different segmentation categories and supplement these with representative group features of categories. Moreover, SAM-MIL divides instances to generate additional pseudo-bags, thereby augmenting the training set, and introduces consistency of spatial context across pseudo-bags to further enhance the model's performance. Experimental results on the CAMELYON-16 and TCGA Lung Cancer datasets demonstrate that our proposed SAM-MIL model outperforms existing mainstream methods in WSI classification. Our open-source implementation code is available at https://github.com/FangHeng/SAM-MIL.

7/26/2024

Rethinking Multiple Instance Learning for Whole Slide Image Classification: A Good Instance Classifier is All You Need

Linhao Qu, Yingfan Ma, Xiaoyuan Luo, Manning Wang, Zhijian Song

Weakly supervised whole slide image classification is usually formulated as a multiple instance learning (MIL) problem, where each slide is treated as a bag, and the patches cut out of it are treated as instances. Existing methods either train an instance classifier through pseudo-labeling or aggregate instance features into a bag feature through attention mechanisms and then train a bag classifier, where the attention scores can be used for instance-level classification. However, the pseudo instance labels constructed by the former usually contain a lot of noise, and the attention scores constructed by the latter are not accurate enough, both of which affect their performance. In this paper, we propose an instance-level MIL framework based on contrastive learning and prototype learning to effectively accomplish both instance classification and bag classification tasks. To this end, we propose an instance-level weakly supervised contrastive learning algorithm for the first time under the MIL setting to effectively learn instance feature representation. We also propose an accurate pseudo label generation method through prototype learning. We then develop a joint training strategy for weakly supervised contrastive learning, prototype learning, and instance classifier training. Extensive experiments and visualizations on four datasets demonstrate the powerful performance of our method. Codes are available at https://github.com/miccaiif/INS.

5/14/2024

DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification

Wenhui Zhu, Xiwen Chen, Peijie Qiu, Aristeidis Sotiras, Abolfazl Razi, Yalin Wang

Multiple instance learning (MIL) stands as a powerful approach in weakly supervised learning, regularly employed in histological whole slide image (WSI) classification for detecting tumorous lesions. However, existing mainstream MIL methods focus on modeling correlation between instances while overlooking the inherent diversity among instances. Moreover, the few MIL methods that have aimed at diversity modeling empirically show inferior performance at a high computational cost. To bridge this gap, we propose a novel MIL aggregation method based on diverse global representation (DGR-MIL), by modeling diversity among instances through a set of global vectors that serve as a summary of all instances. First, we turn the instance correlation into the similarity between instance embeddings and the predefined global vectors through a cross-attention mechanism. This stems from the fact that similar instance embeddings typically would result in a higher correlation with a certain global vector. Second, we propose two mechanisms to enforce the diversity among the global vectors to be more descriptive of the entire bag: (i) positive instance alignment and (ii) a novel, efficient, and theoretically guaranteed diversification learning paradigm. Specifically, the positive instance alignment module encourages the global vectors to align with the center of positive instances (e.g., instances containing tumors in WSI). To further diversify the global representations, we propose a novel diversification learning paradigm leveraging the determinantal point process. The proposed model outperforms the state-of-the-art MIL aggregation models by a substantial margin on the CAMELYON-16 and the TCGA-lung cancer datasets. The code is available at https://github.com/ChongQingNoSubway/DGR-MIL.

7/8/2024