Hard Negative Sample Mining for Whole Slide Image Classification

Read original: arXiv:2410.02212 - Published 10/4/2024 by Wentao Huang, Xiaoling Hu, Shahira Abousamra, Prateek Prasanna, Chao Chen

Hard Negative Sample Mining for Whole Slide Image Classification

Overview

Whole slide image (WSI) classification is a challenging task in medical imaging
This paper proposes a hard negative sample mining approach to improve WSI classification
The key ideas include:
- Leveraging self-training to generate hard negative samples
- Using a task-oriented embedding to guide the hard negative sample mining process
- Achieving state-of-the-art performance on multiple WSI classification benchmarks

Plain English Explanation

Whole slide images (WSIs) are high-resolution digital scans of tissue samples used in medical diagnosis. Classifying these WSIs is an important but challenging task, as the images can be very complex and contain a lot of detailed information.

This paper introduces a new approach to improve WSI classification by mining hard negative samples. The key idea is to use self-training to automatically generate hard negative samples - samples that are visually similar to the positive samples but belong to a different class. By incorporating these hard negatives into the training process, the model can learn to better distinguish between similar-looking samples.

The paper also uses a task-oriented embedding to guide the hard negative sample mining. This embedding is designed to capture the important features for the specific WSI classification task, helping the model identify the most informative hard negative samples.

The authors demonstrate that their hard negative sample mining approach leads to state-of-the-art performance on multiple WSI classification benchmarks. This suggests that this technique could be a valuable tool for improving the accuracy of medical image analysis systems.

Technical Explanation

The paper proposes a hard negative sample mining approach to improve whole slide image (WSI) classification. The key components of the method are:

Self-Training: The authors use self-training to generate hard negative samples. The model is first trained on the labeled data, then used to predict pseudo-labels on unlabeled data. Samples that are confidently predicted as negative (i.e., hard negatives) are added to the training set.
Task-Oriented Embedding: The authors introduce a task-oriented embedding to guide the hard negative sample mining process. This embedding is designed to capture the important features for the specific WSI classification task, helping the model identify the most informative hard negative samples.
Iterative Training: The model is trained in an iterative fashion, alternating between training on the labeled data and mining hard negative samples from the unlabeled data. This allows the model to progressively refine its understanding of the task and identify more challenging hard negatives.

The authors evaluate their approach on multiple WSI classification benchmarks, including CAMELYON16 and Kather-CRC-2016. They demonstrate that their hard negative sample mining technique leads to state-of-the-art performance, outperforming a range of baseline methods.

Critical Analysis

The paper presents a well-designed and empirically-validated approach for improving whole slide image classification. The authors have carefully considered the challenges of this task and have developed a novel technique to address them.

One potential limitation of the approach is that it relies on the availability of a large amount of unlabeled data to mine hard negative samples. In settings where unlabeled data is scarce, the performance of the method may be more limited. The authors do not extensively explore the sensitivity of their approach to the amount of unlabeled data.

Additionally, the paper does not provide a detailed analysis of the types of hard negative samples that are identified and how they differ from the standard negative samples. A deeper understanding of the characteristics of these hard negatives could lead to further improvements in the method.

Overall, the paper makes a valuable contribution to the field of medical image analysis and demonstrates the potential of hard negative sample mining for enhancing the performance of classification models in challenging domains.

Conclusion

This paper introduces a hard negative sample mining approach to improve whole slide image (WSI) classification. By leveraging self-training and a task-oriented embedding, the method is able to identify challenging hard negative samples and incorporate them into the training process. The authors show that this technique leads to state-of-the-art performance on multiple WSI classification benchmarks, suggesting its potential as a valuable tool for medical image analysis systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

New!Hard Negative Sample Mining for Whole Slide Image Classification

Wentao Huang, Xiaoling Hu, Shahira Abousamra, Prateek Prasanna, Chao Chen

Weakly supervised whole slide image (WSI) classification is challenging due to the lack of patch-level labels and high computational costs. State-of-the-art methods use self-supervised patch-wise feature representations for multiple instance learning (MIL). Recently, methods have been proposed to fine-tune the feature representation on the downstream task using pseudo labeling, but mostly focusing on selecting high-quality positive patches. In this paper, we propose to mine hard negative samples during fine-tuning. This allows us to obtain better feature representations and reduce the training cost. Furthermore, we propose a novel patch-wise ranking loss in MIL to better exploit these hard negative samples. Experiments on two public datasets demonstrate the efficacy of these proposed ideas. Our codes are available at https://github.com/winston52/HNM-WSI

10/4/2024

Task-oriented Embedding Counts: Heuristic Clustering-driven Feature Fine-tuning for Whole Slide Image Classification

Xuenian Wang, Shanshan Shi, Renao Yan, Qiehe Sun, Lianghui Zhu, Tian Guan, Yonghong He

In the field of whole slide image (WSI) classification, multiple instance learning (MIL) serves as a promising approach, commonly decoupled into feature extraction and aggregation. In this paradigm, our observation reveals that discriminative embeddings are crucial for aggregation to the final prediction. Among all feature updating strategies, task-oriented ones can capture characteristics specifically for certain tasks. However, they can be prone to overfitting and contaminated by samples assigned with noisy labels. To address this issue, we propose a heuristic clustering-driven feature fine-tuning method (HC-FT) to enhance the performance of multiple instance learning by providing purified positive and hard negative samples. Our method first employs a well-trained MIL model to evaluate the confidence of patches. Then, patches with high confidence are marked as positive samples, while the remaining patches are used to identify crucial negative samples. After two rounds of heuristic clustering and selection, purified positive and hard negative samples are obtained to facilitate feature fine-tuning. The proposed method is evaluated on both CAMELYON16 and BRACS datasets, achieving an AUC of 97.13% and 85.85%, respectively, consistently outperforming all compared methods.

6/4/2024

An efficient framework based on large foundation model for cervical cytopathology whole slide image screening

Jialong Huang, Gaojie Li, Shichao Kan, Jianfeng Liu, Yixiong Liang

Current cervical cytopathology whole slide image (WSI) screening primarily relies on detection-based approaches, which are limited in performance due to the expense and time-consuming annotation process. Multiple Instance Learning (MIL), a weakly supervised approach that relies solely on bag-level labels, can effectively alleviate these challenges. Nonetheless, MIL commonly employs frozen pretrained models or self-supervised learning for feature extraction, which suffers from low efficacy or inefficiency. In this paper, we propose an efficient framework for cervical cytopathology WSI classification using only WSI-level labels through unsupervised and weakly supervised learning. Given the sparse and dispersed nature of abnormal cells within cytopathological WSIs, we propose a strategy that leverages the pretrained foundation model to filter the top$k$ high-risk patches. Subsequently, we suggest parameter-efficient fine-tuning (PEFT) of a large foundation model using contrastive learning on the filtered patches to enhance its representation ability for task-specific signals. By training only the added linear adapters, we enhance the learning of patch-level features with substantially reduced time and memory consumption. Experiments conducted on the CSD and FNAC 2019 datasets demonstrate that the proposed method enhances the performance of various MIL methods and achieves state-of-the-art (SOTA) performance. The code and trained models are publicly available at https://github.com/CVIU-CSU/TCT-InfoNCE.

7/17/2024

Rethinking Multiple Instance Learning for Whole Slide Image Classification: A Good Instance Classifier is All You Need

Linhao Qu, Yingfan Ma, Xiaoyuan Luo, Manning Wang, Zhijian Song

Weakly supervised whole slide image classification is usually formulated as a multiple instance learning (MIL) problem, where each slide is treated as a bag, and the patches cut out of it are treated as instances. Existing methods either train an instance classifier through pseudo-labeling or aggregate instance features into a bag feature through attention mechanisms and then train a bag classifier, where the attention scores can be used for instance-level classification. However, the pseudo instance labels constructed by the former usually contain a lot of noise, and the attention scores constructed by the latter are not accurate enough, both of which affect their performance. In this paper, we propose an instance-level MIL framework based on contrastive learning and prototype learning to effectively accomplish both instance classification and bag classification tasks. To this end, we propose an instance-level weakly supervised contrastive learning algorithm for the first time under the MIL setting to effectively learn instance feature representation. We also propose an accurate pseudo label generation method through prototype learning. We then develop a joint training strategy for weakly supervised contrastive learning, prototype learning, and instance classifier training. Extensive experiments and visualizations on four datasets demonstrate the powerful performance of our method. Codes are available at https://github.com/miccaiif/INS.

5/14/2024