Deep learning-based classification of breast cancer molecular subtypes from H&E whole-slide images

Read original: arXiv:2409.09053 - Published 9/17/2024 by Masoud Tafavvoghi, Anders Sildnes, Mehrdad Rakaee, Nikita Shvetsov, Lars Ailo Bongo, Lill-Tove Rasmussen Busund, Kajsa M{o}llersen

🤿

Overview

Classifying breast cancer molecular subtypes is crucial for tailoring treatment strategies.
Immunohistochemistry (IHC) and gene expression profiling are standard methods, but IHC can be subjective and gene profiling is costly.
Previous deep learning approaches have used H&E-stained whole slide images (WSI) for molecular subtyping, with varying methods and performance.
This research investigates whether H&E-stained WSIs can be used to solely predict breast cancer molecular subtypes.

Plain English Explanation

Breast cancer is a complex disease, and understanding the specific molecular characteristics of a tumor is important for choosing the most effective treatment. Currently, doctors use two main methods to determine these molecular subtypes: immunohistochemistry (IHC) and gene expression profiling. IHC involves staining and examining tumor samples under a microscope, but this can be subjective. Gene expression profiling provides more detailed molecular information, but it is expensive and not widely available in many regions.

Previous research has shown that deep learning models, which are a type of artificial intelligence, may be able to determine breast cancer molecular subtypes by analyzing digital images of tumor samples. These models can "learn" to recognize patterns in the images that correspond to different subtypes. However, the methods and results of these previous studies have varied.

In this new research, the scientists investigated whether deep learning models could accurately predict breast cancer molecular subtypes using only H&E-stained whole slide images (WSIs) - digital images that capture an entire tumor sample. This would be a more efficient and accessible approach compared to the current standard methods.

The researchers developed a two-step deep learning pipeline to first identify the tumor regions in the WSIs, and then classify the molecular subtypes based on those tumor regions. They tested this pipeline on a dataset of 221 WSIs and achieved promising results, with an overall accuracy of 73% for molecular subtyping.

These findings suggest that deep learning models could potentially be used as supportive tools to help doctors determine breast cancer molecular subtypes, particularly in regions where the standard methods are not readily available. However, the approach still needs further validation before it could be widely used in clinical practice.

Technical Explanation

The researchers used a two-step pipeline to predict breast cancer molecular subtypes from H&E-stained whole slide images (WSIs):

Tumor tile classification: The first step was to classify individual tiles (small image regions) within the WSIs as either "tumor" or "non-tumor" using a deep learning model. This allowed the researchers to focus only on the tumor regions for the subsequent molecular subtyping.
Molecular subtyping: The second step employed an One-vs-Rest (OvR) strategy, where four binary OvR classifiers were trained to distinguish each molecular subtype (luminal A, luminal B, HER2-enriched, and basal) from the other subtypes. The results of these four classifiers were then aggregated using an XGBoost model to make the final subtype prediction.

The pipeline was trained and evaluated on a dataset of 1,433 WSIs of breast cancer samples, with 221 held-out for testing. The tumor detection model achieved a macro F1 score of 0.95, and the molecular subtyping model achieved a macro F1 score of 0.73 on the test set.

The researchers made their code available to facilitate ongoing research and development in this area. They suggest that, with further validation, supervised deep learning models could serve as supportive tools for molecular subtyping in breast cancer, potentially improving access to this crucial information in regions where standard methods are not readily available.

Critical Analysis

The researchers acknowledge several limitations of their study. First, the dataset used was relatively small, and the performance of the models may need to be evaluated on larger and more diverse datasets to assess their robustness. Second, the study did not directly compare the deep learning-based approach to the standard IHC and gene expression profiling methods, which would be important for understanding its practical utility.

Additionally, the researchers did not provide much detail on the specific architectural choices and hyperparameter tuning of the deep learning models, making it difficult to assess the technical merits of the approach. Further research could explore different model architectures and training strategies to potentially improve the molecular subtyping performance.

It would also be valuable to understand the types of errors made by the deep learning models, as well as the specific image features they are learning to discriminate between subtypes. This could provide insights into the underlying biological differences captured by the models and guide further refinements.

Despite these limitations, the researchers have demonstrated the potential of deep learning for breast cancer molecular subtyping using readily available H&E-stained WSIs. This could be a promising avenue for improving access to this crucial information, particularly in resource-limited settings. Further research and validation will be necessary before this approach could be considered for clinical deployment.

Conclusion

This research investigates the use of deep learning models to predict breast cancer molecular subtypes using only H&E-stained whole slide images, without relying on the more subjective IHC or costly gene expression profiling methods. The researchers developed a two-step pipeline that first identifies tumor regions in the images and then classifies the molecular subtypes, achieving promising results.

While the approach still requires further validation and refinement, the findings suggest that deep learning could serve as a supportive tool for molecular subtyping in breast cancer, potentially improving access to this important information in regions where standard methods are not widely available. Continued research and development in this area could lead to more efficient and accessible breast cancer diagnosis and treatment strategies.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🤿

New!Deep learning-based classification of breast cancer molecular subtypes from H&E whole-slide images

Masoud Tafavvoghi, Anders Sildnes, Mehrdad Rakaee, Nikita Shvetsov, Lars Ailo Bongo, Lill-Tove Rasmussen Busund, Kajsa M{o}llersen

Classifying breast cancer molecular subtypes is crucial for tailoring treatment strategies. While immunohistochemistry (IHC) and gene expression profiling are standard methods for molecular subtyping, IHC can be subjective, and gene profiling is costly and not widely accessible in many regions. Previous approaches have highlighted the potential application of deep learning models on H&E-stained whole slide images (WSI) for molecular subtyping, but these efforts vary in their methods, datasets, and reported performance. In this work, we investigated whether H&E-stained WSIs could be solely leveraged to predict breast cancer molecular subtypes (luminal A, B, HER2-enriched, and Basal). We used 1,433 WSIs of breast cancer in a two-step pipeline: first, classifying tumor and non-tumor tiles to use only the tumor regions for molecular subtyping; and second, employing a One-vs-Rest (OvR) strategy to train four binary OvR classifiers and aggregating their results using an eXtreme Gradient Boosting (XGBoost) model. The pipeline was tested on 221 hold-out WSIs, achieving an overall macro F1 score of 0.95 for tumor detection and 0.73 for molecular subtyping. Our findings suggest that, with further validation, supervised deep learning models could serve as supportive tools for molecular subtyping in breast cancer. Our codes are made available to facilitate ongoing research and development.

9/17/2024

👀

A vision transformer-based framework for knowledge transfer from multi-modal to mono-modal lymphoma subtyping models

Bilel Guetarni, Feryal Windal, Halim Benhabiles, Marianne Petit, Romain Dubois, Emmanuelle Leteurtre, Dominique Collard

Determining lymphoma subtypes is a crucial step for better patient treatment targeting to potentially increase their survival chances. In this context, the existing gold standard diagnosis method, which relies on gene expression technology, is highly expensive and time-consuming, making it less accessibility. Although alternative diagnosis methods based on IHC (immunohistochemistry) technologies exist (recommended by the WHO), they still suffer from similar limitations and are less accurate. Whole Slide Image (WSI) analysis using deep learning models has shown promising potential for cancer diagnosis, that could offer cost-effective and faster alternatives to existing methods. In this work, we propose a vision transformer-based framework for distinguishing DLBCL (Diffuse Large B-Cell Lymphoma) cancer subtypes from high-resolution WSIs. To this end, we introduce a multi-modal architecture to train a classifier model from various WSI modalities. We then leverage this model through a knowledge distillation process to efficiently guide the learning of a mono-modal classifier. Our experimental study conducted on a lymphoma dataset of 157 patients shows the promising performance of our mono-modal classification model, outperforming six recent state-of-the-art methods. In addition, the power-law curve, estimated on our experimental data, suggests that with more training data from a reasonable number of additional patients, our model could achieve competitive diagnosis accuracy with IHC technologies. Furthermore, the efficiency of our framework is confirmed through an additional experimental study on an external breast cancer dataset (BCI dataset).

5/30/2024

Deep Learning-based Prediction of Breast Cancer Tumor and Immune Phenotypes from Histopathology

Tiago Gonc{c}alves, Dagoberto Pulido-Arias, Julian Willett, Katharina V. Hoebel, Mason Cleveland, Syed Rakin Ahmed, Elizabeth Gerstner, Jayashree Kalpathy-Cramer, Jaime S. Cardoso, Christopher P. Bridge, Albert E. Kim

The interactions between tumor cells and the tumor microenvironment (TME) dictate therapeutic efficacy of radiation and many systemic therapies in breast cancer. However, to date, there is not a widely available method to reproducibly measure tumor and immune phenotypes for each patient's tumor. Given this unmet clinical need, we applied multiple instance learning (MIL) algorithms to assess activity of ten biologically relevant pathways from the hematoxylin and eosin (H&E) slide of primary breast tumors. We employed different feature extraction approaches and state-of-the-art model architectures. Using binary classification, our models attained area under the receiver operating characteristic (AUROC) scores above 0.70 for nearly all gene expression pathways and on some cases, exceeded 0.80. Attention maps suggest that our trained models recognize biologically relevant spatial patterns of cell sub-populations from H&E. These efforts represent a first step towards developing computational H&E biomarkers that reflect facets of the TME and hold promise for augmenting precision oncology.

4/26/2024

✅

Development and Validation of Fully Automatic Deep Learning-Based Algorithms for Immunohistochemistry Reporting of Invasive Breast Ductal Carcinoma

Sumit Kumar Jha, Purnendu Mishra, Shubham Mathur, Gursewak Singh, Rajiv Kumar, Kiran Aatre, Suraj Rengarajan

Immunohistochemistry (IHC) analysis is a well-accepted and widely used method for molecular subtyping, a procedure for prognosis and targeted therapy of breast carcinoma, the most common type of tumor affecting women. There are four molecular biomarkers namely progesterone receptor (PR), estrogen receptor (ER), antigen Ki67, and human epidermal growth factor receptor 2 (HER2) whose assessment is needed under IHC procedure to decide prognosis as well as predictors of response to therapy. However, IHC scoring is based on subjective microscopic examination of tumor morphology and suffers from poor reproducibility, high subjectivity, and often incorrect scoring in low-score cases. In this paper, we present, a deep learning-based semi-supervised trained, fully automatic, decision support system (DSS) for IHC scoring of invasive ductal carcinoma. Our system automatically detects the tumor region removing artifacts and scores based on Allred standard. The system is developed using 3 million pathologist-annotated image patches from 300 slides, fifty thousand in-house cell annotations, and forty thousand pixels marking of HER2 membrane. We have conducted multicentric trials at four centers with three different types of digital scanners in terms of percentage agreement with doctors. And achieved agreements of 95, 92, 88 and 82 percent for Ki67, HER2, ER, and PR stain categories, respectively. In addition to overall accuracy, we found that there is 5 percent of cases where pathologist have changed their score in favor of algorithm score while reviewing with detailed algorithmic analysis. Our approach could improve the accuracy of IHC scoring and subsequent therapy decisions, particularly where specialist expertise is unavailable. Our system is highly modular. The proposed algorithm modules can be used to develop DSS for other cancer types.

6/18/2024