Efficient and generalizable prediction of molecular alterations in multiple cancer cohorts using H&E whole slide images

Read original: arXiv:2407.15816 - Published 7/23/2024 by Kshitij Ingale, Sun Hae Hong, Qiyuan Hu, Renyu Zhang, Bo Osinski, Mina Khoshdeli, Josh Och, Kunal Nagpal, Martin C. Stumpe, Rohan P. Joshi

Overview

The paper proposes a multi-task model that predicts multiple DNA alterations from routine hematoxylin and eosin (H&E)-stained whole slide images of tumors.
It aims to address limitations of current molecular testing, including lack of standardization, long turnaround times, high cost, and limited tissue availability.
The multi-task approach outperforms biomarker-specific models on average, with pronounced gains for rare mutations.
The models generalize reasonably well to temporal-holdout, externally stained, and multi-site TCGA test sets, and embeddings from the multi-task model perform strongly on downstream tasks that were not part of training.

Plain English Explanation

When doctors are treating cancer, they often need to test the tumor sample to see if there are any genetic mutations that could be targeted with specific drugs. However, this molecular testing can be challenging due to a lack of standardization, long turnaround times, high costs, and limited availability of tumor tissue.

The researchers in this paper propose a new approach that could help address these issues. They trained a machine learning model to look at the routine H&E-stained images of tumor samples and predict multiple genetic mutations at the same time. This "multi-task" approach performed better on average than individual models trained to predict each mutation separately, and was particularly effective at detecting rare mutations.

Importantly, the models generalized reasonably well to new datasets, including slides collected at a later time, slides stained externally, and multi-site samples from The Cancer Genome Atlas (TCGA). This suggests the approach could be clinically useful, providing doctors with multiple actionable predictions from a single slide.

Overall, this research represents a promising step towards developing AI-powered tools that can enhance and streamline the process of molecular testing for cancer patients.

Technical Explanation

The researchers trained multi-task learning models to simultaneously predict the presence of multiple DNA alterations from routine H&E-stained tumor whole slide images. This approach aimed to address limitations of current molecular testing workflows, such as lack of standardization, long turnaround times, high costs, and limited tissue availability.
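The multi-task setup described above can be pictured as one shared representation feeding a separate binary output per alteration. The following is a minimal NumPy sketch, not the authors' implementation: the layer sizes, the tanh hidden layer, the random slide-level features, and the masking of untested (slide, alteration) pairs are all assumptions made for illustration.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def multitask_forward(slide_features, w_shared, w_heads):
    """Shared hidden layer followed by one output per alteration."""
    hidden = np.tanh(slide_features @ w_shared)  # (n_slides, hidden_dim)
    logits = hidden @ w_heads                    # (n_slides, n_tasks)
    return sigmoid(logits)


def masked_bce(probs, labels, mask, eps=1e-7):
    """Binary cross-entropy averaged over observed (slide, task) pairs only."""
    probs = np.clip(probs, eps, 1.0 - eps)
    per_entry = -(labels * np.log(probs) + (1.0 - labels) * np.log(1.0 - probs))
    return float((per_entry * mask).sum() / max(mask.sum(), 1.0))


# Hypothetical toy data: 8 slides, 16-dim features, 3 alterations.
rng = np.random.default_rng(0)
n_slides, feat_dim, hidden_dim, n_tasks = 8, 16, 4, 3
slide_features = rng.normal(size=(n_slides, feat_dim))
w_shared = rng.normal(scale=0.1, size=(feat_dim, hidden_dim))
w_heads = rng.normal(scale=0.1, size=(hidden_dim, n_tasks))
labels = rng.integers(0, 2, size=(n_slides, n_tasks)).astype(float)

# Assumption: the third alteration was not tested on the first four slides,
# so those entries are excluded from the loss.
mask = np.ones((n_slides, n_tasks))
mask[:4, 2] = 0.0

probs = multitask_forward(slide_features, w_shared, w_heads)
loss = masked_bce(probs, labels, mask)
print(probs.shape, round(loss, 4))
```

Because every output shares `w_shared`, a training signal from any one alteration updates the representation used by all of them, which is one plausible reason a rare alteration could benefit from being trained alongside more common ones.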

Compared to training individual models for each biomarker, the multi-task framework performed better on average, with particularly pronounced gains for rare mutations. The models generalized reasonably well to independent temporal-holdout, externally stained, and multi-site TCGA test sets. Additionally, whole slide image embeddings derived from the multi-task models performed strongly on downstream tasks that were not part of the original training.
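A comparison "on average" across biomarkers is often made by computing a per-biomarker metric and then averaging it. The sketch below assumes AUROC as that metric, since the summary does not say which one was used, and computes it from ranks (the Mann-Whitney U statistic). The alteration names and scores are invented toy values for illustration, not results from the paper.

```python
import numpy as np


def auroc(y_true, y_score):
    """AUROC via the Mann-Whitney U statistic, using average ranks for ties."""
    y_true = np.asarray(y_true)
    y_score = np.asarray(y_score, dtype=float)
    n_pos = int(y_true.sum())
    n_neg = len(y_true) - n_pos
    if n_pos == 0 or n_neg == 0:
        return float("nan")  # undefined when only one class is present
    order = np.argsort(y_score, kind="mergesort")
    sorted_scores = y_score[order]
    ranks = np.empty(len(y_score))
    i = 0
    while i < len(sorted_scores):
        j = i
        while j + 1 < len(sorted_scores) and sorted_scores[j + 1] == sorted_scores[i]:
            j += 1
        ranks[order[i:j + 1]] = (i + j) / 2.0 + 1.0  # average 1-based rank
        i = j + 1
    u = ranks[y_true == 1].sum() - n_pos * (n_pos + 1) / 2.0
    return float(u / (n_pos * n_neg))


def macro_auroc(labels_by_task, scores_by_task):
    """Unweighted mean of per-task AUROC, skipping undefined tasks."""
    values = [auroc(labels_by_task[t], scores_by_task[t]) for t in labels_by_task]
    return float(np.nanmean(values))


# Hypothetical toy labels and scores for two alterations.
labels = {"alteration_a": [0, 0, 1, 1], "alteration_b": [0, 1, 0, 1]}
multi_task_scores = {"alteration_a": [0.1, 0.4, 0.35, 0.8],
                     "alteration_b": [0.2, 0.9, 0.3, 0.7]}
single_task_scores = {"alteration_a": [0.1, 0.4, 0.35, 0.8],
                      "alteration_b": [0.2, 0.6, 0.7, 0.4]}

print(macro_auroc(labels, multi_task_scores))   # 0.875
print(macro_auroc(labels, single_task_scores))  # 0.625
```

An unweighted mean gives rare and common alterations equal weight, so gains on rare mutations show up clearly in the average.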

The use of a multi-task approach allows the model to leverage shared patterns across different DNA alterations, potentially improving performance, especially for rare mutations that may not be well-represented in individual datasets. By providing multiple actionable predictions from a single slide, this work represents a promising step towards developing clinically useful algorithms that can enhance and streamline molecular testing for cancer patients.

Critical Analysis

The paper presents a thoughtful and well-designed approach to address important challenges in molecular testing for cancer care. The use of a multi-task framework is a clever strategy to leverage shared patterns across alterations and improve performance, especially for rare mutations.

That said, the paper does acknowledge several limitations and areas for further research. For example, the models were trained on data from a single institution, and it's unclear how well they would generalize to more diverse, real-world clinical settings. Additionally, the paper does not provide a detailed analysis of the types of errors made by the models or the clinical implications of false positives/negatives.

It would also be valuable to better understand the relationship between the image features learned by the multi-task model and the underlying biological mechanisms driving the genetic alterations. Insights into this could help build trust and interpretability around the model's predictions.

Overall, this work represents an important step forward, but continued research and validation in larger, more diverse cohorts will be crucial to assessing the true clinical utility of this approach.

Conclusion

This paper presents a multi-task learning approach to predict multiple DNA alterations from routine H&E-stained tumor images. By leveraging shared patterns across different biomarkers, the models outperformed biomarker-specific models on average, with pronounced gains for rare mutations.

The ability to provide multiple actionable predictions from a single slide could help address key limitations of current molecular testing workflows, including lack of standardization, high costs, and limited tissue availability. Additionally, the strong performance of the model embeddings on downstream tasks suggests the potential for broader applications beyond the primary focus of this work.
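One common way to evaluate embeddings on a task they were not trained for is a linear probe: the embeddings stay frozen and only a small linear classifier is fitted on top. The sketch below is an assumption about how such a check might look, not the paper's protocol. The embeddings are synthetic random vectors, the labels come from a hypothetical linear rule, and the learning rate and step count are arbitrary.

```python
import numpy as np


def fit_linear_probe(embeddings, labels, lr=0.1, n_steps=500):
    """Logistic regression by gradient descent; the embeddings are never updated."""
    n_samples, dim = embeddings.shape
    weights = np.zeros(dim)
    bias = 0.0
    for _ in range(n_steps):
        probs = 1.0 / (1.0 + np.exp(-(embeddings @ weights + bias)))
        error = probs - labels
        weights -= lr * (embeddings.T @ error) / n_samples
        bias -= lr * error.mean()
    return weights, bias


def predict(embeddings, weights, bias):
    """Hard 0/1 predictions from the fitted probe."""
    return (embeddings @ weights + bias > 0).astype(float)


# Synthetic stand-ins for frozen slide embeddings and a downstream label.
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(200, 8))
hidden_rule = rng.normal(size=8)  # hypothetical; defines the toy label
labels = (embeddings @ hidden_rule > 0).astype(float)

train_x, test_x = embeddings[:150], embeddings[150:]
train_y, test_y = labels[:150], labels[150:]

weights, bias = fit_linear_probe(train_x, train_y)
accuracy = float((predict(test_x, weights, bias) == test_y).mean())
print(accuracy)
```

If a probe this simple performs well on held-out data, the embeddings already encode information relevant to the downstream task.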

While further research is needed to validate the models in real-world clinical settings, this work represents an important step towards developing AI-powered tools that can enhance and streamline the process of molecular testing for cancer patients.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Efficient and generalizable prediction of molecular alterations in multiple cancer cohorts using H&E whole slide images

Kshitij Ingale, Sun Hae Hong, Qiyuan Hu, Renyu Zhang, Bo Osinski, Mina Khoshdeli, Josh Och, Kunal Nagpal, Martin C. Stumpe, Rohan P. Joshi

Molecular testing of tumor samples for targetable biomarkers is restricted by a lack of standardization, turnaround-time, cost, and tissue availability across cancer types. Additionally, targetable alterations of low prevalence may not be tested in routine workflows. Algorithms that predict DNA alterations from routinely generated hematoxylin and eosin (H&E)-stained images could prioritize samples for confirmatory molecular testing. Costs and the necessity of a large number of samples containing mutations limit approaches that train individual algorithms for each alteration. In this work, models were trained for simultaneous prediction of multiple DNA alterations from H&E images using a multi-task approach. Compared to biomarker-specific models, this approach performed better on average, with pronounced gains for rare mutations. The models reasonably generalized to independent temporal-holdout, externally-stained, and multi-site TCGA test sets. Additionally, whole slide image embeddings derived using multi-task models demonstrated strong performance in downstream tasks that were not a part of training. Overall, this is a promising approach to develop clinically useful algorithms that provide multiple actionable predictions from a single slide.

7/23/2024

Screen Them All: High-Throughput Pan-Cancer Genetic and Phenotypic Biomarker Screening from H&E Whole Slide Images

Yi Kan Wang, Ludmila Tydlitatova, Jeremy D. Kunz, Gerard Oakley, Ran A. Godrich, Matthew C. H. Lee, Chad Vanderbilt, Razik Yousfi, Thomas Fuchs, David S. Klimstra, Siqi Liu

Many molecular alterations serve as clinically prognostic or therapy-predictive biomarkers, typically detected using single or multi-gene molecular assays. However, these assays are expensive, tissue destructive and often take weeks to complete. Using AI on routine H&E WSIs offers a fast and economical approach to screen for multiple molecular biomarkers. We present a high-throughput AI-based system leveraging Virchow2, a foundation model pre-trained on 3 million slides, to interrogate genomic features previously determined by a next-generation sequencing (NGS) assay, using 47,960 scanned hematoxylin and eosin (H&E) whole slide images (WSIs) from 38,984 cancer patients. Unlike traditional methods that train individual models for each biomarker or cancer type, our system employs a unified model to simultaneously predict a wide range of clinically relevant molecular biomarkers across cancer types. By training the network to replicate the MSK-IMPACT targeted biomarker panel of 505 genes, it identified 80 high-performing biomarkers with a mean AU-ROC of 0.89 in the 15 most common cancer types. In addition, 40 biomarkers demonstrated strong associations with specific cancer histologic subtypes. Furthermore, 58 biomarkers were associated with targets frequently assayed clinically for therapy selection and response prediction. The model can also predict the activity of five canonical signaling pathways, identify defects in DNA repair mechanisms, and predict genomic instability measured by tumor mutation burden, microsatellite instability (MSI), and chromosomal instability (CIN). The proposed model can offer potential to guide therapy selection, improve treatment efficacy, accelerate patient screening for clinical trials and provoke the interrogation of new therapeutic targets.

8/21/2024

Scalable, Trustworthy Generative Model for Virtual Multi-Staining from H&E Whole Slide Images

Mehdi Ounissi, Ilias Sarbout, Jean-Pierre Hugot, Christine Martinez-Vinson, Dominique Berrebi, Daniel Racoceanu

Chemical staining methods are dependable but require extensive time, expensive chemicals, and raise environmental concerns. These challenges highlight the need for alternative solutions like virtual staining, which accelerates the diagnostic process and enhances stain application flexibility. Generative AI technologies are pivotal in addressing these issues. However, the high-stakes nature of healthcare decisions, especially in computational pathology, complicates the adoption of these tools due to their opaque processes. Our work introduces the use of generative AI for virtual staining, aiming to enhance performance, trustworthiness, scalability, and adaptability in computational pathology. The methodology centers on a singular H&E encoder supporting multiple stain decoders. This design focuses on critical regions in the latent space of H&E, enabling precise synthetic stain generation. Our method, tested to generate 8 different stains from a single H&E slide, offers scalability by loading only necessary model components during production. We integrate label-free knowledge in training, using loss functions and regularization to minimize artifacts, thus improving paired/unpaired virtual staining accuracy. To build trust, we use real-time self-inspection with discriminators for each stain type, providing pathologists with confidence heat-maps. Automatic quality checks on new H&E slides ensure conformity to the trained distribution, ensuring accurate synthetic stains. Recognizing pathologists' challenges with new technologies, we have developed an open-source, cloud-based system, that allows easy virtual staining of H&E slides through a browser, addressing hardware/software issues and facilitating real-time user feedback. We also curated a novel dataset of 8 paired H&E/stains related to pediatric Crohn's disease, comprising 480 WSIs to further stimulate computational pathology research.

7/2/2024

Advancing H&E-to-IHC Stain Translation in Breast Cancer: A Multi-Magnification and Attention-Based Approach

Linhao Qu, Chengsheng Zhang, Guihui Li, Haiyong Zheng, Chen Peng, Wei He

Breast cancer presents a significant healthcare challenge globally, demanding precise diagnostics and effective treatment strategies, where histopathological examination of Hematoxylin and Eosin (H&E) stained tissue sections plays a central role. Despite its importance, evaluating specific biomarkers like Human Epidermal Growth Factor Receptor 2 (HER2) for personalized treatment remains constrained by the resource-intensive nature of Immunohistochemistry (IHC). Recent strides in deep learning, particularly in image-to-image translation, offer promise in synthesizing IHC-HER2 slides from H&E stained slides. However, existing methodologies encounter challenges, including managing multiple magnifications in pathology images and insufficient focus on crucial information during translation. To address these issues, we propose a novel model integrating attention mechanisms and multi-magnification information processing. Our model employs a multi-magnification processing strategy to extract and utilize information from various magnifications within pathology images, facilitating robust image translation. Additionally, an attention module within the generative network prioritizes critical information for image distribution translation while minimizing less pertinent details. Rigorous testing on a publicly available breast cancer dataset demonstrates superior performance compared to existing methods, establishing our model as a state-of-the-art solution in advancing pathology image translation from H&E to IHC staining.

8/6/2024