Screen Them All: High-Throughput Pan-Cancer Genetic and Phenotypic Biomarker Screening from H&E Whole Slide Images

Original: arXiv:2408.09554 - Published 8/21/2024 by Yi Kan Wang, Ludmila Tydlitatova, Jeremy D. Kunz, Gerard Oakley, Ran A. Godrich, Matthew C. H. Lee, Chad Vanderbilt, Razik Yousfi, Thomas Fuchs, David S. Klimstra, and Siqi Liu

Overview

  • Presents a high-throughput approach to screen for genetic and phenotypic biomarkers in cancer from H&E whole slide images
  • Leverages deep learning models to detect biomarkers and predict molecular alterations across multiple cancer types
  • Aims to enable rapid, cost-effective biomarker screening that can guide therapy selection and clinical trial enrollment

Plain English Explanation

This research paper introduces a new method for efficiently screening for biomarkers in cancer. The key idea is to use deep learning models to analyze standard H&E (hematoxylin and eosin) whole slide images of tumor samples, which are commonly used in pathology.

The deep learning models are trained to predict genetic and phenotypic biomarkers directly from the H&E images, without first running a molecular assay. Because those assays are expensive, tissue-destructive, and can take weeks, an image-based prediction offers a fast, low-cost way to screen for many biomarkers at once from a slide that already exists.

The system is designed to work across multiple cancer types, allowing pan-cancer biomarker screening with a single model. According to the authors, this could help guide therapy selection, speed up patient screening for clinical trials, and point to new therapeutic targets.

Technical Explanation

The researchers developed deep learning models that can analyze H&E whole slide images to predict molecular alterations and detect phenotypic biomarkers associated with various cancer types.

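They first curated a large dataset: 47,960 scanned H&E whole slide images from 38,984 cancer patients, paired with genomic features previously determined by a next-generation sequencing (NGS) assay, the MSK-IMPACT targeted panel of 505 genes. This pairing lets the model learn the visual patterns linked to specific biomarkers and molecular changes.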

The system uses a multi-task learning approach built on Virchow2, a foundation model pre-trained on 3 million slides. Instead of training a separate model for each biomarker or cancer type, a single unified network is trained to predict many biomarkers simultaneously. Sharing one network allows knowledge to transfer between cancer types and biomarkers, which can improve overall performance.
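To make the multi-task idea concrete, here is a minimal sketch in plain Python of one shared slide embedding feeding one sigmoid output per biomarker. It is an illustration only, not the paper's architecture: the function names, the toy weights, and the choice to skip biomarkers with no label (a masked binary cross-entropy) are all assumptions, and a real system would use learned weights on top of foundation-model embeddings.

```python
import math

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def predict_biomarkers(embedding, weights, biases):
    """Map one shared slide embedding to one probability per biomarker."""
    probs = []
    for w, b in zip(weights, biases):
        logit = sum(wi * xi for wi, xi in zip(w, embedding)) + b
        probs.append(sigmoid(logit))
    return probs

def masked_bce(probs, labels, eps=1e-12):
    """Mean binary cross-entropy over biomarkers that have a label.

    labels[i] is 1, 0, or None; None means the biomarker was not
    assayed for this slide (an assumption of this sketch).
    """
    losses = []
    for p, y in zip(probs, labels):
        if y is None:
            continue
        p = min(max(p, eps), 1.0 - eps)
        losses.append(-(y * math.log(p) + (1 - y) * math.log(1.0 - p)))
    return sum(losses) / len(losses) if losses else 0.0

# Toy example: a 3-dimensional embedding and 3 hypothetical biomarkers.
embedding = [0.5, -1.0, 2.0]
weights = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 0.0]]
biases = [0.0, 0.0, 0.0]

probs = predict_biomarkers(embedding, weights, biases)
loss = masked_bce(probs, [1, None, 0])  # second biomarker has no label
print(probs, loss)
```

Because every output shares the same embedding, adding a biomarker costs one extra weight vector rather than a whole new model, which is the efficiency argument for the unified approach.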

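The researchers demonstrate the model's ability to predict genetic alterations and detect phenotypic biomarkers across a range of cancer types, including lung, breast, and prostate cancer. The abstract reports 80 high-performing biomarkers with a mean AU-ROC of 0.89 in the 15 most common cancer types, 40 biomarkers strongly associated with specific histologic subtypes, and 58 associated with targets frequently assayed for therapy selection and response prediction. The model can also predict the activity of five canonical signaling pathways, defects in DNA repair, and genomic instability measured by tumor mutation burden, microsatellite instability (MSI), and chromosomal instability (CIN).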
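The paper's abstract summarizes performance as a mean AU-ROC across biomarkers. The sketch below shows one way such a number can be computed: AU-ROC per biomarker as the fraction of positive/negative pairs ranked correctly (ties count half), then an unweighted mean. The biomarker names and scores are made-up toy data, and the paper's exact averaging scheme is not stated in the source, so treat the unweighted mean as an assumption.

```python
def auroc(scores, labels):
    """AU-ROC as the probability that a random positive outranks a random negative."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        return None  # undefined when only one class is present
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

def mean_auroc(per_biomarker):
    """Unweighted mean AU-ROC over biomarkers where it is defined."""
    values = [auroc(s, y) for s, y in per_biomarker.values()]
    values = [v for v in values if v is not None]
    return sum(values) / len(values) if values else None

# Toy data: predicted scores and binary ground-truth labels per biomarker.
toy = {
    "biomarker_a": ([0.9, 0.8, 0.3, 0.1], [1, 1, 0, 0]),
    "biomarker_b": ([0.6, 0.7, 0.5, 0.2], [1, 0, 1, 0]),
    "biomarker_c": ([0.7, 0.2], [1, 1]),  # no negatives, so it is skipped
}

result = mean_auroc(toy)
print(result)
```

The pairwise loop is quadratic in the number of slides, which is fine for a sketch; a rank-based implementation is the usual choice at scale.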

Critical Analysis

The paper presents a promising approach, but there are a few caveats to consider. The models are highly dependent on the quality and comprehensiveness of the training data, which can be challenging to obtain, especially for rare cancer types or biomarkers.

Additionally, while the models demonstrate strong performance on the tested datasets, further validation on real-world, diverse clinical samples would be necessary to assess their true robustness and generalizability.

The researchers also acknowledge the need for additional work to integrate their models into clinical workflows and ensure their interpretability and trustworthiness for medical decision-making.

Conclusion

This research paper introduces a deep learning-based method for high-throughput, pan-cancer screening of genetic and phenotypic biomarkers from standard H&E whole slide images. If successfully implemented, this approach could streamline molecular workups by flagging relevant biomarkers faster, and across more cancer types, than assay-by-assay testing allows.

The findings highlight the potential of AI techniques to extract molecular information from widely available pathology data, paving the way for more efficient and accessible biomarker screening.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Screen Them All: High-Throughput Pan-Cancer Genetic and Phenotypic Biomarker Screening from H&E Whole Slide Images

Yi Kan Wang, Ludmila Tydlitatova, Jeremy D. Kunz, Gerard Oakley, Ran A. Godrich, Matthew C. H. Lee, Chad Vanderbilt, Razik Yousfi, Thomas Fuchs, David S. Klimstra, Siqi Liu

Many molecular alterations serve as clinically prognostic or therapy-predictive biomarkers, typically detected using single or multi-gene molecular assays. However, these assays are expensive, tissue destructive and often take weeks to complete. Using AI on routine H&E WSIs offers a fast and economical approach to screen for multiple molecular biomarkers. We present a high-throughput AI-based system leveraging Virchow2, a foundation model pre-trained on 3 million slides, to interrogate genomic features previously determined by a next-generation sequencing (NGS) assay, using 47,960 scanned hematoxylin and eosin (H&E) whole slide images (WSIs) from 38,984 cancer patients. Unlike traditional methods that train individual models for each biomarker or cancer type, our system employs a unified model to simultaneously predict a wide range of clinically relevant molecular biomarkers across cancer types. By training the network to replicate the MSK-IMPACT targeted biomarker panel of 505 genes, it identified 80 high-performing biomarkers with a mean AU-ROC of 0.89 in the 15 most common cancer types. In addition, 40 biomarkers demonstrated strong associations with specific cancer histologic subtypes. Furthermore, 58 biomarkers were associated with targets frequently assayed clinically for therapy selection and response prediction. The model can also predict the activity of five canonical signaling pathways, identify defects in DNA repair mechanisms, and predict genomic instability measured by tumor mutation burden, microsatellite instability (MSI), and chromosomal instability (CIN). The proposed model can offer potential to guide therapy selection, improve treatment efficacy, accelerate patient screening for clinical trials and provoke the interrogation of new therapeutic targets.

8/21/2024

Efficient and generalizable prediction of molecular alterations in multiple cancer cohorts using H&E whole slide images

Kshitij Ingale, Sun Hae Hong, Qiyuan Hu, Renyu Zhang, Bo Osinski, Mina Khoshdeli, Josh Och, Kunal Nagpal, Martin C. Stumpe, Rohan P. Joshi

Molecular testing of tumor samples for targetable biomarkers is restricted by a lack of standardization, turnaround-time, cost, and tissue availability across cancer types. Additionally, targetable alterations of low prevalence may not be tested in routine workflows. Algorithms that predict DNA alterations from routinely generated hematoxylin and eosin (H&E)-stained images could prioritize samples for confirmatory molecular testing. Costs and the necessity of a large number of samples containing mutations limit approaches that train individual algorithms for each alteration. In this work, models were trained for simultaneous prediction of multiple DNA alterations from H&E images using a multi-task approach. Compared to biomarker-specific models, this approach performed better on average, with pronounced gains for rare mutations. The models reasonably generalized to independent temporal-holdout, externally-stained, and multi-site TCGA test sets. Additionally, whole slide image embeddings derived using multi-task models demonstrated strong performance in downstream tasks that were not a part of training. Overall, this is a promising approach to develop clinically useful algorithms that provide multiple actionable predictions from a single slide.

7/23/2024

Prompting Whole Slide Image Based Genetic Biomarker Prediction

Ling Zhang, Boxiang Yun, Xingran Xie, Qingli Li, Xinxing Li, Yan Wang

Prediction of genetic biomarkers, e.g., microsatellite instability and BRAF in colorectal cancer, is crucial for clinical decision making. In this paper, we propose a whole slide image (WSI) based genetic biomarker prediction method via prompting techniques. Our work aims at addressing the following challenges: (1) extracting foreground instances related to genetic biomarkers from gigapixel WSIs, and (2) the interaction among the fine-grained pathological components in WSIs. Specifically, we leverage large language models to generate medical prompts that serve as prior knowledge in extracting instances associated with genetic biomarkers. We adopt a coarse-to-fine approach to mine biomarker information within the tumor microenvironment. This involves extracting instances related to genetic biomarkers using coarse medical prior knowledge, grouping pathology instances into fine-grained pathological components and mining their interactions. Experimental results on two colorectal cancer datasets show the superiority of our method, achieving 91.49% in AUC for MSI classification. The analysis further shows the clinical interpretability of our method. Code is publicly available at https://github.com/DeepMed-Lab-ECNU/PromptBio.

7/16/2024

Deep learning-based classification of breast cancer molecular subtypes from H&E whole-slide images

Masoud Tafavvoghi, Anders Sildnes, Mehrdad Rakaee, Nikita Shvetsov, Lars Ailo Bongo, Lill-Tove Rasmussen Busund, Kajsa Møllersen

Classifying breast cancer molecular subtypes is crucial for tailoring treatment strategies. While immunohistochemistry (IHC) and gene expression profiling are standard methods for molecular subtyping, IHC can be subjective, and gene profiling is costly and not widely accessible in many regions. Previous approaches have highlighted the potential application of deep learning models on H&E-stained whole slide images (WSI) for molecular subtyping, but these efforts vary in their methods, datasets, and reported performance. In this work, we investigated whether H&E-stained WSIs could be solely leveraged to predict breast cancer molecular subtypes (luminal A, B, HER2-enriched, and Basal). We used 1,433 WSIs of breast cancer in a two-step pipeline: first, classifying tumor and non-tumor tiles to use only the tumor regions for molecular subtyping; and second, employing a One-vs-Rest (OvR) strategy to train four binary OvR classifiers and aggregating their results using an eXtreme Gradient Boosting (XGBoost) model. The pipeline was tested on 221 hold-out WSIs, achieving an overall macro F1 score of 0.95 for tumor detection and 0.73 for molecular subtyping. Our findings suggest that, with further validation, supervised deep learning models could serve as supportive tools for molecular subtyping in breast cancer. Our codes are made available to facilitate ongoing research and development.
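The abstract above describes four One-vs-Rest classifiers whose outputs are aggregated, with results reported as a macro F1 score. As a rough illustration, the sketch below picks the subtype with the highest One-vs-Rest score and then computes macro F1. The highest-score rule is a deliberate simplification standing in for the XGBoost aggregator the authors describe, and all scores and labels are invented toy values.

```python
CLASSES = ["luminal A", "luminal B", "HER2-enriched", "Basal"]

def ovr_predict(scores):
    """Pick the class whose One-vs-Rest score is highest (simplified aggregation)."""
    return max(CLASSES, key=lambda c: scores[c])

def macro_f1(y_true, y_pred, classes):
    """Unweighted mean of per-class F1 scores."""
    f1s = []
    for c in classes:
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != c and p == c)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == c and p != c)
        denom = 2 * tp + fp + fn
        f1s.append(2 * tp / denom if denom else 0.0)
    return sum(f1s) / len(f1s)

# Toy One-vs-Rest scores for five hypothetical slides.
slide_scores = [
    {"luminal A": 0.8, "luminal B": 0.3, "HER2-enriched": 0.1, "Basal": 0.2},
    {"luminal A": 0.6, "luminal B": 0.5, "HER2-enriched": 0.2, "Basal": 0.1},
    {"luminal A": 0.1, "luminal B": 0.2, "HER2-enriched": 0.9, "Basal": 0.3},
    {"luminal A": 0.2, "luminal B": 0.1, "HER2-enriched": 0.3, "Basal": 0.7},
    {"luminal A": 0.3, "luminal B": 0.7, "HER2-enriched": 0.2, "Basal": 0.1},
]
y_true = ["luminal A", "luminal B", "HER2-enriched", "Basal", "luminal B"]

y_pred = [ovr_predict(s) for s in slide_scores]
score = macro_f1(y_true, y_pred, CLASSES)
print(y_pred, score)
```

Macro averaging gives each subtype equal weight regardless of how many slides it has, so a weak result on a rare subtype pulls the score down as much as one on a common subtype.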

9/17/2024