Prompting Whole Slide Image Based Genetic Biomarker Prediction

Read original: arXiv:2407.09540 - Published 7/16/2024 by Ling Zhang, Boxiang Yun, Xingran Xie, Qingli Li, Xinxing Li, Yan Wang

Overview

Predicting genetic biomarkers from whole slide images using text prompts
Developing a deep learning model to translate visual information from pathology images into genetic biomarker predictions
Exploring how text prompts can guide the model's predictions and make them easier to interpret

Plain English Explanation

In this research, the authors investigate a way to predict genetic biomarkers from whole slide images of tissue samples. Genetic biomarkers are indicators in a person's DNA that can help doctors understand their risk of certain diseases or how they might respond to treatment. The paper's abstract gives microsatellite instability (MSI) and BRAF in colorectal cancer as examples.

The key idea is to use a deep learning model that can analyze the visual information in pathology images, such as the structure and appearance of cells and tissues, and translate that into predictions about the patient's genetic makeup. The researchers want to make this process more interpretable by allowing users to provide text prompts that guide the model's predictions and help explain its reasoning.

For example, a user might provide a prompt like "show me the regions of the image that are related to BRCA1 gene mutations." The model would then highlight the relevant areas of the image and explain how those visual features are linked to the genetic biomarker of interest.

This approach could be useful for helping doctors and researchers better understand the connections between what they see in pathology images and the underlying genetic factors that are driving disease processes. By making the model's decision-making more transparent, it could lead to more accurate and trustworthy genetic biomarker predictions.

Technical Explanation

The researchers used a deep learning architecture that takes whole slide images as input and generates predictions about genetic biomarkers. To make the model more interpretable, they incorporated a text prompt system that allows users to guide the model's focus and reasoning.

The model consists of a vision encoder that extracts visual features from the input image, a language encoder that processes the text prompt, and a fusion module that combines the visual and textual information to generate the final biomarker predictions. The model is trained end-to-end on whole slide images paired with genetic data; the paper's abstract reports experiments on two colorectal cancer datasets.
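To make the encoder-plus-fusion idea concrete, here is a minimal sketch in plain Python. It is not the authors' implementation (their code is linked from the paper's abstract). It assumes the vision encoder has already produced one feature vector per image patch and the language encoder has produced one prompt embedding of the same size, and it swaps in a simple prompt-as-query attention pooling as the fusion step. The function name `fuse_and_predict`, the toy vectors, and the classifier weights are all hypothetical.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_and_predict(patch_features, prompt_embedding, classifier_weights, bias=0.0):
    """Attention-pool patch features with the prompt as query, then classify.

    Hypothetical fusion step: an assumption for illustration, not the paper's module.
    """
    dim = len(prompt_embedding)
    # Scaled dot-product score between the prompt and every patch.
    scores = [dot(p, prompt_embedding) / math.sqrt(dim) for p in patch_features]
    attention = softmax(scores)
    # Slide-level feature: attention-weighted sum of the patch features.
    slide_feature = [
        sum(w * p[i] for w, p in zip(attention, patch_features)) for i in range(dim)
    ]
    # Logistic classifier on the pooled feature gives a biomarker probability.
    logit = dot(slide_feature, classifier_weights) + bias
    probability = 1.0 / (1.0 + math.exp(-logit))
    return probability, attention


# Toy inputs: three patches with 2-dimensional features (made-up numbers).
patches = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1]]
prompt = [1.0, 0.0]
prob, attn = fuse_and_predict(patches, prompt, classifier_weights=[2.0, -2.0])
print(prob, attn)
```

In this toy setup, patches whose features align with the prompt receive more attention weight, so they dominate the slide-level feature that the classifier sees.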

During inference, the user can provide a text prompt that specifies the genetic biomarker of interest or the type of visual information they want the model to focus on. The model then uses this prompt to selectively attend to relevant regions of the image and explain its reasoning for the predicted biomarker.
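One simple way to picture prompt-guided region selection is to rank image patches by how similar their features are to the prompt embedding and keep the top few for display. The sketch below does this with cosine similarity. It is an illustrative assumption rather than the paper's method, and the function names and the toy feature vectors are hypothetical.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two vectors; 0.0 if either has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def top_k_patches(patch_features, prompt_embedding, k):
    """Return (patch index, similarity) for the k patches closest to the prompt."""
    scored = [
        (i, cosine_similarity(p, prompt_embedding))
        for i, p in enumerate(patch_features)
    ]
    # Highest similarity first.
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:k]


# Toy patch features and prompt embedding (made-up numbers).
patch_features = [[0.2, 0.9], [1.0, 0.1], [0.7, 0.7], [0.0, 0.0]]
prompt_embedding = [1.0, 0.0]
selected = top_k_patches(patch_features, prompt_embedding, k=2)
print(selected)
```

The returned indices could then be mapped back to patch coordinates on the slide to highlight the regions the prompt points to.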

This interactive interpretation capability allows users to better understand the relationship between the visual features in the pathology image and the underlying genetic markers, potentially leading to more trustworthy and clinically relevant predictions.

Critical Analysis

The authors acknowledge several limitations of their approach, including the reliance on a relatively small dataset of annotated whole slide images and the potential for biases in the training data. They also note that the text prompts may not always fully capture the complex relationships between visual features and genetic biomarkers, and that further research is needed to improve the model's interpretability and generalization capabilities.

Additionally, while the use of text prompts is an interesting approach, it may not be practical or accessible for all users, particularly those without a strong understanding of genetics and pathology. There is a risk that the model could be misused or misinterpreted by non-experts, leading to potentially harmful conclusions.

Further research is also needed to explore the clinical utility and real-world applicability of this approach, as well as to address any ethical concerns related to the use of AI in medical diagnosis and decision-making.

Conclusion

This research represents an important step towards developing interpretable machine learning systems for genetic biomarker prediction from whole slide images. By incorporating text prompts, the authors have created a more transparent and interactive model that can help users better understand the connections between visual features and genetic factors.

While there are still challenges to overcome, the reported results (91.49% AUC for MSI classification on colorectal cancer data) suggest that this work could improve the accuracy and trustworthiness of genetic biomarker predictions, supporting more personalized and effective healthcare. As the field of computational pathology continues to evolve, research like this will be important for realizing the potential of digital pathology and precision medicine.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Prompting Whole Slide Image Based Genetic Biomarker Prediction

Ling Zhang, Boxiang Yun, Xingran Xie, Qingli Li, Xinxing Li, Yan Wang

Prediction of genetic biomarkers, e.g., microsatellite instability and BRAF in colorectal cancer, is crucial for clinical decision making. In this paper, we propose a whole slide image (WSI) based genetic biomarker prediction method via prompting techniques. Our work aims at addressing the following challenges: (1) extracting foreground instances related to genetic biomarkers from gigapixel WSIs, and (2) the interaction among the fine-grained pathological components in WSIs. Specifically, we leverage large language models to generate medical prompts that serve as prior knowledge in extracting instances associated with genetic biomarkers. We adopt a coarse-to-fine approach to mine biomarker information within the tumor microenvironment. This involves extracting instances related to genetic biomarkers using coarse medical prior knowledge, grouping pathology instances into fine-grained pathological components and mining their interactions. Experimental results on two colorectal cancer datasets show the superiority of our method, achieving 91.49% in AUC for MSI classification. The analysis further shows the clinical interpretability of our method. Code is publicly available at https://github.com/DeepMed-Lab-ECNU/PromptBio.

7/16/2024

Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification

Gexin Huang, Chenfei Wu, Mingjie Li, Xiaojun Chang, Ling Chen, Ying Sun, Shen Zhao, Xiaodan Liang, Liang Lin

Predicting genetic mutations from whole slide images is indispensable for cancer diagnosis. However, existing work training multiple binary classification models faces two challenges: (a) Training multiple binary classifiers is inefficient and would inevitably lead to a class imbalance problem. (b) The biological relationships among genes are overlooked, which limits the prediction performance. To tackle these challenges, we innovatively design a Biological-knowledge enhanced PathGenomic multi-label Transformer to improve genetic mutation prediction performances. BPGT first establishes a novel gene encoder that constructs gene priors by two carefully designed modules: (a) A gene graph whose node features are the genes' linguistic descriptions and the cancer phenotype, with edges modeled by genes' pathway associations and mutation consistencies. (b) A knowledge association module that fuses linguistic and biomedical knowledge into gene priors by transformer-based graph representation learning, capturing the intrinsic relationships between different genes' mutations. BPGT then designs a label decoder that finally performs genetic mutation prediction by two tailored modules: (a) A modality fusion module that firstly fuses the gene priors with critical regions in WSIs and obtains gene-wise mutation logits. (b) A comparative multi-label loss that emphasizes the inherent comparisons among mutation status to enhance the discrimination capabilities. Sufficient experiments on The Cancer Genome Atlas benchmark demonstrate that BPGT outperforms the state-of-the-art.

6/6/2024

Screen Them All: High-Throughput Pan-Cancer Genetic and Phenotypic Biomarker Screening from H&E Whole Slide Images

Yi Kan Wang, Ludmila Tydlitatova, Jeremy D. Kunz, Gerard Oakley, Ran A. Godrich, Matthew C. H. Lee, Chad Vanderbilt, Razik Yousfi, Thomas Fuchs, David S. Klimstra, Siqi Liu

Many molecular alterations serve as clinically prognostic or therapy-predictive biomarkers, typically detected using single or multi-gene molecular assays. However, these assays are expensive, tissue destructive and often take weeks to complete. Using AI on routine H&E WSIs offers a fast and economical approach to screen for multiple molecular biomarkers. We present a high-throughput AI-based system leveraging Virchow2, a foundation model pre-trained on 3 million slides, to interrogate genomic features previously determined by a next-generation sequencing (NGS) assay, using 47,960 scanned hematoxylin and eosin (H&E) whole slide images (WSIs) from 38,984 cancer patients. Unlike traditional methods that train individual models for each biomarker or cancer type, our system employs a unified model to simultaneously predict a wide range of clinically relevant molecular biomarkers across cancer types. By training the network to replicate the MSK-IMPACT targeted biomarker panel of 505 genes, it identified 80 high performing biomarkers with a mean AU-ROC of 0.89 in the 15 most common cancer types. In addition, 40 biomarkers demonstrated strong associations with specific cancer histologic subtypes. Furthermore, 58 biomarkers were associated with targets frequently assayed clinically for therapy selection and response prediction. The model can also predict the activity of five canonical signaling pathways, identify defects in DNA repair mechanisms, and predict genomic instability measured by tumor mutation burden, microsatellite instability (MSI), and chromosomal instability (CIN). The proposed model can offer potential to guide therapy selection, improve treatment efficacy, accelerate patient screening for clinical trials and provoke the interrogation of new therapeutic targets.

8/21/2024

Finding Regions of Interest in Whole Slide Images Using Multiple Instance Learning

Martim Afonso, Praphulla M. S. Bhawsar, Monjoy Saha, Jonas S. Almeida, Arlindo L. Oliveira

Whole Slide Images (WSI), obtained by high-resolution digital scanning of microscope slides at multiple scales, are the cornerstone of modern Digital Pathology. However, they represent a particular challenge to AI-based/AI-mediated analysis because pathology labeling is typically done at slide-level, instead of tile-level. Not only are medical diagnostics recorded at the specimen level; the detection of oncogene mutation is also experimentally obtained, and recorded by initiatives like The Cancer Genome Atlas (TCGA), at the slide level. This configures a dual challenge: a) accurately predicting the overall cancer phenotype and b) finding out what cellular morphologies are associated with it at the tile level. To address these challenges, a weakly supervised Multiple Instance Learning (MIL) approach was explored for two prevalent cancer types, Invasive Breast Carcinoma (TCGA-BRCA) and Lung Squamous Cell Carcinoma (TCGA-LUSC). This approach was explored for tumor detection at low magnification levels and TP53 mutations at various levels. Our results show that a novel additive implementation of MIL matched the performance of the reference implementation (AUC 0.96), and was only slightly outperformed by Attention MIL (AUC 0.97). More interestingly from the perspective of the molecular pathologist, these different AI architectures identify distinct sensitivities to morphological features (through the detection of Regions of Interest, RoI) at different amplification levels. Tellingly, TP53 mutation was most sensitive to features at the higher amplifications where cellular morphology is resolved.

4/12/2024