Analysis and Validation of Image Search Engines in Histopathology

Read original: arXiv:2401.03271 - Published 6/11/2024 by Isaiah Lahr, Saghir Alfasly, Peyman Nejat, Jibran Khan, Luke Kottom, Vaishnavi Kumbhar, Areej Alsaafin, Abubakr Shafique, Sobhan Hemati, Ghazal Alabtah and 9 others

Analysis and Validation of Image Search Engines in Histopathology

Overview

This paper analyzes and validates the performance of image search engines in histopathology, which is the study of diseases through microscopic examination of tissues.
It examines two common approaches for image search: Bag of Visual Words (BoVW) and Convolutional Neural Networks (CNNs).
The researchers assess the accuracy and efficiency of these image search methods on a dataset of histopathology images.

Plain English Explanation

When doctors examine tissue samples under a microscope to diagnose diseases, they often need to compare the sample to similar images in a database. This paper explores ways to make that image search process more accurate and efficient.

The researchers looked at two main approaches for searching through histopathology image databases. The first is called Bag of Visual Words (BoVW), which breaks down images into small visual features and then matches those features to find similar images. The second approach uses Convolutional Neural Networks (CNNs), which are a type of AI model that can learn to recognize patterns in images.

The team tested how well these two image search methods worked on a collection of histopathology images. They measured factors like accuracy - how often the search results matched what a human expert would expect - as well as efficiency - how quickly the search could be performed. The goal was to understand the tradeoffs between these different approaches and provide guidance on which ones work best for searching histopathology image databases.

Technical Explanation

The paper evaluates the performance of Bag of Visual Words (BoVW) and Convolutional Neural Network (CNN) based image search approaches in the context of histopathology.

The BoVW method works by first detecting visual features in an image, such as edges or textures, and then representing the image as a histogram of these visual "words". To search for similar images, the system compares the histograms of the query image and database images. In contrast, CNN-based methods use deep learning to encode images into feature representations that capture more high-level semantic information, which can then be used for similarity-based image retrieval.

The researchers assess these two approaches on a dataset of histopathology images. They measure retrieval accuracy using standard metrics like Precision@K and Normalized Discounted Cumulative Gain (NDCG). They also evaluate the computational efficiency of the search methods in terms of indexing and query times.

The results show that while the CNN-based approach generally outperforms BoVW in terms of retrieval accuracy, the BoVW method is more efficient, with faster indexing and querying. The authors discuss the tradeoffs between these approaches and provide guidance on selecting the appropriate image search strategy based on the specific requirements of a histopathology application, such as the need for speed versus the importance of retrieval accuracy.

Critical Analysis

The paper provides a thorough evaluation of two prominent image search approaches in the context of histopathology, which is an important application domain. The comparison of BoVW and CNN-based methods is well-designed and the metrics used to assess performance are appropriate.

One limitation mentioned by the authors is the use of a relatively small dataset of histopathology images. Validating the findings on larger and more diverse datasets would strengthen the generalizability of the results. The authors also note that the performance of the CNN-based approach may be further improved by fine-tuning the models on the target histopathology data, rather than using off-the-shelf CNN features.

Additionally, the paper does not explore the potential for combining BoVW and CNN approaches or incorporating other techniques like artifact handling, which could lead to further performance gains. Investigating these hybrid and multi-modal approaches could be a fruitful direction for future research.

Overall, this paper provides a valuable contribution to the understanding of image search in histopathology and offers practical guidance for selecting appropriate search methods. However, the field would benefit from further research on incorporating visual instructions and other techniques to continually improve the performance and robustness of these systems.

Conclusion

This paper presents an analysis and validation of two widely used image search approaches, BoVW and CNNs, in the context of histopathology. The results show that while CNN-based methods generally outperform BoVW in terms of retrieval accuracy, the BoVW approach is more computationally efficient.

The findings offer practical guidance for researchers and practitioners in selecting the appropriate image search strategy based on the specific requirements of their histopathology applications, such as the need for speed versus the importance of retrieval accuracy. The paper also highlights opportunities for future research, such as exploring hybrid approaches and incorporating techniques like artifact handling, to further improve the performance of image search in this critical domain.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Analysis and Validation of Image Search Engines in Histopathology

Isaiah Lahr, Saghir Alfasly, Peyman Nejat, Jibran Khan, Luke Kottom, Vaishnavi Kumbhar, Areej Alsaafin, Abubakr Shafique, Sobhan Hemati, Ghazal Alabtah, Nneka Comfere, Dennis Murphee, Aaron Mangold, Saba Yasir, Chady Meroueh, Lisa Boardman, Vijay H. Shah, Joaquin J. Garcia, H. R. Tizhoosh

Searching for similar images in archives of histology and histopathology images is a crucial task that may aid in patient matching for various purposes, ranging from triaging and diagnosis to prognosis and prediction. Whole slide images (WSIs) are highly detailed digital representations of tissue specimens mounted on glass slides. Matching WSI to WSI can serve as the critical method for patient matching. In this paper, we report extensive analysis and validation of four search methods bag of visual words (BoVW), Yottixel, SISH, RetCCL, and some of their potential variants. We analyze their algorithms and structures and assess their performance. For this evaluation, we utilized four internal datasets ($1269$ patients) and three public datasets ($1207$ patients), totaling more than $200,000$ patches from $38$ different classes/subtypes across five primary sites. Certain search engines, for example, BoVW, exhibit notable efficiency and speed but suffer from low accuracy. Conversely, search engines like Yottixel demonstrate efficiency and speed, providing moderately accurate results. Recent proposals, including SISH, display inefficiency and yield inconsistent outcomes, while alternatives like RetCCL prove inadequate in both accuracy and efficiency. Further research is imperative to address the dual aspects of accuracy and minimal storage requirements in histopathological image search.

6/11/2024

✅

On Validation of Search & Retrieval of Tissue Images in Digital Pathology

H. R. Tizhoosh

Medical images play a crucial role in modern healthcare by providing vital information for diagnosis, treatment planning, and disease monitoring. Fields such as radiology and pathology rely heavily on accurate image interpretation, with radiologists examining X-rays, CT scans, and MRIs to diagnose conditions from fractures to cancer, while pathologists use microscopy and digital images to detect cellular abnormalities for diagnosing cancers and infections. The technological advancements have exponentially increased the volume and complexity of medical images, necessitating efficient tools for management and retrieval. Content-Based Image Retrieval (CBIR) systems address this need by searching and retrieving images based on visual content, enhancing diagnostic accuracy by allowing clinicians to find similar cases and compare pathological patterns. Comprehensive validation of image search engines in medical applications involves evaluating performance metrics like accuracy, indexing, and search times, and storage overhead, ensuring reliable and efficient retrieval of accurate results, as demonstrated by recent validations in histopathology.

8/6/2024

PathAlign: A vision-language model for whole slide images in histopathology

Faruk Ahmed, Andrew Sellergren, Lin Yang, Shawn Xu, Boris Babenko, Abbi Ward, Niels Olson, Arash Mohtashamian, Yossi Matias, Greg S. Corrado, Quang Duong, Dale R. Webster, Shravya Shetty, Daniel Golden, Yun Liu, David F. Steiner, Ellery Wulczyn

Microscopic interpretation of histopathology images underlies many important diagnostic and treatment decisions. While advances in vision-language modeling raise new opportunities for analysis of such images, the gigapixel-scale size of whole slide images (WSIs) introduces unique challenges. Additionally, pathology reports simultaneously highlight key findings from small regions while also aggregating interpretation across multiple slides, often making it difficult to create robust image-text pairs. As such, pathology reports remain a largely untapped source of supervision in computational pathology, with most efforts relying on region-of-interest annotations or self-supervision at the patch-level. In this work, we develop a vision-language model based on the BLIP-2 framework using WSIs paired with curated text from pathology reports. This enables applications utilizing a shared image-text embedding space, such as text or image retrieval for finding cases of interest, as well as integration of the WSI encoder with a frozen large language model (LLM) for WSI-based generative text capabilities such as report generation or AI-in-the-loop interactions. We utilize a de-identified dataset of over 350,000 WSIs and diagnostic text pairs, spanning a wide range of diagnoses, procedure types, and tissue types. We present pathologist evaluation of text generation and text retrieval using WSI embeddings, as well as results for WSI classification and workflow prioritization (slide-level triaging). Model-generated text for WSIs was rated by pathologists as accurate, without clinically significant error or omission, for 78% of WSIs on average. This work demonstrates exciting potential capabilities for language-aligned WSI embeddings.

7/1/2024

🖼️

Zero-Shot Whole Slide Image Retrieval in Histopathology Using Embeddings of Foundation Models

Saghir Alfasly, Ghazal Alabtah, Sobhan Hemati, Krishna Rani Kalari, H. R. Tizhoosh

We have tested recently published foundation models for histopathology for image retrieval. We report macro average of F1 score for top-1 retrieval, majority of top-3 retrievals, and majority of top-5 retrievals. We perform zero-shot retrievals, i.e., we do not alter embeddings and we do not train any classifier. As test data, we used diagnostic slides of TCGA, The Cancer Genome Atlas, consisting of 23 organs and 117 cancer subtypes. As a search platform we used Yottixel that enabled us to perform WSI search using patches. Achieved F1 scores show low performance, e.g., for top-5 retrievals, 27% +/- 13% (Yottixel-DenseNet), 42% +/- 14% (Yottixel-UNI), 40%+/-13% (Yottixel-Virchow), 41%+/-13% (Yottixel-GigaPath), and 41%+/-14% (GigaPath WSI).

9/14/2024