Cross-Vendor Reproducibility of Radiomics-based Machine Learning Models for Computer-aided Diagnosis

Read original: arXiv:2407.18060 - Published 7/26/2024 by Jatin Chaudhary, Ivan Jambor, Hannu Aronen, Otto Ettala, Jani Saunavaara, Peter Bostrom, Jukka Heikkonen, Rajeev Kanth, Harri Merisaari
Total Score

0

Cross-Vendor Reproducibility of Radiomics-based Machine Learning Models for Computer-aided Diagnosis

Sign in to get full access

or

If you already have an account, we'll log you in

Overview

  • The paper examines the reproducibility of radiomics-based machine learning models for computer-aided diagnosis across different medical imaging vendors.
  • Radiomics is the high-throughput extraction and analysis of quantitative imaging features from medical images.
  • The goal is to assess how well these models perform when applied to data from different imaging vendors, as this is crucial for their real-world deployment.

Plain English Explanation

Radiomics is a technique that uses machine learning to analyze medical images and extract a large number of quantitative features. These features can then be used to build predictive models for diagnosing diseases like cancer. The paper discusses the importance of ensuring these radiomics-based machine learning models can work consistently across different medical imaging devices, which are made by various vendors. If the models fail to perform well on data from different vendors, it would limit their usefulness in real-world clinical settings where patients may undergo scans on equipment from various manufacturers. The researchers investigated how well these radiomics models can generalize and maintain their predictive performance when applied to data from multiple imaging vendors.

Technical Explanation

The researchers used prostate MRI data from three different imaging vendors to train and evaluate radiomics-based machine learning models for prostate cancer diagnosis. They extracted a large number of quantitative imaging features from the scans and used these features to build predictive models. They then tested how well these models performed when applied to data from the other two vendors that were not used in the initial model training. This allowed them to assess the cross-vendor reproducibility of the radiomics-based machine learning approach.

Critical Analysis

The paper provides a thorough evaluation of the cross-vendor reproducibility of radiomics-based machine learning models. However, the authors acknowledge that their analysis was limited to a single organ (prostate) and a specific imaging modality (MRI). Further research is needed to determine if the findings generalize to other disease contexts and imaging modalities. Additionally, the paper does not explore potential reasons why the models may have failed to achieve consistent performance across vendors, which could help guide future work in this area.

Conclusion

This study demonstrates the importance of evaluating the cross-vendor reproducibility of radiomics-based machine learning models for computer-aided diagnosis. The findings suggest that while these models can be effective, their performance may vary depending on the imaging vendor used. Ensuring the robustness of radiomics models across different imaging platforms is a crucial step toward their successful real-world deployment and clinical adoption.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Cross-Vendor Reproducibility of Radiomics-based Machine Learning Models for Computer-aided Diagnosis
Total Score

0

Cross-Vendor Reproducibility of Radiomics-based Machine Learning Models for Computer-aided Diagnosis

Jatin Chaudhary, Ivan Jambor, Hannu Aronen, Otto Ettala, Jani Saunavaara, Peter Bostrom, Jukka Heikkonen, Rajeev Kanth, Harri Merisaari

Background: The reproducibility of machine-learning models in prostate cancer detection across different MRI vendors remains a significant challenge. Methods: This study investigates Support Vector Machines (SVM) and Random Forest (RF) models trained on radiomic features extracted from T2-weighted MRI images using Pyradiomics and MRCradiomics libraries. Feature selection was performed using the maximum relevance minimum redundancy (MRMR) technique. We aimed to enhance clinical decision support through multimodal learning and feature fusion. Results: Our SVM model, utilizing combined features from Pyradiomics and MRCradiomics, achieved an AUC of 0.74 on the Multi-Improd dataset (Siemens scanner) but decreased to 0.60 on the Philips test set. The RF model showed similar trends, with notable robustness for models using Pyradiomics features alone (AUC of 0.78 on Philips). Conclusions: These findings demonstrate the potential of multimodal feature integration to improve the robustness and generalizability of machine-learning models for clinical decision support in prostate cancer detection. This study marks a significant step towards developing reliable AI-driven diagnostic tools that maintain efficacy across various imaging platforms.

Read more

7/26/2024

Texture Feature Analysis for Classification of Early-Stage Prostate Cancer in mpMRI
Total Score

0

Texture Feature Analysis for Classification of Early-Stage Prostate Cancer in mpMRI

Asmail Muftah, S M Schirmer, Frank C Langbein

Magnetic resonance imaging (MRI) has become a crucial tool in the diagnosis and staging of prostate cancer, owing to its superior tissue contrast. However, it also creates large volumes of data that must be assessed by trained experts, a time-consuming and laborious task. This has prompted the development of machine learning tools for the automation of Prostate cancer (PCa) risk classification based on multiple MRI modalities (T2W, ADC, and high-b-value DWI). Understanding and interpreting the predictions made by the models, however, remains a challenge. We analyze Random Forests (RF) and Support Vector Machines (SVM), for two complementary datasets, the public Prostate-X dataset, and an in-house, mostly early-stage PCa dataset to elucidate the contributions made by first-order statistical features, Haralick texture features, and local binary patterns to the classification. Using correlation analysis and Shapley impact scores, we find that many of the features typically used are strongly correlated, and that the majority of features have negligible impact on the classification. We identify a small set of features that determine the classification outcome, which may aid the development of explainable AI approaches.

Read more

6/26/2024

Unraveling Radiomics Complexity: Strategies for Optimal Simplicity in Predictive Modeling
Total Score

0

Unraveling Radiomics Complexity: Strategies for Optimal Simplicity in Predictive Modeling

Mahdi Ait Lhaj Loutfi, Teodora Boblea Podasca, Alex Zwanenburg, Taman Upadhaya, Jorge Barrios, David R. Raleigh, William C. Chen, Dante P. I. Capaldi, Hong Zheng, Olivier Gevaert, Jing Wu, Alvin C. Silva, Paul J. Zhang, Harrison X. Bai, Jan Seuntjens, Steffen Lock, Patrick O. Richard, Olivier Morin, Caroline Reinhold, Martin Lepage, Martin Valli`eres

Background: The high dimensionality of radiomic feature sets, the variability in radiomic feature types and potentially high computational requirements all underscore the need for an effective method to identify the smallest set of predictive features for a given clinical problem. Purpose: Develop a methodology and tools to identify and explain the smallest set of predictive radiomic features. Materials and Methods: 89,714 radiomic features were extracted from five cancer datasets: low-grade glioma, meningioma, non-small cell lung cancer (NSCLC), and two renal cell carcinoma cohorts (n=2104). Features were categorized by computational complexity into morphological, intensity, texture, linear filters, and nonlinear filters. Models were trained and evaluated on each complexity level using the area under the curve (AUC). The most informative features were identified, and their importance was explained. The optimal complexity level and associated most informative features were identified using systematic statistical significance analyses and a false discovery avoidance procedure, respectively. Their predictive importance was explained using a novel tree-based method. Results: MEDimage, a new open-source tool, was developed to facilitate radiomic studies. Morphological features were optimal for MRI-based meningioma (AUC: 0.65) and low-grade glioma (AUC: 0.68). Intensity features were optimal for CECT-based renal cell carcinoma (AUC: 0.82) and CT-based NSCLC (AUC: 0.76). Texture features were optimal for MRI-based renal cell carcinoma (AUC: 0.72). Tuning the Hounsfield unit range improved results for CECT-based renal cell carcinoma (AUC: 0.86). Conclusion: Our proposed methodology and software can estimate the optimal radiomics complexity level for specific medical outcomes, potentially simplifying the use of radiomics in predictive modeling across various contexts.

Read more

7/9/2024

🔗

Total Score

0

Towards robust radiomics and radiogenomics predictive models for brain tumor characterization

Maria Nadeem, Asma Shaheen, Muhammad F. A. Chaudhary, Hassan Mohy-ud-Din

In the context of brain tumor characterization, we focused on two key questions: (a) stability of radiomics features to variability in multiregional segmentation masks obtained with fully-automatic deep segmentation methods and (b) subsequent impact on predictive performance on downstream tasks: IDH prediction and Overall Survival (OS) classification. We further constrained our study to limited computational resources setting which are found in underprivileged, remote, and (or) resource-starved clinical sites in developing countries. We employed seven SOTA CNNs which can be trained with limited computational resources and have demonstrated superior segmentation performance on BraTS challenge. Subsequent selection of discriminatory features was done with RFE-SVM and MRMR. Our study revealed that highly stable radiomics features were: (1) predominantly texture features (79.1%), (2) mainly extracted from WT region (96.1%), and (3) largely representing T1Gd (35.9%) and T1 (28%) sequences. Shape features and radiomics features extracted from the ENC subregion had the lowest average stability. Stability filtering minimized non-physiological variability in predictive models as indicated by an order-of-magnitude decrease in the relative standard deviation of AUCs. The non-physiological variability is attributed to variability in multiregional segmentation maps obtained with fully-automatic CNNs. Stability filtering significantly improved predictive performance on the two downstream tasks substantiating the inevitability of learning novel radiomics and radiogenomics models with stable discriminatory features. The study (implicitly) demonstrates the importance of suboptimal deep segmentation networks which can be exploited as auxiliary networks for subsequent identification of radiomics features stable to variability in automatically generated multiregional segmentation maps.

Read more

6/12/2024