Clinical Validation of a Real-Time Machine Learning-based System for the Detection of Acute Myeloid Leukemia by Flow Cytometry

Read original: arXiv:2409.11350 - Published 9/18/2024 by Lauren M. Zuromski, Jacob Durtschi, Aimal Aziz, Jeffrey Chumley, Mark Dewey, Paul English, Muir Morrison, Keith Simmon, Blaine Whipple, Brendan O'Fallon and 1 other
Total Score

0

Sign in to get full access

or

If you already have an account, we'll log you in

Overview

  • Machine learning (ML) models can improve flow cytometry in clinical labs by reducing errors, increasing reproducibility, and boosting efficiency.
  • While many ML models for flow cytometry have been proposed, few studies have described their actual deployment in clinical settings.
  • Realizing the benefits of ML models in clinics requires not just an accurate model, but also infrastructure for automated inference, error detection, analytics, monitoring, and structured data extraction.

Plain English Explanation

Machine learning models for flow cytometry have the potential to significantly enhance clinical laboratories. These models can help reduce mistakes, make results more consistent, and make lab workflows more efficient. However, most research on these models has focused on developing them, rather than on actually deploying them in real-world clinical settings.

To get the full benefits of ML models in clinics, it's not enough to just have an accurate model. Clinics also need the right infrastructure to put the model into practice. This includes systems for automatically running the model on new data, detecting errors, analyzing the results, monitoring the model's performance over time, and extracting structured medical information from unstructured reports.

In this paper, the researchers describe an ML model they developed for detecting a type of cancer called acute myeloid leukemia (AML), along with the infrastructure they built to support using this model in a clinical setting. Their infrastructure leverages cloud computing for scalable model inference, a workflow system based on Kubernetes to ensure reproducibility and manage resources, and a tool to extract structured diagnostic information from text reports.

The researchers also discuss their model monitoring and visualization platform, which is crucial for ensuring the model maintains its accuracy over time. Finally, they present an analysis of how using this system has impacted turnaround times and how the real-world accuracy compares to the original validation results.

Technical Explanation

The researchers developed an ML model for detecting acute myeloid leukemia (AML) from flow cytometry data. To enable the clinical deployment of this model, they built a supporting infrastructure with several key components:

  • Cloud-based model inference: The researchers leveraged the scalability and resilience of cloud computing to run the ML model for inference on new flow cytometry samples.
  • Kubernetes-based workflow system: This system ensured the reproducibility of the model execution and provided efficient resource management.
  • Structured data extraction: The researchers developed a tool to extract structured diagnostic information from the unstructured text of full medical reports.

In addition to these core components, the researchers also built a model monitoring and visualization platform. This is an essential element for ensuring the model maintains its accuracy over time in the real-world clinical setting.

The researchers present a post-deployment analysis that looks at the impact of this system on turnaround time and compares the real-world accuracy to the original validation statistics.

Critical Analysis

The paper demonstrates a comprehensive approach to deploying an ML model for flow cytometry in a clinical setting, addressing key challenges beyond just model development. The researchers have thoughtfully designed the supporting infrastructure to ensure the model can be used reliably and effectively in practice.

However, the paper does not delve into potential limitations or caveats of their approach. For example, it would be helpful to understand how the model and infrastructure perform under different workloads or with varying data quality, and what steps were taken to ensure patient privacy and data security.

Additionally, the paper does not discuss any plans for further research or improvements to the system. It would be valuable to understand how the researchers plan to iterate on this work, such as exploring ways to make the model more robust, expanding the range of conditions it can detect, or enhancing the monitoring and analytics capabilities.

Conclusion

This paper presents a promising approach for deploying ML models in clinical flow cytometry laboratories. By developing a comprehensive infrastructure that addresses key practical challenges, the researchers have demonstrated a path for realizing the benefits of ML, such as reduced errors, improved reproducibility, and increased efficiency.

The detailed description of the system components and the post-deployment analysis provide valuable insights for other researchers and clinicians interested in applying ML to flow cytometry and other medical domains. As the field of clinical ML continues to evolve, this work highlights the importance of moving beyond model development and focusing on the end-to-end systems required for successful real-world deployment.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Total Score

0

New!Clinical Validation of a Real-Time Machine Learning-based System for the Detection of Acute Myeloid Leukemia by Flow Cytometry

Lauren M. Zuromski, Jacob Durtschi, Aimal Aziz, Jeffrey Chumley, Mark Dewey, Paul English, Muir Morrison, Keith Simmon, Blaine Whipple, Brendan O'Fallon, David P. Ng

Machine-learning (ML) models in flow cytometry have the potential to reduce error rates, increase reproducibility, and boost the efficiency of clinical labs. While numerous ML models for flow cytometry data have been proposed, few studies have described the clinical deployment of such models. Realizing the potential gains of ML models in clinical labs requires not only an accurate model, but infrastructure for automated inference, error detection, analytics and monitoring, and structured data extraction. Here, we describe an ML model for detection of Acute Myeloid Leukemia (AML), along with the infrastructure supporting clinical implementation. Our infrastructure leverages the resilience and scalability of the cloud for model inference, a Kubernetes-based workflow system that provides model reproducibility and resource management, and a system for extracting structured diagnoses from full-text reports. We also describe our model monitoring and visualization platform, an essential element for ensuring continued model accuracy. Finally, we present a post-deployment analysis of impacts on turn-around time and compare production accuracy to the original validation statistics.

Read more

9/18/2024

Automated Immunophenotyping Assessment for Diagnosing Childhood Acute Leukemia using Set-Transformers
Total Score

0

Automated Immunophenotyping Assessment for Diagnosing Childhood Acute Leukemia using Set-Transformers

Elpiniki Maria Lygizou, Michael Reiter, Margarita Maurer-Granofszky, Michael Dworzak, Radu Grosu

Acute Leukemia is the most common hematologic malignancy in children and adolescents. A key methodology in the diagnostic evaluation of this malignancy is immunophenotyping based on Multiparameter Flow Cytometry (FCM). However, this approach is manual, and thus time-consuming and subjective. To alleviate this situation, we propose in this paper the FCM-Former, a machine learning, self-attention based FCM-diagnostic tool, automating the immunophenotyping assessment in Childhood Acute Leukemia. The FCM-Former is trained in a supervised manner, by directly using flow cytometric data. Our FCM-Former achieves an accuracy of 96.5% assigning lineage to each sample among 960 cases of either acute B-cell, T-cell lymphoblastic, and acute myeloid leukemia (B-ALL, T-ALL, AML). To the best of our knowledge, the FCM-Former is the first work that automates the immunophenotyping assessment with FCM data in diagnosing pediatric Acute Leukemia.

Read more

6/27/2024

Low dimensional representation of multi-patient flow cytometry datasets using optimal transport for minimal residual disease detection in leukemia
Total Score

0

Low dimensional representation of multi-patient flow cytometry datasets using optimal transport for minimal residual disease detection in leukemia

Erell Gachon, J'er'emie Bigot, Elsa Cazelles, Aguirre Mimoun, Jean-Philippe Vial

Representing and quantifying Minimal Residual Disease (MRD) in Acute Myeloid Leukemia (AML), a type of cancer that affects the blood and bone marrow, is essential in the prognosis and follow-up of AML patients. As traditional cytological analysis cannot detect leukemia cells below 5%, the analysis of flow cytometry dataset is expected to provide more reliable results. In this paper, we explore statistical learning methods based on optimal transport (OT) to achieve a relevant low-dimensional representation of multi-patient flow cytometry measurements (FCM) datasets considered as high-dimensional probability distributions. Using the framework of OT, we justify the use of the K-means algorithm for dimensionality reduction of multiple large-scale point clouds through mean measure quantization by merging all the data into a single point cloud. After this quantization step, the visualization of the intra and inter-patients FCM variability is carried out by embedding low-dimensional quantized probability measures into a linear space using either Wasserstein Principal Component Analysis (PCA) through linearized OT or log-ratio PCA of compositional data. Using a publicly available FCM dataset and a FCM dataset from Bordeaux University Hospital, we demonstrate the benefits of our approach over the popular kernel mean embedding technique for statistical learning from multiple high-dimensional probability distributions. We also highlight the usefulness of our methodology for low-dimensional projection and clustering patient measurements according to their level of MRD in AML from FCM. In particular, our OT-based approach allows a relevant and informative two-dimensional representation of the results of the FlowSom algorithm, a state-of-the-art method for the detection of MRD in AML using multi-patient FCM.

Read more

7/25/2024

🤿

Total Score

0

Deep Learning Algorithms for Early Diagnosis of Acute Lymphoblastic Leukemia

Dimitris Papaioannou, Ioannis Christou, Nikos Anagnou, Aristotelis Chatziioannou

Acute lymphoblastic leukemia (ALL) is a form of blood cancer that affects the white blood cells. ALL constitutes approximately 25% of pediatric cancers. Early diagnosis and treatment of ALL are crucial for improving patient outcomes. The task of identifying immature leukemic blasts from normal cells under the microscope can prove challenging, since the images of a healthy and cancerous cell appear similar morphologically. In this study, we propose a binary image classification model to assist in the diagnostic process of ALL. Our model takes as input microscopic images of blood samples and outputs a binary prediction of whether the sample is normal or cancerous. Our dataset consists of 10661 images out of 118 subjects. Deep learning techniques on convolutional neural network architectures were used to achieve accurate classification results. Our proposed method achieved 94.3% accuracy and could be used as an assisting tool for hematologists trying to predict the likelihood of a patient developing ALL.

Read more

7/16/2024