Probing Perfection: The Relentless Art of Meddling for Pulmonary Airway Segmentation from HRCT via a Human-AI Collaboration Based Active Learning Method

Read original: arXiv:2407.03542 - Published 7/24/2024 by Shiyi Wang, Yang Nan, Sheng Zhang, Federico Felder, Xiaodan Xing, Yingying Fang, Javier Del Ser, Simon L F Walsh, Guang Yang


Overview

Tracheal segmentation in medical imaging faces challenges due to limited annotated data
Deep learning (DL) methods are opaque "black box" models and still leave room for performance improvement
The authors propose Human-Computer Interaction (HCI) based models to address these challenges

Plain English Explanation

The goal of this research is to improve tracheal segmentation in medical imaging. Tracheal segmentation is the task of identifying the trachea (windpipe) and the airways that branch from it in high-resolution CT (HRCT) scans. This is a difficult problem because labeled training data is scarce: medical experts have not annotated enough examples to train accurate AI models.

Additionally, current deep learning techniques used for medical image segmentation have some drawbacks. Their inner workings are like a "black box" that is hard for humans to understand. There is also a need to enhance the performance of these models.

To address these challenges, the researchers developed Human-Computer Interaction (HCI) based models. These models combine different strategies for selecting which unlabeled samples to have experts label, in order to most efficiently improve the model's performance with limited training data. The key steps are:

Query Strategy: The HCI models use techniques like Wasserstein Distance, Least Confidence, Entropy Sampling, and Random Sampling to identify the most valuable unlabeled samples for experts to label.
Central line correction: Experts correct the tracheal centerlines generated by the model after each training round.
Update training dataset: Experts update the training dataset after each training round, which improves the model's trustworthiness and performance.
Model training: The HCI models are trained using an enhanced version of the U-Net architecture.

The experimental results show that these HCI-based approaches match or outperform state-of-the-art DL models. Notably, WD-UNet does so using only 15-35% of the training data, which cuts the time experts spend annotating by 65-85%.

Technical Explanation

The researchers developed four HCI-based models for pulmonary tracheal segmentation: WD-UNet, LC-UNet, UUNet, and RS-UNet. These models combine different active learning query strategies with an enhanced U-Net deep learning architecture.

The query strategies used are:

Wasserstein Distance (WD): Selects unlabeled samples that are farthest from the current training data distribution.
Least Confidence (LC): Selects unlabeled samples where the model is least confident in its predictions.
Entropy Sampling (UUNet): Selects unlabeled samples with the highest uncertainty, measured by Shannon entropy.
Random Sampling (RS): Selects random unlabeled samples.
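
The four query strategies above can be sketched as scoring functions over a pool of unlabeled scans. This is a minimal illustration, not the authors' implementation: it assumes each scan is summarized by an array of per-voxel foreground probabilities, that per-voxel scores are averaged into one score per scan, and that the Wasserstein score compares a scan's values against a reference sample drawn from the labeled set. All function names are hypothetical.

```python
import numpy as np

def least_confidence(probs):
    """Mean of (1 - confidence), where confidence is the larger of p and 1 - p."""
    p = np.asarray(probs, dtype=float)
    confidence = np.maximum(p, 1.0 - p)
    return float(1.0 - confidence.mean())

def entropy_score(probs, eps=1e-12):
    """Mean binary Shannon entropy (in nats) over all voxels."""
    p = np.clip(np.asarray(probs, dtype=float), eps, 1.0 - eps)
    h = -(p * np.log(p) + (1.0 - p) * np.log(1.0 - p))
    return float(h.mean())

def wasserstein_1d(sample_a, sample_b, n_quantiles=101):
    """Approximate 1-D Wasserstein-1 distance by comparing quantiles.

    Assumption: both inputs are flattened to 1-D value samples.
    """
    u = np.linspace(0.0, 1.0, n_quantiles)
    qa = np.quantile(np.ravel(sample_a), u)
    qb = np.quantile(np.ravel(sample_b), u)
    return float(np.abs(qa - qb).mean())

def select_top_k(scores, k):
    """Indices of the k highest-scoring candidates, best first."""
    order = np.argsort(np.asarray(scores, dtype=float))[::-1]
    return [int(i) for i in order[:k]]

def random_sample(n_candidates, k, seed=0):
    """Random Sampling baseline: k distinct indices chosen uniformly."""
    rng = np.random.default_rng(seed)
    return rng.choice(n_candidates, size=k, replace=False).tolist()
```

A scan whose probabilities sit near 0.5 scores high under both Least Confidence and entropy, so it would be sent to the expert first. Random Sampling ignores the model entirely and serves as the baseline.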

After each model training round, experts correct the tracheal centerlines generated by the model. The training dataset is then updated, and the model is retrained on the improved dataset. This iterative human-in-the-loop process continues, gradually enhancing the model's trustworthiness and performance.
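
The iterative process described above can be sketched as a generic loop. This is a sketch under stated assumptions rather than the paper's pipeline: `train`, `score`, and `expert_correct` are hypothetical callables standing in for U-Net training, a query strategy, and the expert's centerline correction, and samples are referred to by integer IDs.

```python
from typing import Any, Callable, List, Sequence, Set, Tuple

def active_learning_loop(
    pool_ids: Sequence[int],
    initial_labeled: Sequence[int],
    train: Callable[[List[int]], Any],         # hypothetical: fit a model on labeled IDs
    score: Callable[[Any, int], float],        # hypothetical: query-strategy score
    expert_correct: Callable[[Any, int], None],  # hypothetical: expert fixes one sample
    rounds: int,
    k: int,
) -> Tuple[Set[int], List[List[int]]]:
    """Run `rounds` iterations of train -> query -> correct -> update."""
    labeled = set(initial_labeled)
    unlabeled = [i for i in pool_ids if i not in labeled]
    history: List[List[int]] = []
    for _ in range(rounds):
        model = train(sorted(labeled))  # model training on the current dataset
        if not unlabeled:
            break
        # Query strategy: rank the unlabeled pool, highest score first.
        ranked = sorted(unlabeled, key=lambda i: score(model, i), reverse=True)
        picked = ranked[:k]
        for i in picked:
            expert_correct(model, i)  # expert corrects the model's output
        labeled.update(picked)  # update the training dataset
        unlabeled = [i for i in unlabeled if i not in labeled]
        history.append(picked)
    return labeled, history

if __name__ == "__main__":
    corrected: List[int] = []
    final_labeled, picks = active_learning_loop(
        pool_ids=range(10),
        initial_labeled=[0, 1],
        train=lambda ids: len(ids),          # toy "model": size of the labeled set
        score=lambda model, i: float(i),     # toy score: prefer larger IDs
        expert_correct=lambda model, i: corrected.append(i),
        rounds=2,
        k=3,
    )
    print(sorted(final_labeled), picks)
```

Passing the query strategy in as `score` keeps the loop identical across the four models, so only the ranking rule changes between them.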

The experiments show that the HCI-based models achieve comparable or superior performance to state-of-the-art DL methods. Notably, WD-UNet achieves this with only 15-35% of the training data, which corresponds to a 65-85% reduction in expert annotation time.
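
The two ranges quoted for WD-UNet are complements of each other. The short sketch below makes that arithmetic explicit, under the assumption (not stated in the source) that annotation time scales linearly with the number of annotated scans. The function name is hypothetical.

```python
def annotation_time_saved(data_fraction: float) -> float:
    """Fraction of annotation time saved when only `data_fraction` of the data is labeled.

    Assumption: annotation time is proportional to the amount of labeled data.
    """
    if not 0.0 <= data_fraction <= 1.0:
        raise ValueError("data_fraction must be between 0 and 1")
    return 1.0 - data_fraction

for fraction in (0.15, 0.35):
    saved = annotation_time_saved(fraction)
    print(f"{fraction:.0%} of the data -> {saved:.0%} less annotation time")
```

Under that assumption, labeling 15% of the data saves 85% of the annotation time and labeling 35% saves 65%, which matches the reported 65-85% range.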

Critical Analysis

The paper presents a valuable approach to addressing the challenges of limited annotated data and black box opacity in medical image segmentation. The proposed HCI-based models leverage active learning techniques to efficiently utilize expert feedback and iteratively improve the models.

One potential limitation is that the study only evaluates the models on tracheal segmentation, so the generalizability to other medical segmentation tasks is unclear. Additionally, the paper does not provide much detail on the specific implementation of the central line correction process or the integration of expert feedback.

Further research could explore how these HCI-based methods scale to larger and more diverse medical imaging datasets, as well as investigate ways to better incorporate expert knowledge and reasoning into the model training process. Exploring the interpretability of the models' decision-making could also be a valuable area of investigation.

Overall, this research demonstrates a promising direction for enhancing the performance and trustworthiness of deep learning models in medical imaging applications, while reducing the burden on expert annotators.

Conclusion

This paper presents a novel approach to pulmonary tracheal segmentation that addresses key limitations of current deep learning methods. By combining diverse active learning strategies with an enhanced U-Net architecture, the proposed HCI-based models (WD-UNet, LC-UNet, UUNet, and RS-UNet) achieve strong performance while requiring significantly less training data annotated by experts.

The iterative human-in-the-loop process of data curation and model retraining is a compelling way to improve the trustworthiness and real-world applicability of deep learning in medical imaging. This research highlights the value of integrating human expertise and interactive feedback into the development of AI systems for healthcare applications.



Related Papers


Probing Perfection: The Relentless Art of Meddling for Pulmonary Airway Segmentation from HRCT via a Human-AI Collaboration Based Active Learning Method

Shiyi Wang, Yang Nan, Sheng Zhang, Federico Felder, Xiaodan Xing, Yingying Fang, Javier Del Ser, Simon L F Walsh, Guang Yang

In pulmonary tracheal segmentation, the scarcity of annotated data is a prevalent issue in medical segmentation. Additionally, Deep Learning (DL) methods face challenges: the opacity of 'black box' models and the need for performance enhancement. Our Human-Computer Interaction (HCI) based models (RS_UNet, LC_UNet, UUNet, and WD_UNet) address these challenges by combining diverse query strategies with various DL models. We train four HCI models and repeat these steps: (1) Query Strategy: The HCI models select samples that provide the most additional representative information when labeled in each iteration and identify unlabeled samples with the greatest predictive disparity using Wasserstein Distance, Least Confidence, Entropy Sampling, and Random Sampling. (2) Central line correction: Selected samples are used for expert correction of system-generated tracheal central lines in each training round. (3) Update training dataset: Experts update the training dataset after each DL model's training epoch, enhancing the trustworthiness and performance of the models. (4) Model training: The HCI model is trained using the updated dataset and an enhanced UNet version. Experimental results confirm the effectiveness of these HCI-based approaches, showing that WD-UNet, LC-UNet, UUNet, and RS-UNet achieve comparable or superior performance to state-of-the-art DL models. Notably, WD-UNet achieves this with only 15%-35% of the training data, reducing physician annotation time by 65%-85%.

7/24/2024

A comparative analysis of deep learning models for lung segmentation on X-ray images

Weronika Hryniewska-Guzik, Jakub Bilski, Bartosz Chrostowski, Jakub Drak Sbahi, Przemysław Biecek

Robust and highly accurate lung segmentation in X-rays is crucial in medical imaging. This study evaluates deep learning solutions for this task, ranking existing methods and analyzing their performance under diverse image modifications. Out of 61 analyzed papers, only nine offered implementation or pre-trained models, enabling assessment of three prominent methods: Lung VAE, TransResUNet, and CE-Net. The analysis revealed that CE-Net performs best, demonstrating the highest values in dice similarity coefficient and intersection over union metric.

4/10/2024

MedUHIP: Towards Human-In-the-Loop Medical Segmentation

Jiayuan Zhu, Junde Wu

Although segmenting natural images has shown impressive performance, these techniques cannot be directly applied to medical image segmentation. Medical image segmentation is particularly complicated by inherent uncertainties. For instance, the ambiguous boundaries of tissues can lead to diverse but plausible annotations from different clinicians. These uncertainties cause significant discrepancies in clinical interpretations and impact subsequent medical interventions. Therefore, achieving quantitative segmentations from uncertain medical images becomes crucial in clinical practice. To address this, we propose a novel approach that integrates an uncertainty-aware model with human-in-the-loop interaction. The uncertainty-aware model proposes several plausible segmentations to address the uncertainties inherent in medical images, while the human-in-the-loop interaction iteratively modifies the segmentation under clinician supervision. This collaborative model ensures that segmentation is not solely dependent on automated techniques but is also refined through clinician expertise. As a result, our approach represents a significant advancement in the field which enhances the safety of medical image segmentation. It not only offers a comprehensive solution to produce quantitative segmentation from inherent uncertain medical images, but also establishes a synergistic balance between algorithmic precision and clinician knowledge. We evaluated our method on various publicly available multi-clinician annotated datasets: REFUGE2, LIDC-IDRI and QUBIQ. Our method showcases superior segmentation capabilities, outperforming a wide range of deterministic and uncertainty-aware models. We also demonstrated that our model produced significantly better results with fewer interactions compared to previous interactive models. We will release the code to foster further research in this area.

8/6/2024

Segmentation of Non-Small Cell Lung Carcinomas: Introducing DRU-Net and Multi-Lens Distortion

Soroush Oskouei, Marit Valla, André Pedersen, Erik Smistad, Vibeke Grotnes Dale, Maren Høibø, Sissel Gyrid Freim Wahl, Mats Dehli Haugum, Thomas Langø, Maria Paula Ramnefjell, Lars Andreas Akslen, Gabriel Kiss, Hanne Sorger

Considering the increased workload in pathology laboratories today, automated tools such as artificial intelligence models can help pathologists with their tasks and ease the workload. In this paper, we are proposing a segmentation model (DRU-Net) that can provide a delineation of human non-small cell lung carcinomas and an augmentation method that can improve classification results. The proposed model is a fused combination of truncated pre-trained DenseNet201 and ResNet101V2 as a patch-wise classifier followed by a lightweight U-Net as a refinement model. We have used two datasets (Norwegian Lung Cancer Biobank and Haukeland University Hospital lung cancer cohort) to create our proposed model. The DRU-Net model achieves an average of 0.91 Dice similarity coefficient. The proposed spatial augmentation method (multi-lens distortion) improved the network performance by 3%. Our findings show that choosing image patches that specifically include regions of interest leads to better results for the patch-wise classifier compared to other sampling methods. The qualitative analysis showed that the DRU-Net model is generally successful in detecting the tumor. On the test set, some of the cases showed areas of false positive and false negative segmentation in the periphery, particularly in tumors with inflammatory and reactive changes.

6/21/2024