CheXmask: a large-scale dataset of anatomical segmentation masks for multi-center chest x-ray images

Read original: arXiv:2307.03293 - Published 5/15/2024 by Nicol'as Gaggion, Candelaria Mosquera, Lucas Mansilla, Julia Mariel Saidman, Martina Aineseder, Diego H. Milone, Enzo Ferrante

CheXmask: a large-scale dataset of anatomical segmentation masks for multi-center chest x-ray images

Overview

This paper presents CheXmask, a large-scale dataset of anatomical segmentation masks for multi-center chest X-ray images.
The dataset includes segmentation masks for 17 different anatomical structures, including the lungs, heart, and major blood vessels.
The goal of the dataset is to enable more accurate and interpretable machine learning models for medical image analysis, particularly for chest X-ray diagnosis and disease detection.

Plain English Explanation

The researchers have created a new dataset called CheXmask that contains detailed outlines, or "masks," of different anatomical structures in thousands of chest X-ray images. This includes things like the lungs, heart, and major blood vessels.

Having this extra information about the anatomy can help improve the performance of AI models that are trained to analyze chest X-rays. Typically, these models just look at the overall X-ray image, but with CheXmask, they can also focus on specific anatomical regions that may be important for detecting diseases or other medical conditions.

The researchers believe that this dataset will be very useful for developing more accurate and interpretable AI models for medical image analysis. By understanding exactly which parts of the X-ray the model is looking at, doctors and researchers can better understand how the model is making its decisions. This could lead to more reliable and trustworthy AI tools for diagnosing patients.

Technical Explanation

The CheXmask dataset consists of over 32,000 chest X-ray images from multiple medical centers, each with high-quality segmentation masks delineating 17 different anatomical structures. These include the lungs, heart, major blood vessels, ribs, and other key thoracic components.

The segmentation masks were generated through a combination of manual annotations by medical experts and automated deep learning-based segmentation models. The dataset is split into training, validation, and test sets to enable robust model evaluation.

The researchers demonstrate the utility of CheXmask by training state-of-the-art image segmentation models on the dataset and showing significant performance improvements over prior work on chest X-ray segmentation. They also illustrate how the anatomical masks can be used to build more interpretable disease classification models that highlight the relevant anatomical regions.

Critical Analysis

The CheXmask dataset represents an important step forward in enabling more accurate and explainable AI models for chest X-ray analysis. By providing detailed anatomical segmentation, it addresses a key limitation of prior chest X-ray datasets, which typically only provided the raw image data.

That said, the dataset is limited to chest X-rays and does not include other modalities like CT scans. Additionally, the segmentation masks were generated through a combination of manual and automated methods, which could introduce some inconsistencies or errors. Further research is needed to assess the practical utility of the dataset in real-world clinical settings.

It would also be valuable to see the dataset expanded to include a greater diversity of patient demographics and disease states, as the current version may not be fully representative of the broader population. Nonetheless, CheXmask is a significant contribution to the field and should enable substantial progress in developing more reliable and interpretable AI systems for medical image analysis.

Conclusion

The CheXmask dataset provides a large-scale, high-quality resource for training and evaluating machine learning models that analyze chest X-ray images. By including detailed anatomical segmentation masks, it opens up new possibilities for building more accurate and interpretable AI systems for medical diagnosis and disease detection.

This dataset represents an important step forward in the field of medical image analysis, and the researchers have demonstrated its utility through state-of-the-art experiments. While there are some limitations and areas for further research, CheXmask is a valuable addition to the AI toolkit for healthcare, and its impact is likely to be felt across a wide range of clinical applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

CheXmask: a large-scale dataset of anatomical segmentation masks for multi-center chest x-ray images

Nicol'as Gaggion, Candelaria Mosquera, Lucas Mansilla, Julia Mariel Saidman, Martina Aineseder, Diego H. Milone, Enzo Ferrante

The development of successful artificial intelligence models for chest X-ray analysis relies on large, diverse datasets with high-quality annotations. While several databases of chest X-ray images have been released, most include disease diagnosis labels but lack detailed pixel-level anatomical segmentation labels. To address this gap, we introduce an extensive chest X-ray multi-center segmentation dataset with uniform and fine-grain anatomical annotations for images coming from five well-known publicly available databases: ChestX-ray8, Chexpert, MIMIC-CXR-JPG, Padchest, and VinDr-CXR, resulting in 657,566 segmentation masks. Our methodology utilizes the HybridGNet model to ensure consistent and high-quality segmentations across all datasets. Rigorous validation, including expert physician evaluation and automatic quality control, was conducted to validate the resulting masks. Additionally, we provide individualized quality indices per mask and an overall quality estimation per dataset. This dataset serves as a valuable resource for the broader scientific community, streamlining the development and assessment of innovative methodologies in chest X-ray analysis. The CheXmask dataset is publicly available at: https://physionet.org/content/chexmask-cxr-segmentation-data/

5/15/2024

🤿

MS-Twins: Multi-Scale Deep Self-Attention Networks for Medical Image Segmentation

Jing Xu

Although transformer is preferred in natural language processing, some studies has only been applied to the field of medical imaging in recent years. For its long-term dependency, the transformer is expected to contribute to unconventional convolution neural net conquer their inherent spatial induction bias. The lately suggested transformer-based segmentation method only uses the transformer as an auxiliary module to help encode the global context into a convolutional representation. How to optimally integrate self-attention with convolution has not been investigated in depth. To solve the problem, this paper proposes MS-Twins (Multi-Scale Twins), which is a powerful segmentation model on account of the bond of self-attention and convolution. MS-Twins can better capture semantic and fine-grained information by combining different scales and cascading features. Compared with the existing network structure, MS-Twins has made progress on the previous method based on the transformer of two in common use data sets, Synapse and ACDC. In particular, the performance of MS-Twins on Synapse is 8% higher than SwinUNet. Even compared with nnUNet, the best entirely convoluted medical image segmentation network, the performance of MS-Twins on Synapse and ACDC still has a bit advantage.

9/17/2024

🤖

Multi-Dataset Multi-Task Learning for COVID-19 Prognosis

Filippo Ruffini, Lorenzo Tronchin, Zhuoru Wu, Wenting Chen, Paolo Soda, Linlin Shen, Valerio Guarrasi

In the fight against the COVID-19 pandemic, leveraging artificial intelligence to predict disease outcomes from chest radiographic images represents a significant scientific aim. The challenge, however, lies in the scarcity of large, labeled datasets with compatible tasks for training deep learning models without leading to overfitting. Addressing this issue, we introduce a novel multi-dataset multi-task training framework that predicts COVID-19 prognostic outcomes from chest X-rays (CXR) by integrating correlated datasets from disparate sources, distant from conventional multi-task learning approaches, which rely on datasets with multiple and correlated labeling schemes. Our framework hypothesizes that assessing severity scores enhances the model's ability to classify prognostic severity groups, thereby improving its robustness and predictive power. The proposed architecture comprises a deep convolutional network that receives inputs from two publicly available CXR datasets, AIforCOVID for severity prognostic prediction and BRIXIA for severity score assessment, and branches into task-specific fully connected output networks. Moreover, we propose a multi-task loss function, incorporating an indicator function, to exploit multi-dataset integration. The effectiveness and robustness of the proposed approach are demonstrated through significant performance improvements in prognosis classification tasks across 18 different convolutional neural network backbones in different evaluation strategies. This improvement is evident over single-task baselines and standard transfer learning strategies, supported by extensive statistical analysis, showing great application potential.

5/24/2024

Enhancing chest X-ray datasets with privacy-preserving large language models and multi-type annotations: a data-driven approach for improved classification

Ricardo Bigolin Lanfredi, Pritam Mukherjee, Ronald Summers

In chest X-ray (CXR) image analysis, rule-based systems are usually employed to extract labels from reports for dataset releases. However, there is still room for improvement in label quality. These labelers typically output only presence labels, sometimes with binary uncertainty indicators, which limits their usefulness. Supervised deep learning models have also been developed for report labeling but lack adaptability, similar to rule-based systems. In this work, we present MAPLEZ (Medical report Annotations with Privacy-preserving Large language model using Expeditious Zero shot answers), a novel approach leveraging a locally executable Large Language Model (LLM) to extract and enhance findings labels on CXR reports. MAPLEZ extracts not only binary labels indicating the presence or absence of a finding but also the location, severity, and radiologists' uncertainty about the finding. Over eight abnormalities from five test sets, we show that our method can extract these annotations with an increase of 3.6 percentage points (pp) in macro F1 score for categorical presence annotations and more than 20 pp increase in F1 score for the location annotations over competing labelers. Additionally, using the combination of improved annotations and multi-type annotations in classification supervision, we demonstrate substantial advancements in model quality, with an increase of 1.1 pp in AUROC over models trained with annotations from the best alternative approach. We share code and annotations.

8/16/2024