Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge

2310.16112

Published 4/3/2024 by Gregory Holste, Yiliang Zhou, Song Wang, Ajay Jaiswal, Mingquan Lin, Sherry Zhuge, Yuzhe Yang, Dongkyun Kim, Trong-Hieu Nguyen-Mau, Minh-Triet Tran and 15 others

cs.CV

🏷️

Abstract

Many real-world image recognition problems, such as diagnostic medical imaging exams, are long-tailed $unicode{x2013}$ there are a few common findings followed by many more relatively rare conditions. In chest radiography, diagnosis is both a long-tailed and multi-label problem, as patients often present with multiple findings simultaneously. While researchers have begun to study the problem of long-tailed learning in medical image recognition, few have studied the interaction of label imbalance and label co-occurrence posed by long-tailed, multi-label disease classification. To engage with the research community on this emerging topic, we conducted an open challenge, CXR-LT, on long-tailed, multi-label thorax disease classification from chest X-rays (CXRs). We publicly release a large-scale benchmark dataset of over 350,000 CXRs, each labeled with at least one of 26 clinical findings following a long-tailed distribution. We synthesize common themes of top-performing solutions, providing practical recommendations for long-tailed, multi-label medical image classification. Finally, we use these insights to propose a path forward involving vision-language foundation models for few- and zero-shot disease classification.

Create account to get full access

Overview

Real-world image recognition problems, such as medical diagnostics, often have a "long-tailed" distribution - a few common findings and many rare conditions.
Chest X-ray diagnosis is a long-tailed and multi-label problem, as patients can present with multiple findings simultaneously.
Researchers have begun studying long-tailed learning in medical image recognition, but few have examined the interaction of label imbalance and co-occurrence in long-tailed, multi-label disease classification.

Plain English Explanation

Imagine you're a doctor looking at chest X-rays to diagnose patients. In the real world, you'd see a lot of common conditions, like pneumonia or a broken rib, but also many rare diseases that only show up occasionally. This "long-tailed" distribution makes it challenging to build AI systems that can accurately recognize all the different possibilities.

It gets even more complicated when patients have multiple issues at the same time. For example, someone might have both pneumonia and a collapsed lung. Recognizing these co-occurring problems is crucial for making the right diagnosis, but it adds an extra layer of complexity.

Researchers are starting to explore how to train AI models to handle this kind of long-tailed, multi-label medical imaging data. But there's still a lot to learn about the best ways to approach the problem.

Technical Explanation

The researchers conducted an open challenge, called CXR-LT, to study long-tailed, multi-label thorax disease classification from chest X-rays. They assembled a large dataset of over 350,000 chest X-rays, each labeled with at least one of 26 different clinical findings following a long-tailed distribution.

The top-performing solutions in the challenge provided insights into practical techniques for this type of medical image classification task. The researchers synthesized common themes from the successful approaches.

Finally, the researchers propose using vision-language foundation models as a promising path forward for few-shot and zero-shot disease classification, where the model can learn to recognize rare conditions from limited examples or even without any training data.

Critical Analysis

The paper provides a valuable contribution by highlighting the important, but understudied, challenge of long-tailed, multi-label disease classification in medical imaging. The open dataset and challenge are useful resources for the research community.

However, the paper does not delve deeply into the specific techniques or architectures used by the top-performing teams. More detailed technical insights would be helpful for other researchers looking to build on this work.

Additionally, the paper focuses on the computer vision aspects of the problem, but medical diagnosis involves integrating multiple sources of information beyond just the images. Incorporating other clinical data, such as patient history and symptoms, could further improve the accuracy and real-world applicability of these AI systems.

Conclusion

This research highlights the important, but often overlooked, challenge of long-tailed, multi-label disease classification in medical imaging. By releasing a large-scale benchmark dataset and synthesizing insights from a open challenge, the researchers have advanced the state of the art in this critical area. Their proposal of using vision-language foundation models is a promising direction that could lead to more robust and flexible AI systems for medical diagnosis.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🔄

Expanding the Horizon: Enabling Hybrid Quantum Transfer Learning for Long-Tailed Chest X-Ray Classification

Skylar Chan, Pranav Kulkarni, Paul H. Yi, Vishwa S. Parekh

Quantum machine learning (QML) has the potential for improving the multi-label classification of rare, albeit critical, diseases in large-scale chest x-ray (CXR) datasets due to theoretical quantum advantages over classical machine learning (CML) in sample efficiency and generalizability. While prior literature has explored QML with CXRs, it has focused on binary classification tasks with small datasets due to limited access to quantum hardware and computationally expensive simulations. To that end, we implemented a Jax-based framework that enables the simulation of medium-sized qubit architectures with significant improvements in wall-clock time over current software offerings. We evaluated the performance of our Jax-based framework in terms of efficiency and performance for hybrid quantum transfer learning for long-tailed classification across 8, 14, and 19 disease labels using large-scale CXR datasets. The Jax-based framework resulted in up to a 58% and 95% speed-up compared to PyTorch and TensorFlow implementations, respectively. However, compared to CML, QML demonstrated slower convergence and an average AUROC of 0.70, 0.73, and 0.74 for the classification of 8, 14, and 19 CXR disease labels. In comparison, the CML models had an average AUROC of 0.77, 0.78, and 0.80 respectively. In conclusion, our work presents an accessible implementation of hybrid quantum transfer learning for long-tailed CXR classification with a computationally efficient Jax-based framework.

5/2/2024

cs.CV cs.AI cs.LG

Large-scale Long-tailed Disease Diagnosis on Radiology Images

Qiaoyu Zheng, Weike Zhao, Chaoyi Wu, Xiaoman Zhang, Lisong Dai, Hengyu Guan, Yuehua Li, Ya Zhang, Yanfeng Wang, Weidi Xie

Developing a generalist radiology diagnosis system can greatly enhance clinical diagnostics. In this paper, we introduce RadDiag, a foundational model supporting 2D and 3D inputs across various modalities and anatomies, using a transformer-based fusion module for comprehensive disease diagnosis. Due to patient privacy concerns and the lack of large-scale radiology diagnosis datasets, we utilize high-quality, clinician-reviewed radiological images available online with diagnosis labels. Our dataset, RP3D-DiagDS, contains 40,936 cases with 195,010 scans covering 5,568 disorders (930 unique ICD-10-CM codes). Experimentally, our RadDiag achieves 95.14% AUC on internal evaluation with the knowledge-enhancement strategy. Additionally, RadDiag can be zero-shot applied or fine-tuned to external diagnosis datasets sourced from various hospitals, demonstrating state-of-the-art results. In conclusion, we show that publicly shared medical data on the Internet is a tremendous and valuable resource that can potentially support building a generalist AI for healthcare.

6/18/2024

cs.CV

🤖

Multi-Dataset Multi-Task Learning for COVID-19 Prognosis

Filippo Ruffini, Lorenzo Tronchin, Zhuoru Wu, Wenting Chen, Paolo Soda, Linlin Shen, Valerio Guarrasi

In the fight against the COVID-19 pandemic, leveraging artificial intelligence to predict disease outcomes from chest radiographic images represents a significant scientific aim. The challenge, however, lies in the scarcity of large, labeled datasets with compatible tasks for training deep learning models without leading to overfitting. Addressing this issue, we introduce a novel multi-dataset multi-task training framework that predicts COVID-19 prognostic outcomes from chest X-rays (CXR) by integrating correlated datasets from disparate sources, distant from conventional multi-task learning approaches, which rely on datasets with multiple and correlated labeling schemes. Our framework hypothesizes that assessing severity scores enhances the model's ability to classify prognostic severity groups, thereby improving its robustness and predictive power. The proposed architecture comprises a deep convolutional network that receives inputs from two publicly available CXR datasets, AIforCOVID for severity prognostic prediction and BRIXIA for severity score assessment, and branches into task-specific fully connected output networks. Moreover, we propose a multi-task loss function, incorporating an indicator function, to exploit multi-dataset integration. The effectiveness and robustness of the proposed approach are demonstrated through significant performance improvements in prognosis classification tasks across 18 different convolutional neural network backbones in different evaluation strategies. This improvement is evident over single-task baselines and standard transfer learning strategies, supported by extensive statistical analysis, showing great application potential.

5/24/2024

eess.IV cs.CV cs.LG

🏷️

Low-Resolution Chest X-ray Classification via Knowledge Distillation and Multi-task Learning

Yasmeena Akhter, Rishabh Ranjan, Richa Singh, Mayank Vatsa

This research addresses the challenges of diagnosing chest X-rays (CXRs) at low resolutions, a common limitation in resource-constrained healthcare settings. High-resolution CXR imaging is crucial for identifying small but critical anomalies, such as nodules or opacities. However, when images are downsized for processing in Computer-Aided Diagnosis (CAD) systems, vital spatial details and receptive fields are lost, hampering diagnosis accuracy. To address this, this paper presents the Multilevel Collaborative Attention Knowledge (MLCAK) method. This approach leverages the self-attention mechanism of Vision Transformers (ViT) to transfer critical diagnostic knowledge from high-resolution images to enhance the diagnostic efficacy of low-resolution CXRs. MLCAK incorporates local pathological findings to boost model explainability, enabling more accurate global predictions in a multi-task framework tailored for low-resolution CXR analysis. Our research, utilizing the Vindr CXR dataset, shows a considerable enhancement in the ability to diagnose diseases from low-resolution images (e.g. 28 x 28), suggesting a critical transition from the traditional reliance on high-resolution imaging (e.g. 224 x 224).

5/24/2024

eess.IV cs.CV cs.LG