Towards Enhanced Classification of Abnormal Lung sound in Multi-breath: A Light Weight Multi-label and Multi-head Attention Classification Method

Read original: arXiv:2407.10828 - Published 7/16/2024 by Yi-Wei Chua, Yun-Chien Cheng

Overview

This study aims to develop an auxiliary diagnostic system for classifying abnormal lung respiratory sounds.
It uses a multi-label learning approach and multi-head attention mechanism to enhance the accuracy of automatic abnormal breath sound classification.
The study addresses the issue of class imbalance and lack of diversity in existing respiratory sound datasets by employing a lightweight and highly accurate model.
The method uses a two-dimensional label set to represent multiple respiratory sound characteristics and achieved a 59.2% ICBHI score on the ICBHI2017 dataset.

Plain English Explanation

This research focuses on creating a system to help identify abnormal lung sounds, which can be an important diagnostic tool for respiratory conditions. The researchers used a multi-label learning approach and a multi-head attention mechanism to improve the accuracy of automatically detecting abnormal breath sounds.

One of the challenges with existing respiratory sound datasets is that they can be imbalanced (with some types of sounds under-represented) and lack diversity. To address this, the researchers developed a lightweight but highly accurate model that uses a two-dimensional label system to capture multiple characteristics of the respiratory sounds.

The model achieved an ICBHI score of 59.2% in the four-category task on the ICBHI2017 dataset. The ICBHI score is the average of sensitivity and specificity, so it rewards a model both for catching abnormal sounds and for correctly recognizing normal ones. A system like this could be a valuable aid for clinicians, since more accurate automatic detection of abnormal respiratory sounds can support better patient care and outcomes.

Technical Explanation

The researchers employed a multi-label learning approach and a multi-head attention mechanism to enhance the accuracy of automatic abnormal breath sound classification. This innovative technique allows the model to simultaneously recognize multiple respiratory sound characteristics, addressing the issue of class imbalance and lack of diversity often seen in existing respiratory sound datasets.
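To make the combination of multi-head attention and multi-label outputs more concrete, the following is a minimal sketch in plain Python. It is not the authors' architecture: the function names, the dimensions, the mean pooling over time, and the random weights are all assumptions chosen for illustration, and the usual output projection after the heads is omitted for brevity. The point is only to show several attention heads attending over a sequence of audio frames, followed by two independent sigmoid outputs (one per label) rather than a single softmax over classes.

```python
import math
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def rand_matrix(rows, cols, rng):
    """Random weights stand in for trained parameters (illustration only)."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

def multi_head_self_attention(x, wq, wk, wv, num_heads):
    """x is frames x d_model; returns frames x d_model (heads concatenated)."""
    d_model = len(x[0])
    assert d_model % num_heads == 0, "d_model must be divisible by num_heads"
    d_head = d_model // num_heads
    q, k, v = matmul(x, wq), matmul(x, wk), matmul(x, wv)
    out = [[] for _ in x]
    for h in range(num_heads):
        lo, hi = h * d_head, (h + 1) * d_head
        qh = [row[lo:hi] for row in q]
        kh = [row[lo:hi] for row in k]
        vh = [row[lo:hi] for row in v]
        k_t = [list(col) for col in zip(*kh)]  # transpose of this head's keys
        scores = matmul(qh, k_t)               # frames x frames similarities
        weights = [softmax([s / math.sqrt(d_head) for s in row]) for row in scores]
        head_out = matmul(weights, vh)         # weighted sum of values
        for t, row in enumerate(head_out):
            out[t].extend(row)                 # concatenate heads per frame
    return out

def predict_labels(x, params, num_heads=2):
    """Return one independent probability per label (multi-label output)."""
    attended = multi_head_self_attention(
        x, params["wq"], params["wk"], params["wv"], num_heads
    )
    pooled = [sum(col) / len(attended) for col in zip(*attended)]  # mean over time
    logits = matmul([pooled], params["w_out"])[0]
    return [1.0 / (1.0 + math.exp(-z)) for z in logits]  # sigmoid per label

rng = random.Random(0)
d_model, frames = 4, 6  # hypothetical sizes
params = {
    "wq": rand_matrix(d_model, d_model, rng),
    "wk": rand_matrix(d_model, d_model, rng),
    "wv": rand_matrix(d_model, d_model, rng),
    "w_out": rand_matrix(d_model, 2, rng),  # two labels in the label set
}
x = rand_matrix(frames, d_model, rng)  # stand-in for per-frame audio features
probs = predict_labels(x, params)
print(probs)
```

Because each output passes through its own sigmoid, the two probabilities do not have to sum to one, which is what lets a single recording be flagged for more than one sound characteristic at the same time.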

The model uses a lightweight and highly accurate architecture, leveraging a two-dimensional label set to represent the various respiratory sound attributes. This approach was evaluated on the ICBHI2017 dataset, where the method achieved a 59.2% ICBHI score in the four-category task.
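The relationship between a two-dimensional label set and the four-category ICBHI task can be sketched as follows. The label layout (crackle, wheeze) is an assumption based on the four ICBHI2017 categories (normal, crackle, wheeze, both), not something this summary confirms; the helper names and the toy data are hypothetical. The score function follows the standard ICBHI definition: specificity is the fraction of normal cycles predicted as normal, sensitivity is the fraction of abnormal cycles assigned their exact abnormal class, and the ICBHI score is their average.

```python
# Assumed label layout: (crackle, wheeze), each 0 or 1.
LABEL_TO_CLASS = {
    (0, 0): "normal",
    (1, 0): "crackle",
    (0, 1): "wheeze",
    (1, 1): "both",
}

def to_class(probs, threshold=0.5):
    """Threshold two per-label probabilities and map them to one of four classes."""
    key = tuple(int(p >= threshold) for p in probs)
    return LABEL_TO_CLASS[key]

def icbhi_score(y_true, y_pred):
    """Return (score, sensitivity, specificity) for four-class predictions."""
    normal_total = sum(1 for t in y_true if t == "normal")
    abnormal_total = len(y_true) - normal_total
    if normal_total == 0 or abnormal_total == 0:
        raise ValueError("need at least one normal and one abnormal sample")
    normal_hits = sum(
        1 for t, p in zip(y_true, y_pred) if t == "normal" and p == "normal"
    )
    abnormal_hits = sum(
        1 for t, p in zip(y_true, y_pred) if t != "normal" and p == t
    )
    specificity = normal_hits / normal_total
    sensitivity = abnormal_hits / abnormal_total
    return (sensitivity + specificity) / 2, sensitivity, specificity

# Toy data for illustration only; these are not results from the paper.
y_true = ["normal", "normal", "crackle", "wheeze", "both", "normal"]
label_probs = [(0.1, 0.2), (0.7, 0.1), (0.9, 0.3), (0.2, 0.4), (0.8, 0.6), (0.3, 0.1)]
y_pred = [to_class(p) for p in label_probs]

score, sensitivity, specificity = icbhi_score(y_true, y_pred)
print(y_pred)
print(score, sensitivity, specificity)
```

Note that the "both" class never needs its own output unit in this formulation: it arises whenever both label probabilities cross the threshold.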

Critical Analysis

The paper acknowledges the limitations of existing respiratory sound datasets, which can be imbalanced and lack diversity. The researchers' use of a multi-label learning approach and multi-head attention mechanism appears to be a promising solution to address these challenges, leading to improved accuracy in abnormal lung sound classification.

However, the paper does not provide a detailed analysis of the model's performance on specific respiratory sound classes or discuss potential biases in the ICBHI2017 dataset. Additionally, the researchers could have explored the model's generalizability by evaluating its performance on other respiratory sound datasets.
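One simple form such a per-class analysis could take is a recall breakdown over the four categories. The sketch below uses a hypothetical helper and made-up toy predictions, not figures from the paper; it only illustrates how a single aggregate score can hide weak performance on a rare class.

```python
from collections import Counter

def per_class_recall(y_true, y_pred):
    """Fraction of samples of each true class that were predicted correctly."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {cls: hits[cls] / totals[cls] for cls in totals}

# Toy data for illustration only; these are not results from the paper.
toy_true = ["normal", "normal", "normal", "crackle", "crackle", "wheeze", "both"]
toy_pred = ["normal", "normal", "crackle", "crackle", "normal", "wheeze", "wheeze"]

recall = per_class_recall(toy_true, toy_pred)
for cls, value in recall.items():
    print(f"{cls}: {value:.2f}")
```

In this toy case the "both" class is never recognized even though the other classes look reasonable, which is the kind of detail a per-class report would surface.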

Further research is needed to understand the model's robustness and identify any potential limitations or edge cases. Expanding the evaluation to include clinical trials or real-world deployment scenarios would also help validate the system's practical utility.

Conclusion

This study presents an innovative approach to improving the accuracy of automatic abnormal lung sound classification, using a multi-label learning technique and multi-head attention mechanism. By addressing the issues of class imbalance and lack of diversity in existing respiratory sound datasets, the researchers have developed a lightweight and highly accurate model that could have significant implications for clinical applications.

The findings of this study contribute to the growing body of research on respiratory sound analysis and multi-label learning, paving the way for more accurate and efficient diagnostic tools for respiratory conditions.

Related Papers

Towards Enhanced Classification of Abnormal Lung sound in Multi-breath: A Light Weight Multi-label and Multi-head Attention Classification Method

Yi-Wei Chua, Yun-Chien Cheng

This study aims to develop an auxiliary diagnostic system for classifying abnormal lung respiratory sounds, enhancing the accuracy of automatic abnormal breath sound classification through an innovative multi-label learning approach and multi-head attention mechanism. Addressing the issue of class imbalance and lack of diversity in existing respiratory sound datasets, our study employs a lightweight and highly accurate model, using a two-dimensional label set to represent multiple respiratory sound characteristics. Our method achieved a 59.2% ICBHI score in the four-category task on the ICBHI2017 dataset, demonstrating its advantages in terms of lightweight and high accuracy. This study not only improves the accuracy of automatic diagnosis of lung respiratory sound abnormalities but also opens new possibilities for clinical applications.

7/16/2024

Multi-Task Learning for Lung sound & Lung disease classification

Suma K V, Deepali Koppad, Preethi Kumar, Neha A Kantikar, Surabhi Ramesh

In recent years, advancements in deep learning techniques have considerably enhanced the efficiency and accuracy of medical diagnostics. In this work, a novel approach using multi-task learning (MTL) for the simultaneous classification of lung sounds and lung diseases is proposed. Our proposed model leverages MTL with four different deep learning models, namely 2D CNN, ResNet50, MobileNet and DenseNet, to extract relevant features from the lung sound recordings. The ICBHI 2017 Respiratory Sound Database was employed in the current study. The MTL for MobileNet model performed better than the other models considered, with an accuracy of 74% for lung sound analysis and 91% for lung diseases classification. Results of the experimentation demonstrate the efficacy of our approach in classifying both lung sounds and lung diseases concurrently. In this study, using the demographic data of the patients from the database, risk level computation for Chronic Obstructive Pulmonary Disease is also carried out. For this computation, three machine learning algorithms, namely Logistic Regression, SVM and Random Forest classifiers, were employed. Among these ML algorithms, the Random Forest classifier had the highest accuracy of 92%. This work helps in considerably reducing the physician's burden of not just diagnosing the pathology but also effectively communicating to the patient about the possible causes or outcomes.

4/8/2024

Improving Robustness and Clinical Applicability of Respiratory Sound Classification via Audio Enhancement

Jing-Tong Tzeng, Jeng-Lin Li, Huan-Yu Chen, Chun-Hsiang Huang, Chi-Hsin Chen, Cheng-Yi Fan, Edward Pei-Chuan Huang, Chi-Chun Lee

Deep learning techniques have shown promising results in the automatic classification of respiratory sounds. However, accurately distinguishing these sounds in real-world noisy conditions poses challenges for clinical deployment. Additionally, predicting signals with only background noise could undermine user trust in the system. In this study, we propose an audio enhancement (AE) pipeline as a pre-processing step before respiratory sound classification, aiming to improve performance in noisy environments. Multiple experiments were conducted using different audio enhancement model structures, demonstrating improved classification performance compared to the baseline method of noise injection data augmentation. Specifically, the integration of the AE pipeline resulted in a 2.59% increase in the ICBHI classification score on the ICBHI respiratory sound dataset and a 2.51% improvement on our recently collected Formosa Archive of Breath Sounds (FABS) in multi-class noisy scenarios. Furthermore, a physician validation study assessed the clinical utility of our system. Quantitative analysis revealed enhancements in efficiency, diagnostic confidence, and trust during model-assisted diagnosis with our system compared to raw noisy recordings. Workflows integrating enhanced audio led to an 11.61% increase in diagnostic sensitivity and facilitated high-confidence diagnoses. Our findings demonstrate that incorporating an audio enhancement algorithm significantly enhances robustness and clinical utility.

7/22/2024

Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer

Whenty Ariyanti, Kai-Chun Liu, Kuan-Yu Chen, Yu Tsao

Respiratory disease, the third leading cause of deaths globally, is considered a high-priority ailment requiring significant research on identification and treatment. Stethoscope-recorded lung sounds and artificial intelligence-powered devices have been used to identify lung disorders and aid specialists in making accurate diagnoses. In this study, audio-spectrogram vision transformer (AS-ViT), a new approach for identifying abnormal respiration sounds, was developed. The sounds of the lungs are converted into visual representations called spectrograms using a technique called short-time Fourier transform (STFT). These images are then analyzed using a model called vision transformer to identify different types of respiratory sounds. The classification was carried out using the ICBHI 2017 database, which includes various types of lung sounds with different frequencies, noise levels, and backgrounds. The proposed AS-ViT method was evaluated using three metrics and achieved 79.1% and 59.8% for 60:40 split ratio and 86.4% and 69.3% for 80:20 split ratio in terms of unweighted average recall and overall scores respectively for respiratory sound detection, surpassing previous state-of-the-art results.

5/15/2024