Cervical Auscultation Machine Learning for Dysphagia Assessment

Read original: arXiv:2407.05870 - Published 7/9/2024 by An An Chia, Stacy Lum, Michelle Boo, Rex Tan, Balamurali B T, Jer-Ming Chen

Overview

This study explores the use of machine learning, specifically the Random Forest Classifier, to differentiate normal and pathological swallowing sounds.
The researchers recorded swallows from healthy adults and patients with dysphagia (difficulty swallowing) using a commercially available wearable stethoscope.
The analysis revealed statistically significant differences in certain acoustic features, such as spectral crest and zero-crossing rate, between normal and pathological swallows.
The system demonstrated fair sensitivity (74%) and specificity (89%) in detecting dysphagic (abnormal) swallows, with an overall accuracy of 83% and an F1 score of 78%.

Plain English Explanation

Swallowing is a complex process that involves the coordination of many muscles and nerves. When someone has difficulty swallowing, it's called dysphagia. This can be caused by medical conditions such as stroke, Parkinson's disease, or cancer.

In this study, the researchers wanted to see if they could use machine learning to differentiate between normal and abnormal (pathological) swallowing sounds. They used a wearable stethoscope to record swallowing sounds from both healthy adults and people with dysphagia. Then, they analyzed the recordings and found that certain acoustic features, like the "spectral crest" and "zero-crossing rate," were different between normal and abnormal swallows.

The researchers trained a machine learning model, a Random Forest Classifier, to automatically detect abnormal swallowing sounds. On average, the model correctly identified abnormal swallows about 74% of the time (sensitivity) and normal swallows about 89% of the time (specificity). Overall, the model had an accuracy of 83% and an F1 score of 78%. The F1 score is a single number that balances how many abnormal swallows are caught against how many of the flagged swallows are truly abnormal.

This shows that machine learning can be a useful tool for non-invasive dysphagia assessment, which could help doctors diagnose and monitor swallowing problems more easily. However, the researchers note that there are still some challenges, such as limitations in the sampling rate of the recordings and variability in the model's performance, that need to be addressed with further research.

Technical Explanation

The researchers used a commercially available wearable stethoscope to record swallowing sounds from both healthy adults and patients with dysphagia. They then analyzed the acoustic features of these recordings, such as spectral crest, zero-crossing rate, and other time-domain and frequency-domain characteristics.
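To make the two named features concrete, here is a minimal sketch of how zero-crossing rate and spectral crest are commonly computed. It is not the authors' code: the summary does not give their exact definitions, frame length, or sampling rate, so the function names, the Hann window, the max-over-mean definition of crest, and the 4000 Hz rate are all assumptions, and the signals are synthetic stand-ins for real recordings.

```python
import numpy as np

def zero_crossing_rate(signal: np.ndarray) -> float:
    """Fraction of adjacent sample pairs whose signs differ."""
    signs = np.signbit(signal)
    return float(np.mean(signs[1:] != signs[:-1]))

def spectral_crest(signal: np.ndarray) -> float:
    """Peak of the magnitude spectrum divided by its mean (one common definition)."""
    windowed = signal * np.hanning(len(signal))  # Hann window is an assumption
    magnitude = np.abs(np.fft.rfft(windowed))
    return float(magnitude.max() / (magnitude.mean() + 1e-12))

# Synthetic stand-ins for a recording: a pure tone and white noise.
sample_rate = 4000  # Hz, hypothetical value
t = np.arange(1024) / sample_rate
tone = np.sin(2 * np.pi * 200 * t)
noise = np.random.default_rng(0).standard_normal(t.size)

for name, x in [("tone", tone), ("noise", noise)]:
    print(f"{name}: zcr={zero_crossing_rate(x):.3f}, crest={spectral_crest(x):.1f}")
```

A tonal signal has a low zero-crossing rate and a high spectral crest, while a noise-like signal shows the opposite pattern, which is why these features can help separate sounds with different spectral shapes.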

The analysis revealed statistically significant differences in several acoustic features between normal and pathological swallows. However, no discriminating differences were found between different fluid and diet consistencies.
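A group comparison of this kind can be sketched as follows. The summary does not say which statistical test the authors used, so the Mann-Whitney U test here is an assumption chosen because it makes no normality assumption. The feature values are synthetic and are not data from the study.

```python
import numpy as np
from scipy.stats import mannwhitneyu

rng = np.random.default_rng(42)

# Synthetic zero-crossing rates for illustration only, NOT study data.
normal_zcr = rng.normal(loc=0.10, scale=0.02, size=40)
dysphagic_zcr = rng.normal(loc=0.14, scale=0.03, size=40)

# Two-sided test: do the two groups differ in either direction?
result = mannwhitneyu(normal_zcr, dysphagic_zcr, alternative="two-sided")

alpha = 0.05  # conventional significance threshold, assumed
significant = bool(result.pvalue < alpha)
print(f"U={result.statistic:.1f}, p={result.pvalue:.4g}, significant={significant}")
```

When many features are tested at once, a multiple-comparison correction would normally be applied as well. Whether the study did so is not stated in the summary.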

The researchers then developed a Random Forest Classifier model to automatically distinguish between normal and dysphagic swallows. The model demonstrated a mean sensitivity of 74% ± 8% and a mean specificity of 89% ± 6% for detecting dysphagic swallows. The overall accuracy of the model was 83% ± 3%, and the F1 score was 78% ± 5%.
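The "mean ± SD" figures suggest that the model was evaluated over repeated splits of the data. The sketch below shows one way to produce such numbers with scikit-learn. It is an illustration under assumptions, not the authors' pipeline: the 5-fold stratified scheme, the 100 trees, and the synthetic feature matrix are all hypothetical, so the printed values will not match the paper's.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, f1_score, recall_score
from sklearn.model_selection import StratifiedKFold

# Synthetic feature matrix: label 1 = dysphagic swallow, 0 = normal swallow.
X, y = make_classification(n_samples=300, n_features=10, n_informative=5, random_state=0)

scores = {"sensitivity": [], "specificity": [], "accuracy": [], "f1": []}
folds = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)

for train_idx, test_idx in folds.split(X, y):
    model = RandomForestClassifier(n_estimators=100, random_state=0)
    model.fit(X[train_idx], y[train_idx])
    predicted = model.predict(X[test_idx])
    actual = y[test_idx]
    # Sensitivity is recall on the positive class; specificity is recall on the negative class.
    scores["sensitivity"].append(recall_score(actual, predicted, pos_label=1))
    scores["specificity"].append(recall_score(actual, predicted, pos_label=0))
    scores["accuracy"].append(accuracy_score(actual, predicted))
    scores["f1"].append(f1_score(actual, predicted, pos_label=1))

# Mean and (population) standard deviation across folds.
summary = {name: (float(np.mean(v)), float(np.std(v))) for name, v in scores.items()}
for name, (mean, sd) in summary.items():
    print(f"{name}: {100 * mean:.0f}% ± {100 * sd:.0f}%")
```

The spread across folds is what a figure such as "± 8%" summarizes: a wide spread means performance depends noticeably on which recordings end up in the test split.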

These results suggest that machine learning can be a valuable tool for non-invasive dysphagia assessment. The researchers note that further research is needed to optimize these techniques for clinical use, addressing challenges such as sampling rate limitations and the variability of the model's sensitivity and specificity in differentiating normal and pathological sounds.

Critical Analysis

The study demonstrates the potential of machine learning, specifically the Random Forest Classifier, in non-invasive dysphagia assessment. The researchers were able to identify statistically significant differences in acoustic features between normal and pathological swallowing sounds, which is an important step in developing automated diagnosis tools.

However, the researchers also acknowledge several limitations and challenges that need to be addressed. The sampling rate of the wearable stethoscope used in the study may have been a limiting factor, as it could have missed certain high-frequency components of the swallowing sounds. Additionally, the variability in the model's sensitivity and specificity in discriminating between normal and pathological sounds suggests that further optimization and validation are required.
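The sampling rate concern follows from the Nyquist limit: a recording can only represent frequencies below half its sampling rate. The short sketch below works through the arithmetic. The 4000 Hz rate is a hypothetical value chosen for illustration, since the summary does not state the device's actual sampling rate, and both function names are made up for this example.

```python
def nyquist_frequency(sample_rate_hz: float) -> float:
    """Highest frequency a recording at this sampling rate can represent."""
    return sample_rate_hz / 2.0

def aliased_frequency(tone_hz: float, sample_rate_hz: float) -> float:
    """Apparent frequency of a pure tone sampled without an anti-aliasing filter."""
    folded = tone_hz % sample_rate_hz
    return min(folded, sample_rate_hz - folded)

sample_rate = 4000.0  # Hz, hypothetical device rate
print(nyquist_frequency(sample_rate))        # 2000.0
print(aliased_frequency(1500, sample_rate))  # 1500.0, below the limit, so preserved
print(aliased_frequency(3000, sample_rate))  # 1000.0, above the limit, so misrepresented
```

In practice a device usually filters out content above the Nyquist limit before sampling, so high-frequency components of a swallow are lost rather than aliased. Either way they are unavailable to the classifier.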

Another potential issue is the relatively small sample size, which may not have been representative of the full range of normal and pathological swallowing patterns. Expanding the dataset and testing the model on a more diverse population would help to improve its robustness and generalizability.

The researchers also note the need for further research to explore the relationship between specific acoustic features and the underlying physiological mechanisms of normal and pathological swallowing. This could lead to a better understanding of the causes of dysphagia and inform the development of more targeted interventions.

Overall, the study provides promising initial results, but more work is needed to fully realize the potential of machine learning in non-invasive dysphagia assessment for clinical applications.

Conclusion

This study demonstrates the potential of using machine learning, specifically the Random Forest Classifier, to differentiate normal and pathological swallowing sounds. The researchers were able to identify statistically significant differences in acoustic features between the two groups and develop a model that achieved fair sensitivity, specificity, and overall accuracy in detecting dysphagic swallows.

While the results are promising, the researchers acknowledge several challenges that need to be addressed, such as limitations in the sampling rate of the wearable stethoscope and the variability in the model's performance. Further research is needed to optimize these techniques and improve their reliability and consistency for clinical use.

Overall, this study underscores the value of applying machine learning to non-invasive dysphagia assessment and highlights the need for continued innovation and collaboration between researchers, clinicians, and engineers to address this important healthcare challenge.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Cervical Auscultation Machine Learning for Dysphagia Assessment

An An Chia, Stacy Lum, Michelle Boo, Rex Tan, Balamurali B T, Jer-Ming Chen

This study evaluates the use of machine learning, specifically the Random Forest Classifier, to differentiate normal and pathological swallowing sounds. Employing a commercially available wearable stethoscope, we recorded swallows from both healthy adults and patients with dysphagia. The analysis revealed statistically significant differences in acoustic features, such as spectral crest and zero-crossing rate, between normal and pathological swallows, while no discriminating differences were demonstrated between different fluid and diet consistencies. The system demonstrated fair sensitivity (mean ± SD: 74% ± 8%) and specificity (89% ± 6%) for dysphagic swallows. The model attained an overall accuracy of 83% ± 3%, and F1 score of 78% ± 5%. These results demonstrate that machine learning can be a valuable tool in non-invasive dysphagia assessment, although challenges such as sampling rate limitations and variability in sensitivity and specificity in discriminating between normal and pathological sounds are noted. The study underscores the need for further research to optimize these techniques for clinical use.

7/9/2024

Machine learning-based algorithms for at-home respiratory disease monitoring and respiratory assessment

Negar Orangi-Fard, Alexandru Bogdan, Hersh Sagreiya

Respiratory diseases impose a significant burden on global health, with current diagnostic and management practices primarily reliant on specialist clinical testing. This work aims to develop machine learning-based algorithms to facilitate at-home respiratory disease monitoring and assessment for patients undergoing continuous positive airway pressure (CPAP) therapy. Data were collected from 30 healthy adults, encompassing respiratory pressure, flow, and dynamic thoraco-abdominal circumferential measurements under three breathing conditions: normal, panting, and deep breathing. Various machine learning models, including the random forest classifier, logistic regression, and support vector machine (SVM), were trained to predict breathing types. The random forest classifier demonstrated the highest accuracy, particularly when incorporating breathing rate as a feature. These findings support the potential of AI-driven respiratory monitoring systems to transition respiratory assessments from clinical settings to home environments, enhancing accessibility and patient autonomy. Future work involves validating these models with larger, more diverse populations and exploring additional machine learning techniques.

9/6/2024

COVID-19 Detection System: A Comparative Analysis of System Performance Based on Acoustic Features of Cough Audio Signals

Asmaa Shati, Ghulam Mubashar Hassan, Amitava Datta

A wide range of respiratory diseases, such as cold and flu, asthma, and COVID-19, affect people's daily lives worldwide. In medical practice, respiratory sounds are widely used in medical services to diagnose various respiratory illnesses and lung disorders. The traditional diagnosis of such sounds requires specialized knowledge, which can be costly and reliant on human expertise. Despite this, recent advancements, such as cough audio recordings, have emerged as a means to automate the detection of respiratory conditions. Therefore, this research aims to explore various acoustic features that enhance the performance of machine learning (ML) models in detecting COVID-19 from cough signals. It investigates the efficacy of three feature extraction techniques, including Mel Frequency Cepstral Coefficients (MFCC), Chroma, and Spectral Contrast features, when applied to two machine learning algorithms, Support Vector Machine (SVM) and Multilayer Perceptron (MLP), and therefore proposes an efficient CovCepNet detection system. The proposed system provides a practical solution and demonstrates state-of-the-art classification performance, with an AUC of 0.843 on the COUGHVID dataset and 0.953 on the Virufy dataset for COVID-19 detection from cough audio signals.

6/21/2024

Detecting Throat Cancer from Speech Signals using Machine Learning: A Scoping Literature Review

Mary Paterson, James Moor, Luisa Cutillo

Introduction: Cases of throat cancer are rising worldwide. With survival decreasing significantly at later stages, early detection is vital. Artificial intelligence (AI) and machine learning (ML) have the potential to detect throat cancer from patient speech, facilitating earlier diagnosis and reducing the burden on overstretched healthcare systems. However, no comprehensive review has explored the use of AI and ML for detecting throat cancer from speech. This review aims to fill this gap by evaluating how these technologies perform and identifying issues that need to be addressed in future research. Materials and Methods: We conducted a scoping literature review across three databases: Scopus, Web of Science, and PubMed. We included articles that classified speech using machine learning and specified the inclusion of throat cancer patients in their data. Articles were categorized based on whether they performed binary or multi-class classification. Results: We found 27 articles fitting our inclusion criteria, 12 performing binary classification, 13 performing multi-class classification, and two that do both binary and multi-class classification. The most common classification method used was neural networks, and the most frequently extracted feature was mel-spectrograms. We also documented pre-processing methods and classifier performance. We compared each article against the TRIPOD-AI checklist, which showed a significant lack of open science, with only one article sharing code and only three using open-access data. Conclusion: Open-source code is essential for external validation and further development in this field. Our review indicates that no single method or specific feature consistently outperforms others in detecting throat cancer from speech. Future research should focus on standardizing methodologies and improving the reproducibility of results.

7/25/2024