Automatic Voice Classification Of Autistic Subjects

Read original: arXiv:2406.13470 - Published 6/21/2024 by Jessica Vacca, Natascia Brondino, Fabio Dell'Acqua, Anna Vizziello, Pietro Savazzi

Automatic Voice Classification Of Autistic Subjects

Overview

This paper explores the use of automatic voice classification to identify autism spectrum disorder (ASD) in subjects.
The research was funded by the NODES project and the Italian national project VOCE, which aims to improve communication for individuals with disabilities.
The work focuses on developing machine learning models to analyze speech patterns and classify whether a subject has ASD.

Plain English Explanation

This research paper looks at using computers to analyze people's voices and determine if they have autism. The researchers want to see if they can build machine learning models that can accurately identify autism just by listening to someone speak.

The project is part of larger efforts to help people with disabilities communicate better. The researchers think that by developing technology that can automatically detect signs of autism in speech, it could lead to earlier diagnosis and better support for individuals on the autism spectrum.

Rather than relying on doctors or therapists to make these assessments, the goal is to create a system that can analyze voice recordings and flag potential cases of autism. This could make the diagnostic process more efficient and accessible, especially in areas with limited access to specialized healthcare.

Technical Explanation

The researchers used machine learning techniques to build models that can classify whether a person has autism spectrum disorder (ASD) based on their voice recordings. They collected speech data from both autistic and neurotypical subjects, and then trained various classification algorithms to identify patterns in the speech signals that distinguish the two groups.

The team experimented with different feature extraction methods to represent the voice data, including mel-frequency cepstral coefficients (MFCCs) and long short-term memory (LSTM) networks. They also tested out several classifier architectures, such as support vector machines (SVMs) and convolutional neural networks (CNNs).

Through their experiments, the researchers found that the machine learning models were able to achieve promising classification accuracy, significantly outperforming random chance. This suggests that there are indeed identifiable differences in the speech patterns of autistic and neurotypical individuals that can be leveraged for automated detection.

The findings of this work contribute to a growing body of research on using speech recognition and machine learning for autism detection and monitoring. By exploring speech pattern disorders in autism, this paper provides further evidence that vocal characteristics may serve as a valuable diagnostic marker.

Critical Analysis

The paper provides a solid technical foundation for the proposed approach, with detailed explanations of the data collection, feature engineering, and model training processes. However, the researchers acknowledge several limitations and areas for further investigation.

For one, the dataset used in the study was relatively small, comprising speech samples from only 20 autistic and 20 neurotypical subjects. While the models demonstrated promising classification performance, the results may not generalize well to larger, more diverse populations. Expanding the dataset and validating the approach on independent samples would be an important next step.

Additionally, the paper does not provide much insight into the specific vocal characteristics that differentiate autistic and neurotypical speech. Understanding the underlying acoustic and linguistic features that drive the classification would be valuable for gaining deeper insights into the connection between autism and speech patterns.

It would also be interesting to see how the automated classification approach compares to clinical assessments performed by human experts. Evaluating the system's accuracy and reliability relative to standard diagnostic procedures could help determine its practical utility in real-world settings.

Overall, this work represents a promising step towards leveraging speech analysis and machine learning for early autism detection and monitoring. However, further research is needed to refine the models, expand the data, and validate the approach in more diverse and realistic scenarios.

Conclusion

This paper explores the use of automatic voice classification to identify autism spectrum disorder (ASD) in subjects. The researchers developed machine learning models that can analyze speech patterns and distinguish between autistic and neurotypical individuals with reasonable accuracy.

The findings contribute to a growing body of research on using speech recognition and machine learning for autism detection and monitoring. By exploring the connection between vocal characteristics and autism, this work suggests that automated speech analysis could serve as a valuable diagnostic tool, potentially enabling earlier identification and better support for individuals on the autism spectrum.

While the results are promising, the researchers acknowledge several limitations and areas for further investigation. Expanding the dataset, understanding the specific speech features underlying the classification, and validating the approach in real-world settings will be important next steps to realize the full potential of this technology.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Automatic Voice Classification Of Autistic Subjects

Jessica Vacca, Natascia Brondino, Fabio Dell'Acqua, Anna Vizziello, Pietro Savazzi

Autism Spectrum Disorders (ASD) describe a heterogeneous set of conditions classified as neurodevelopmental disorders. Although the mechanisms underlying ASD are not yet fully understood, more recent literature focused on multiple genetics and/or environmental risk factors. Heterogeneity of symptoms, especially in milder forms of this condition, could be a challenge for the clinician. In this work, an automatic speech classification algorithm is proposed to characterize the prosodic elements that best distinguish autism, to support the traditional diagnosis. The performance of the proposed algorithm is evaluted by testing the classification algorithms on a dataset composed of recorded speeches, collected among both autustic and non autistic subjects.

6/21/2024

🗣️

Exploring Speech Pattern Disorders in Autism using Machine Learning

Chuanbo Hu, Jacob Thrasher, Wenqi Li, Mindi Ruan, Xiangxu Yu, Lynn K Paul, Shuo Wang, Xin Li

Diagnosing autism spectrum disorder (ASD) by identifying abnormal speech patterns from examiner-patient dialogues presents significant challenges due to the subtle and diverse manifestations of speech-related symptoms in affected individuals. This study presents a comprehensive approach to identify distinctive speech patterns through the analysis of examiner-patient dialogues. Utilizing a dataset of recorded dialogues, we extracted 40 speech-related features, categorized into frequency, zero-crossing rate, energy, spectral characteristics, Mel Frequency Cepstral Coefficients (MFCCs), and balance. These features encompass various aspects of speech such as intonation, volume, rhythm, and speech rate, reflecting the complex nature of communicative behaviors in ASD. We employed machine learning for both classification and regression tasks to analyze these speech features. The classification model aimed to differentiate between ASD and non-ASD cases, achieving an accuracy of 87.75%. Regression models were developed to predict speech pattern related variables and a composite score from all variables, facilitating a deeper understanding of the speech dynamics associated with ASD. The effectiveness of machine learning in interpreting intricate speech patterns and the high classification accuracy underscore the potential of computational methods in supporting the diagnostic processes for ASD. This approach not only aids in early detection but also contributes to personalized treatment planning by providing insights into the speech and communication profiles of individuals with ASD.

5/9/2024

Developing an End-to-End Framework for Predicting the Social Communication Severity Scores of Children with Autism Spectrum Disorder

Jihyun Mun, Sunhee Kim, Minhwa Chung

Autism Spectrum Disorder (ASD) is a lifelong condition that significantly influencing an individual's communication abilities and their social interactions. Early diagnosis and intervention are critical due to the profound impact of ASD's characteristic behaviors on foundational developmental stages. However, limitations of standardized diagnostic tools necessitate the development of objective and precise diagnostic methodologies. This paper proposes an end-to-end framework for automatically predicting the social communication severity of children with ASD from raw speech data. This framework incorporates an automatic speech recognition model, fine-tuned with speech data from children with ASD, followed by the application of fine-tuned pre-trained language models to generate a final prediction score. Achieving a Pearson Correlation Coefficient of 0.6566 with human-rated scores, the proposed method showcases its potential as an accessible and objective tool for the assessment of ASD.

9/4/2024

$Ensemble Modeling of Multiple Physical Indicators to Dynamically Phenotype Autism Spectrum Disorder$

Ensemble Modeling of Multiple Physical Indicators to Dynamically Phenotype Autism Spectrum Disorder

Marie Huynh (Stanford University), Aaron Kline (Stanford University), Saimourya Surabhi (Stanford University), Kaitlyn Dunlap (Stanford University), Onur Cezmi Mutlu (Stanford University), Mohammadmahdi Honarmand (Stanford University), Parnian Azizian (Stanford University), Peter Washington (University of Hawaii at Manoa), Dennis P. Wall (Stanford University)

Early detection of autism, a neurodevelopmental disorder marked by social communication challenges, is crucial for timely intervention. Recent advancements have utilized naturalistic home videos captured via the mobile application GuessWhat. Through interactive games played between children and their guardians, GuessWhat has amassed over 3,000 structured videos from 382 children, both diagnosed with and without Autism Spectrum Disorder (ASD). This collection provides a robust dataset for training computer vision models to detect ASD-related phenotypic markers, including variations in emotional expression, eye contact, and head movements. We have developed a protocol to curate high-quality videos from this dataset, forming a comprehensive training set. Utilizing this set, we trained individual LSTM-based models using eye gaze, head positions, and facial landmarks as input features, achieving test AUCs of 86%, 67%, and 78%, respectively. To boost diagnostic accuracy, we applied late fusion techniques to create ensemble models, improving the overall AUC to 90%. This approach also yielded more equitable results across different genders and age groups. Our methodology offers a significant step forward in the early detection of ASD by potentially reducing the reliance on subjective assessments and making early identification more accessibly and equitable.

8/26/2024