Stream-based Active Learning for Anomalous Sound Detection in Machine Condition Monitoring

Read original: arXiv:2408.05493 - Published 8/13/2024 by Tuan Vu Ho, Kota Dohi, Yohei Kawaguchi

Stream-based Active Learning for Anomalous Sound Detection in Machine Condition Monitoring

Overview

This paper explores the use of stream-based active learning for anomalous sound detection in machine condition monitoring.
Active learning is a technique where the learning algorithm selects the most informative data samples to label, rather than randomly sampling.
The researchers applied this approach to the problem of anomaly detection in machine sounds, which is important for predictive maintenance.

Plain English Explanation

The paper looks at a way to improve anomalous sound detection in machine monitoring. Usually, machine learning models are trained on a set of labeled data, but getting all that data labeled can be time-consuming and expensive.

Active learning is a technique where the model itself selects the most useful data points to get labeled, rather than randomly sampling. This can make the training process more efficient. The researchers in this paper applied active learning to the problem of detecting abnormal sounds coming from machines, which is important for predictive maintenance.

By having the model choose the most informative sound samples to get labeled, it can learn to identify anomalies faster and with less overall labeling effort. This could lead to better maintenance practices and reduced downtime for industrial machinery.

Technical Explanation

The researchers proposed a stream-based active learning approach for anomalous sound detection. In this setting, the model is presented with a continuous stream of unlabeled sound data, and must selectively request labels for the most informative samples.

The key components of their approach include:

Active Sample Selection: The model uses an uncertainty-based sampling strategy to identify the most informative unlabeled sound samples to request labels for from a human expert.
Online Model Update: After receiving labels for the selected samples, the model is updated incrementally rather than retraining from scratch.
Anomaly Score Estimation: The model outputs an anomaly score for each input sound, indicating how likely it is to be abnormal.

The researchers evaluated their approach on a real-world dataset of machine sounds, and showed that it could achieve strong anomaly detection performance with significantly less labeling effort compared to passive learning.

Critical Analysis

The paper provides a well-designed active learning framework for anomalous sound detection, with several novel components like the online model update. However, a few potential limitations and areas for future work are worth noting:

The active learning strategy is based on model uncertainty, which may not always align with the most informative samples for anomaly detection. Other sampling criteria could be explored.
The experiments were conducted on a single dataset, so the generalizability of the approach to other machine monitoring scenarios is unclear. Further testing on a wider range of datasets would strengthen the conclusions.
The paper does not address the challenge of class imbalance, where anomalous samples are much rarer than normal ones. Handling this effectively is crucial for real-world anomaly detection.

Overall, this is a promising step towards more efficient and robust anomaly detection systems for industrial applications.

Conclusion

This paper presents a stream-based active learning approach for anomalous sound detection in machine condition monitoring. By having the model selectively request labels for the most informative sound samples, the researchers showed it could achieve strong anomaly detection performance with significantly less labeling effort compared to traditional passive learning.

The technical innovations, like online model updating, and the potential real-world impact of more efficient predictive maintenance make this a valuable contribution to the field of industrial machine monitoring. While further research is needed to address certain limitations, this work demonstrates the power of active learning techniques for solving challenging anomaly detection problems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Stream-based Active Learning for Anomalous Sound Detection in Machine Condition Monitoring

Tuan Vu Ho, Kota Dohi, Yohei Kawaguchi

This paper introduces an active learning (AL) framework for anomalous sound detection (ASD) in machine condition monitoring system. Typically, ASD models are trained solely on normal samples due to the scarcity of anomalous data, leading to decreased accuracy for unseen samples during inference. AL is a promising solution to solve this problem by enabling the model to learn new concepts more effectively with fewer labeled examples, thus reducing manual annotation efforts. However, its effectiveness in ASD remains unexplored. To minimize update costs and time, our proposed method focuses on updating the scoring backend of ASD system without retraining the neural network model. Experimental results on the DCASE 2023 Challenge Task 2 dataset confirm that our AL framework significantly improves ASD performance even with low labeling budgets. Moreover, our proposed sampling strategy outperforms other baselines in terms of the partial area under the receiver operating characteristic score.

8/13/2024

🤿

Deep Active Audio Feature Learning in Resource-Constrained Environments

Md Mohaimenuzzaman, Christoph Bergmeir, Bernd Meyer

The scarcity of labelled data makes training Deep Neural Network (DNN) models in bioacoustic applications challenging. In typical bioacoustics applications, manually labelling the required amount of data can be prohibitively expensive. To effectively identify both new and current classes, DNN models must continue to learn new features from a modest amount of fresh data. Active Learning (AL) is an approach that can help with this learning while requiring little labelling effort. Nevertheless, the use of fixed feature extraction approaches limits feature quality, resulting in underutilization of the benefits of AL. We describe an AL framework that addresses this issue by incorporating feature extraction into the AL loop and refining the feature extractor after each round of manual annotation. In addition, we use raw audio processing rather than spectrograms, which is a novel approach. Experiments reveal that the proposed AL framework requires 14.3%, 66.7%, and 47.4% less labelling effort on benchmark audio datasets ESC-50, UrbanSound8k, and InsectWingBeat, respectively, for a large DNN model and similar savings on a microcontroller-based counterpart. Furthermore, we showcase the practical relevance of our study by incorporating data from conservation biology projects. All codes are publicly available on GitHub.

7/2/2024

🤷

Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring

Tomoya Nishida, Noboru Harada, Daisuke Niizumi, Davide Albertini, Roberto Sannino, Simone Pradolini, Filippo Augusti, Keisuke Imoto, Kota Dohi, Harsh Purohit, Takashi Endo, Yohei Kawaguchi

We present the task description of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2024 Challenge Task 2: First-shot unsupervised anomalous sound detection (ASD) for machine condition monitoring. Continuing from last year's DCASE 2023 Challenge Task 2, we organize the task as a first-shot problem under domain generalization required settings. The main goal of the first-shot problem is to enable rapid deployment of ASD systems for new kinds of machines without the need for machine-specific hyperparameter tunings. This problem setting was realized by (1) giving only one section for each machine type and (2) having completely different machine types for the development and evaluation datasets. For the DCASE 2024 Challenge Task 2, data of completely new machine types were newly collected and provided as the evaluation dataset. In addition, attribute information such as the machine operation conditions were concealed for several machine types to mimic situations where such information are unavailable. We will add challenge results and analysis of the submissions after the challenge submission deadline.

6/12/2024

Improving Anomalous Sound Detection via Low-Rank Adaptation Fine-Tuning of Pre-Trained Audio Models

Xinhu Zheng, Anbai Jiang, Bing Han, Yanmin Qian, Pingyi Fan, Jia Liu, Wei-Qiang Zhang

Anomalous Sound Detection (ASD) has gained significant interest through the application of various Artificial Intelligence (AI) technologies in industrial settings. Though possessing great potential, ASD systems can hardly be readily deployed in real production sites due to the generalization problem, which is primarily caused by the difficulty of data collection and the complexity of environmental factors. This paper introduces a robust ASD model that leverages audio pre-trained models. Specifically, we fine-tune these models using machine operation data, employing SpecAug as a data augmentation strategy. Additionally, we investigate the impact of utilizing Low-Rank Adaptation (LoRA) tuning instead of full fine-tuning to address the problem of limited data for fine-tuning. Our experiments on the DCASE2023 Task 2 dataset establish a new benchmark of 77.75% on the evaluation set, with a significant improvement of 6.48% compared with previous state-of-the-art (SOTA) models, including top-tier traditional convolutional networks and speech pre-trained models, which demonstrates the effectiveness of audio pre-trained models with LoRA tuning. Ablation studies are also conducted to showcase the efficacy of the proposed scheme.

9/12/2024