Pseudo Replay-based Class Continual Learning for Online New Category Anomaly Detection in Additive Manufacturing

Read original: arXiv:2312.02491 - Published 9/5/2024 by Yuxuan Li, Tianxin Xie, Chenang Liu, Zhangyue Shi

❗

Overview

Modern manufacturing enterprises use sensors and machine learning to detect anomalies in their processes.
A challenge is that new types of defects can emerge over time, causing performance issues for previously trained models.
Continual learning methods can help models learn continuously, but memory-based approaches face data storage constraints.

Plain English Explanation

Continual Learning is a machine learning technique that allows models to adapt and learn new information over time, without forgetting what they've learned before. This is particularly important in manufacturing, where new types of defects can start appearing as processes evolve.

The paper proposes a novel pseudo replay-based continual learning framework that can generate high-quality data representing previous classes. This allows the model to be trained incrementally on new data without needing to store all the historical data, overcoming the data storage constraints of traditional memory-based continual learning methods.

The key idea is to combine class incremental learning (learning new classes over time) with oversampling-based data generation. This enables the model to learn about new defect types while maintaining its performance on previous ones, without requiring all the original training data to be kept.

Technical Explanation

The paper develops a continual learning framework that integrates class incremental learning and oversampling-based data generation. This allows the model to learn about new anomaly classes as they arise, without forgetting how to detect previous ones.

The framework works as follows:

When a new anomaly class emerges, the model is fine-tuned on the new data.
To prevent forgetting previous classes, the framework generates synthetic data points representing those classes using an oversampling technique.
The model is then trained on the combined real new data and synthetic previous data.

This approach avoids the need to store all historical training data, which is a key limitation of traditional memory-based continual learning methods. The generated synthetic data effectively "replays" the previous classes during training.

The effectiveness of the proposed framework is validated through three case studies focused on supervised classification for anomaly detection. The results show the method can detect novel anomalies while maintaining good performance on previous tasks, and provides more flexibility in model architecture.

Critical Analysis

The paper presents a promising approach to the important problem of continual learning in manufacturing anomaly detection. By combining class incremental learning and oversampling-based data generation, the framework can overcome the data storage constraints of memory-based continual learning.

However, the paper does not extensively discuss potential limitations or areas for further research. For example, it would be valuable to understand how the synthetic data generation technique performs compared to alternative approaches, or how the framework scales as the number of anomaly classes grows over time.

Additionally, the paper focuses on supervised classification tasks, but many real-world manufacturing anomaly detection problems may involve more complex, multi-modal data and models. Exploring the applicability of the pseudo replay-based approach to these more challenging settings could be an important avenue for future work.

Overall, the research represents a valuable contribution to the field of continual learning, with the potential to significantly improve the adaptability and long-term performance of anomaly detection systems in manufacturing.

Conclusion

This paper presents a novel continual learning framework that can effectively detect new types of manufacturing defects over time, without forgetting how to identify previous anomalies. By generating synthetic data to "replay" past classes during training, the approach overcomes the data storage limitations of traditional memory-based continual learning methods.

The experimental results demonstrate the effectiveness of the proposed framework, showing its ability to maintain good performance on previous tasks while also detecting novel anomalies. This work has important implications for the development of more flexible and adaptive manufacturing monitoring systems, which will be crucial as production processes continue to evolve and new types of defects emerge.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

❗

Pseudo Replay-based Class Continual Learning for Online New Category Anomaly Detection in Additive Manufacturing

Yuxuan Li, Tianxin Xie, Chenang Liu, Zhangyue Shi

The incorporation of advanced sensors and machine learning techniques has enabled modern manufacturing enterprises to perform data-driven classification-based anomaly detection based on the sensor data collected in manufacturing processes. However, one critical challenge is that newly presented defect category may manifest as the manufacturing process continues, resulting in monitoring performance deterioration of previously trained machine learning models. Hence, there is an increasing need for empowering machine learning models to learn continually. Among all continual learning methods, memory-based continual learning has the best performance but faces the constraints of data storage capacity. To address this issue, this paper develops a novel pseudo replay-based continual learning framework by integrating class incremental learning and oversampling-based data generation. Without storing all the data, the developed framework could generate high-quality data representing previous classes to train machine learning model incrementally when new category anomaly occurs. In addition, it could even enhance the monitoring performance since it also effectively improves the data quality. The effectiveness of the proposed framework is validated in three cases studies, which leverages supervised classification problem for anomaly detection. The experimental results show that the developed method is very promising in detecting novel anomaly while maintaining a good performance on the previous task and brings up more flexibility in model architecture.

9/5/2024

🏋️

Multi-Label Continual Learning for the Medical Domain: A Novel Benchmark

Marina Ceccon, Davide Dalle Pezze, Alessandro Fabris, Gian Antonio Susto

Despite the critical importance of the medical domain in Deep Learning, most of the research in this area solely focuses on training models in static environments. It is only in recent years that research has begun to address dynamic environments and tackle the Catastrophic Forgetting problem through Continual Learning (CL) techniques. Previous studies have primarily focused on scenarios such as Domain Incremental Learning and Class Incremental Learning, which do not fully capture the complexity of real-world applications. Therefore, in this work, we propose a novel benchmark combining the challenges of new class arrivals and domain shifts in a single framework, by considering the New Instances and New Classes (NIC) scenario. This benchmark aims to model a realistic CL setting for the multi-label classification problem in medical imaging. Additionally, it encompasses a greater number of tasks compared to previously tested scenarios. Specifically, our benchmark consists of two datasets (NIH and CXP), nineteen classes, and seven tasks, a stream longer than the previously tested ones. To solve common challenges (e.g., the task inference problem) found in the CIL and NIC scenarios, we propose a novel approach called Replay Consolidation with Label Propagation (RCLP). Our method surpasses existing approaches, exhibiting superior performance with minimal forgetting.

7/19/2024

⚙️

Adaptive Rentention & Correction for Continual Learning

Haoran Chen, Micah Goldblum, Zuxuan Wu, Yu-Gang Jiang

Continual learning, also known as lifelong learning or incremental learning, refers to the process by which a model learns from a stream of incoming data over time. A common problem in continual learning is the classification layer's bias towards the most recent task. Traditionally, methods have relied on incorporating data from past tasks during training to mitigate this issue. However, the recent shift in continual learning to memory-free environments has rendered these approaches infeasible. In this study, we propose a solution focused on the testing phase. We first introduce a simple Out-of-Task Detection method, OTD, designed to accurately identify samples from past tasks during testing. Leveraging OTD, we then propose: (1) an Adaptive Retention mechanism for dynamically tuning the classifier layer on past task data; (2) an Adaptive Correction mechanism for revising predictions when the model classifies data from previous tasks into classes from the current task. We name our approach Adaptive Retention & Correction (ARC). While designed for memory-free environments, ARC also proves effective in memory-based settings. Extensive experiments show that our proposed method can be plugged in to virtually any existing continual learning approach without requiring any modifications to its training procedure. Specifically, when integrated with state-of-the-art approaches, ARC achieves an average performance increase of 2.7% and 2.6% on the CIFAR-100 and Imagenet-R datasets, respectively.

5/24/2024

Adaptive Memory Replay for Continual Learning

James Seale Smith, Lazar Valkov, Shaunak Halbe, Vyshnavi Gutta, Rogerio Feris, Zsolt Kira, Leonid Karlinsky

Foundation Models (FMs) have become the hallmark of modern AI, however, these models are trained on massive data, leading to financially expensive training. Updating FMs as new data becomes available is important, however, can lead to `catastrophic forgetting', where models underperform on tasks related to data sub-populations observed too long ago. This continual learning (CL) phenomenon has been extensively studied, but primarily in a setting where only a small amount of past data can be stored. We advocate for the paradigm where memory is abundant, allowing us to keep all previous data, but computational resources are limited. In this setting, traditional replay-based CL approaches are outperformed by a simple baseline which replays past data selected uniformly at random, indicating that this setting necessitates a new approach. We address this by introducing a framework of adaptive memory replay for continual learning, where sampling of past data is phrased as a multi-armed bandit problem. We utilize Bolzmann sampling to derive a method which dynamically selects past data for training conditioned on the current task, assuming full data access and emphasizing training efficiency. Through extensive evaluations on both vision and language pre-training tasks, we demonstrate the effectiveness of our approach, which maintains high performance while reducing forgetting by up to 10% at no training efficiency cost.

4/22/2024