Towards Efficient Pixel Labeling for Industrial Anomaly Detection and Localization

Read original: arXiv:2407.03130 - Published 7/8/2024 by Hanxi Li, Jingqi Wu, Lin Yuanbo Wu, Hao Chen, Deyin Liu, Chunhua Shen

Overview

This paper proposes an efficient pixel labeling approach for industrial anomaly detection and localization.
The method aims to overcome challenges in accurate and scalable anomaly segmentation, which is crucial for quality control in industrial settings.
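According to the paper's abstract, the approach centers on ADClick, an Interactive Image Segmentation (IIS) algorithm that turns a few manual clicks per image into pixel-level anomaly masks, and ADClick-Seg, an anomaly detection and localization model fine-tuned on the weak labels that ADClick infers.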

Plain English Explanation

The paper describes a new technique for automatically identifying and locating defects or abnormalities in industrial products or processes. This is an important problem in manufacturing, as being able to quickly and reliably detect issues can help maintain high quality standards.

The key innovation is a weakly-supervised learning framework that can learn to perform pixel-level anomaly segmentation - that is, it can precisely outline the areas of an image that contain defects. This is more efficient than traditional approaches that require a lot of detailed human labeling of training data.

The method also incorporates attention-based mechanisms to focus on the most relevant parts of the image for anomaly detection. This allows the system to zero in on potential issues without getting distracted by normal features.

Overall, this research aims to make industrial quality control more accurate, scalable, and cost-effective by automating the process of identifying and localizing defects in a way that requires less human effort than previous techniques. This could have significant real-world impact in manufacturing and other industrial domains.

Technical Explanation

The paper presents a weakly-supervised learning framework for efficient pixel-level anomaly detection and localization in industrial images. The key components of the approach include:

Weakly-Supervised Learning: Instead of requiring detailed per-pixel annotations of defects, the abstract describes annotating each training image with only 3 to 5 manual clicks. ADClick expands those clicks into pixel-level anomaly masks, which then serve as weak labels for training. This reduces the burden of data annotation.
Attention-Based Anomaly Segmentation: The model uses attention mechanisms to focus on the most relevant regions of the input image for anomaly detection. This allows it to accurately segment anomalies at the pixel level without being distracted by normal features.
Multi-Scale Feature Fusion: The architecture combines features from multiple scales to capture both local and global contextual information, improving the model's ability to precisely locate anomalies.
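To make the labeling-cost point concrete: the paper's abstract reports that 3 to 5 manual clicks per training image are enough to produce anomaly labels. The sketch below shows one common way interactive segmentation pipelines turn such clicks into model input, by rasterizing them into binary disk maps. It is an illustration under assumptions, not ADClick's actual encoding: the function name `encode_clicks`, the disk radius, and the two-channel positive/negative layout are all hypothetical choices made for this example.

```python
import numpy as np

def encode_clicks(height, width, clicks, radius=5):
    """Rasterize clicks into a (2, H, W) array of binary disk maps.

    clicks: list of (row, col, is_positive) tuples.
    Channel 0 holds positive (anomalous) clicks, channel 1 negative ones.
    The disk encoding and radius are assumptions for illustration only.
    """
    rows, cols = np.mgrid[0:height, 0:width]
    maps = np.zeros((2, height, width), dtype=np.float32)
    for r, c, is_positive in clicks:
        # Pixels within `radius` of the click form a filled disk.
        disk = (rows - r) ** 2 + (cols - c) ** 2 <= radius ** 2
        channel = 0 if is_positive else 1
        maps[channel][disk] = 1.0
    return maps

# Three clicks on a 64x64 image: two on the defect, one on the background.
clicks = [(20, 30, True), (22, 34, True), (50, 10, False)]
click_maps = encode_clicks(64, 64, clicks, radius=3)
print(click_maps.shape)
```

In a pipeline of this kind, the click maps would typically be stacked with the image (or its features) and passed to the segmentation network, so each extra click costs the annotator seconds rather than the minutes needed to trace a full mask.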

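The authors evaluate their approach on the MVTec AD and KSDD2 benchmarks. According to the abstract, ADClick used as a labeling tool produces anomaly labels with AP = 94.1% on MVTec AD, and ADClick-Seg fine-tuned on those labels reaches AP = 86.4% on MVTec AD and AP = 78.4%, PRO = 98.6% on KSDD2, which the authors report as state of the art among supervised anomaly detection methods. The technique shows promise for enabling scalable and cost-effective quality control in industrial settings.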
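The paper's abstract states its results as pixel-level average precision (AP). As a rough guide to what that number measures, here is a minimal sketch that ranks pixels by anomaly score and averages the precision at each true anomalous pixel. The function name `average_precision` is chosen for this example, and ties between equal scores are broken by pixel order, which can differ slightly from library implementations that group tied scores. The benchmarks' exact evaluation protocol is not described in this summary, so treat this as an illustration rather than the authors' evaluation code.

```python
import numpy as np

def average_precision(scores, mask):
    """Pixel-level AP for one anomaly score map against a binary mask.

    scores: array of anomaly scores (higher = more anomalous).
    mask:   array of the same shape, nonzero where the pixel is anomalous.
    """
    scores = np.asarray(scores, dtype=np.float64).ravel()
    labels = np.asarray(mask).ravel().astype(bool)
    n_pos = labels.sum()
    if n_pos == 0:
        raise ValueError("mask contains no anomalous pixels")
    # Sort pixels from highest to lowest score.
    order = np.argsort(-scores, kind="stable")
    hits = labels[order]
    # Precision after each ranked pixel: true positives so far / pixels so far.
    precision = np.cumsum(hits) / np.arange(1, hits.size + 1)
    # Average the precision values at the ranks of the anomalous pixels.
    return float(precision[hits].sum() / n_pos)

scores = np.array([[0.9, 0.8], [0.3, 0.1]])
mask = np.array([[1, 0], [1, 0]])
ap = average_precision(scores, mask)
print(round(ap, 4))
```

In this toy case one normal pixel (score 0.8) outranks an anomalous one (score 0.3), so AP falls below 1. A score map that ranks every anomalous pixel above every normal pixel would give exactly 1.0.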

Critical Analysis

The paper addresses an important real-world problem and presents a novel solution that appears to be effective based on the reported results. However, there are a few potential limitations and areas for further research:

Generalization to Diverse Industrial Domains: The experiments were conducted on a limited set of industrial datasets, so it's unclear how well the approach would generalize to highly diverse industrial environments and anomaly types. Further evaluation on a broader range of industrial scenarios would be valuable.
Computational Efficiency: While the weakly-supervised learning framework reduces the annotation burden, the attention-based anomaly segmentation may still be computationally intensive, especially for real-time industrial applications. Exploring more efficient architectures could be an area for future work.
Interpretability and Explainability: The paper does not discuss the interpretability or explainability of the model's anomaly detection decisions. Incorporating mechanisms to improve the model's transparency could enhance trust and enable better integration with human decision-making processes.

Overall, the proposed approach represents a promising step towards efficient and scalable industrial anomaly detection, but further research is needed to address potential limitations and broaden its real-world applicability.

Conclusion

This paper presents an efficient pixel labeling framework for industrial anomaly detection and localization. By combining click-based interactive labeling (ADClick) with a segmentation model fine-tuned on the resulting weak labels (ADClick-Seg), the method can segment anomalies at the pixel level while requiring far less manual annotation than full per-pixel labeling.

The reported results demonstrate the effectiveness of the proposed technique, which could have significant implications for improving quality control and reducing costs in a variety of industrial settings. Further research to address generalization, computational efficiency, and interpretability could help solidify the approach's real-world impact.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Towards Efficient Pixel Labeling for Industrial Anomaly Detection and Localization

Hanxi Li, Jingqi Wu, Lin Yuanbo Wu, Hao Chen, Deyin Liu, Chunhua Shen

In the realm of practical Anomaly Detection (AD) tasks, manual labeling of anomalous pixels proves to be a costly endeavor. Consequently, many AD methods are crafted as one-class classifiers, tailored for training sets completely devoid of anomalies, ensuring a more cost-effective approach. While some pioneering work has demonstrated heightened AD accuracy by incorporating real anomaly samples in training, this enhancement comes at the price of labor-intensive labeling processes. This paper strikes the balance between AD accuracy and labeling expenses by introducing ADClick, a novel Interactive Image Segmentation (IIS) algorithm. ADClick efficiently generates ground-truth anomaly masks for real defective images, leveraging innovative residual features and meticulously crafted language prompts. Notably, ADClick showcases a significantly elevated generalization capacity compared to existing state-of-the-art IIS approaches. Functioning as an anomaly labeling tool, ADClick generates high-quality anomaly labels (AP $= 94.1\%$ on MVTec AD) based on only $3$ to $5$ manual click annotations per training image. Furthermore, we extend the capabilities of ADClick into ADClick-Seg, an enhanced model designed for anomaly detection and localization. By fine-tuning the ADClick-Seg model using the weak labels inferred by ADClick, we establish the state-of-the-art performances in supervised AD tasks (AP $= 86.4\%$ on MVTec AD and AP $= 78.4\%$, PRO $= 98.6\%$ on KSDD2).

7/8/2024

Supervised Anomaly Detection for Complex Industrial Images

Aimira Baitieva, David Hurych, Victor Besnier, Olivier Bernard

Automating visual inspection in industrial production lines is essential for increasing product quality across various industries. Anomaly detection (AD) methods serve as robust tools for this purpose. However, existing public datasets primarily consist of images without anomalies, limiting the practical application of AD methods in production settings. To address this challenge, we present (1) the Valeo Anomaly Dataset (VAD), a novel real-world industrial dataset comprising 5000 images, including 2000 instances of challenging real defects across more than 20 subclasses. Acknowledging that traditional AD methods struggle with this dataset, we introduce (2) Segmentation-based Anomaly Detector (SegAD). First, SegAD leverages anomaly maps as well as segmentation maps to compute local statistics. Next, SegAD uses these statistics and an optional supervised classifier score as input features for a Boosted Random Forest (BRF) classifier, yielding the final anomaly score. Our SegAD achieves state-of-the-art performance on both VAD (+2.1% AUROC) and the VisA dataset (+0.4% AUROC). The code and the models are publicly available.

5/14/2024

Test Time Training for Industrial Anomaly Segmentation

Alex Costanzino, Pierluigi Zama Ramirez, Mirko Del Moro, Agostino Aiezzo, Giuseppe Lisanti, Samuele Salti, Luigi Di Stefano

Anomaly Detection and Segmentation (AD&S) is crucial for industrial quality control. While existing methods excel in generating anomaly scores for each pixel, practical applications require producing a binary segmentation to identify anomalies. Due to the absence of labeled anomalies in many real scenarios, standard practices binarize these maps based on some statistics derived from a validation set containing only nominal samples, resulting in poor segmentation performance. This paper addresses this problem by proposing a test time training strategy to improve the segmentation performance. Indeed, at test time, we can extract rich features directly from anomalous samples to train a classifier that can discriminate defects effectively. Our general approach can work downstream to any AD&S method that provides an anomaly score map as output, even in multimodal settings. We demonstrate the effectiveness of our approach over baselines through extensive experimentation and evaluation on MVTec AD and MVTec 3D-AD.

4/8/2024

Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark

Jiangning Zhang, Chengjie Wang, Xiangtai Li, Guanzhong Tian, Zhucun Xue, Yong Liu, Guansong Pang, Dacheng Tao

Anomaly detection (AD) is often focused on detecting anomaly areas for industrial quality inspection and medical lesion examination. However, due to the specific scenario targets, the data scale for AD is relatively small, and evaluation metrics are still deficient compared to classic vision tasks, such as object detection and semantic segmentation. To fill these gaps, this work first constructs a large-scale and general-purpose COCO-AD dataset by extending COCO to the AD field. This enables fair evaluation and sustainable development for different methods on this challenging benchmark. Moreover, current metrics such as AU-ROC have nearly reached saturation on simple datasets, which prevents a comprehensive evaluation of different methods. Inspired by the metrics in the segmentation field, we further propose several more practical threshold-dependent AD-specific metrics, i.e., m$F_1{}^{.2}_{.8}$, mAcc$^{.2}_{.8}$, mIoU$^{.2}_{.8}$, and mIoU-max. Motivated by GAN inversion's high-quality reconstruction capability, we propose a simple but more powerful InvAD framework to achieve high-quality feature reconstruction. Our method improves the effectiveness of reconstruction-based methods on popular MVTec AD, VisA, and our newly proposed COCO-AD datasets under a multi-class unsupervised setting, where only a single detection model is trained to detect anomalies from different classes. Extensive ablation experiments have demonstrated the effectiveness of each component of our InvAD. Full codes and models are available at https://github.com/zhangzjn/ader.

4/17/2024