GeneralAD: Anomaly Detection Across Domains by Attending to Distorted Features

Read original: arXiv:2407.12427 - Published 7/18/2024 by Luc P. J. Strater, Mohammadreza Salehi, Efstratios Gavves, Cees G. M. Snoek, Yuki M. Asano

Overview

This paper introduces GeneralAD, an anomaly detection framework designed to operate in semantic, near-distribution, and industrial settings with minimal per-task adjustments.
The key idea is to distort Vision Transformer patch features with simple operations such as noise addition and shuffling, then train an attention-based discriminator to score every patch.
The authors evaluate GeneralAD on ten datasets and report state-of-the-art results on six and on-par performance on the rest, for both detection and localization.

Plain English Explanation

Anomaly detection is the task of identifying data points that deviate significantly from the normal patterns in a dataset. This is an important problem with applications in areas like fraud detection, predictive maintenance, and medical diagnosis. [link to https://aimodels.fyi/papers/arxiv/domain-independent-detection-known-anomalies]

However, anomaly detection methods tend to excel in only one kind of setting or "domain". Semantic anomalies are novelties that differ in meaning from the training data, such as unseen objects encountered by a self-driving car. Industrial anomalies are subtle defects that preserve semantic meaning, such as cracks in airplane components. A method tuned for one kind rarely performs well on the other.

The researchers behind GeneralAD propose a single framework for both. Their key idea is to manufacture "distorted" features during training: they take the patch-level features of an image and corrupt some of them with simple operations such as adding noise or shuffling. A discriminator that learns to tell distorted patches from intact ones can then flag unusual regions at test time, which is intended to work whether the anomaly is an unfamiliar object or a small defect.

This approach builds on recent advances in self-supervised learning and anomaly localization. [links to https://aimodels.fyi/papers/arxiv/tocoad-two-stage-contrastive-learning-industrial-anomaly, https://aimodels.fyi/papers/arxiv/multimodal-industrial-anomaly-detection-by-crossmodal-feature, https://aimodels.fyi/papers/arxiv/supervised-anomaly-detection-complex-industrial-images]

The researchers evaluate GeneralAD on ten datasets and report state-of-the-art results on six of them, with on-par performance on the remaining four, for both detection and localization. This suggests that their approach could be useful wherever anomalies must be detected reliably across different contexts, such as visual inspection in manufacturing. [link to https://aimodels.fyi/papers/arxiv/learning-feature-inversion-multi-class-anomaly-detection]

Technical Explanation

The core idea in GeneralAD is to train a discriminator on "distorted" features, that is, patch features that have been deliberately corrupted so that they deviate from the normal patterns. The method capitalizes on the design of Vision Transformers, which operate on image patches, so the last hidden states retain a patch-based structure that can be distorted and scored patch by patch.

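According to the paper's abstract, the GeneralAD pipeline has three main parts:

A Vision Transformer feature extractor whose last hidden states provide one feature vector per image patch.
A self-supervised anomaly generation module that applies simple operations such as noise addition and shuffling to the patch features to construct pseudo-abnormal samples.
An attention-based discriminator that is trained to score every patch in the image.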

The training signal is self-supervised: the anomaly generation module distorts patch features to create pseudo-abnormal samples, and the discriminator learns to separate distorted patches from intact ones. Because the discriminator is attention-based, each patch can be scored in the context of the other patches in the image.
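
The paper's abstract describes a self-supervised anomaly generation module that applies noise addition and shuffling to patch features to construct pseudo-abnormal samples. A minimal NumPy sketch of that idea follows. The function name, the fraction of distorted patches, the noise scale, and the use of a cyclic shift as the shuffle are all illustrative assumptions, not details taken from the paper.

```python
import numpy as np

def make_pseudo_anomalies(patch_feats, rng, distort_frac=0.25, noise_std=0.5):
    """Distort a random subset of patch features with noise and shuffling.

    patch_feats: array of shape (num_patches, dim), e.g. ViT last hidden states.
    Returns (distorted_feats, labels), where labels[i] == 1 marks a distorted patch.
    All hyperparameters are hypothetical defaults chosen for illustration.
    """
    num_patches = patch_feats.shape[0]
    if num_patches < 3:
        raise ValueError("need at least 3 patches to apply both distortions")

    # Pick at least 3 patches so the shuffle group always has 2 or more members.
    num_distort = min(num_patches, max(3, int(round(distort_frac * num_patches))))
    idx = rng.choice(num_patches, size=num_distort, replace=False)
    half = num_distort // 2
    noise_idx, shuffle_idx = idx[:half], idx[half:]

    out = patch_feats.copy()

    # Distortion 1: add Gaussian noise to the selected patch features.
    out[noise_idx] += rng.normal(0.0, noise_std, size=(len(noise_idx), patch_feats.shape[1]))

    # Distortion 2: shuffle features between positions. A cyclic shift guarantees
    # that every selected position receives a feature from a different position.
    out[shuffle_idx] = patch_feats[np.roll(shuffle_idx, 1)]

    labels = np.zeros(num_patches, dtype=np.int64)
    labels[idx] = 1
    return out, labels

# Usage on random stand-in features: a 4x4 grid of patches with 8-dim features.
rng = np.random.default_rng(0)
feats = rng.normal(size=(16, 8))
distorted, labels = make_pseudo_anomalies(feats, rng)
```

The returned labels give a per-patch training target, which is what a patch-level discriminator would need in order to learn to separate distorted from intact features.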

The patch scores serve two purposes. Aggregated over the image, they identify anomalies at the image level; laid out on the patch grid, they form an interpretable anomaly map that shows where the model believes the anomaly is.
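
To illustrate how an attention-based scorer can assign a score to every patch, here is a toy single-head self-attention forward pass in NumPy. It uses random, untrained weights, so the scores themselves are meaningless. The parameter names, the sigmoid output, and the max over patches as the image-level score are assumptions for illustration only; the paper's discriminator may differ.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row maximum for numerical stability.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def init_params(dim, rng):
    """Random weights for the toy scorer (hypothetical parameter layout)."""
    scale = 1.0 / np.sqrt(dim)
    return {
        "wq": rng.normal(0.0, scale, (dim, dim)),
        "wk": rng.normal(0.0, scale, (dim, dim)),
        "wv": rng.normal(0.0, scale, (dim, dim)),
        "w_out": rng.normal(0.0, scale, (dim,)),
        "b_out": 0.0,
    }

def score_patches(patch_feats, params):
    """Return one score in (0, 1) per patch; patch_feats has shape (num_patches, dim)."""
    q = patch_feats @ params["wq"]
    k = patch_feats @ params["wk"]
    v = patch_feats @ params["wv"]
    # Each patch attends to every patch in the same image.
    attn = softmax(q @ k.T / np.sqrt(q.shape[-1]))
    context = attn @ v
    logits = context @ params["w_out"] + params["b_out"]
    return 1.0 / (1.0 + np.exp(-logits))  # sigmoid

rng = np.random.default_rng(0)
feats = rng.normal(size=(16, 8))        # stand-in for a 4x4 grid of patch features
scores = score_patches(feats, init_params(8, rng))
anomaly_map = scores.reshape(4, 4)      # patch-level anomaly map
image_score = scores.max()              # assumed image-level aggregation
```

Taking the maximum patch score treats an image as anomalous if any single patch looks anomalous; a mean or top-k average would be an equally plausible choice in a sketch like this.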

The researchers evaluate GeneralAD on ten datasets. They report state-of-the-art results on six of them and on-par performance on the remaining ones, for both localization and detection tasks.
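
Results on anomaly detection benchmarks are often reported as AUROC, the metric quoted by the related papers listed further down this page. Assuming that metric, the sketch below computes it directly from its definition: the probability that a randomly chosen anomalous sample receives a higher score than a randomly chosen normal one, with ties counted as half. The function name and the example numbers are hypothetical.

```python
import numpy as np

def auroc(scores, labels):
    """Pairwise AUROC: labels are 1 for anomalous samples and 0 for normal ones."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels).astype(bool)
    pos, neg = scores[labels], scores[~labels]
    if len(pos) == 0 or len(neg) == 0:
        raise ValueError("need at least one normal and one anomalous sample")
    # Compare every anomalous score against every normal score.
    greater = (pos[:, None] > neg[None, :]).sum()
    ties = (pos[:, None] == neg[None, :]).sum()
    return (greater + 0.5 * ties) / (len(pos) * len(neg))

# Made-up scores: three of the four (anomalous, normal) pairs are ranked correctly.
example = auroc([0.1, 0.4, 0.35, 0.8], [0, 0, 1, 1])  # 0.75
```

The same function applies at the image level (one score per image) or at the pixel or patch level (one score per location), which corresponds to the detection and localization tasks respectively.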

Critical Analysis

The GeneralAD paper presents a compelling solution to the challenge of developing anomaly detection models that can generalize well across different domains. The attention-based approach is a novel and well-designed contribution that builds on recent advances in self-supervised learning and anomaly localization.

One potential limitation of the research is the reliance on synthetic pseudo-anomalies, created by applying noise and shuffling to patch features, as the abnormal examples seen during training. While training on synthetic anomalies is a common practice in the field, it would be valuable to see how well these simple distortions cover the range of anomalies found in real-world deployments.

Additionally, the paper does not provide a deep exploration of the types of distorted features that the attention mechanism focuses on. Understanding the specific characteristics of these features and how they relate to anomalies could provide valuable insights for future research and applications.

Finally, the authors mention the potential for GeneralAD to be extended to multi-modal data, but do not provide any experimental results in this direction. Exploring the model's performance on heterogeneous data sources could further demonstrate its versatility and broader applicability.

Overall, the GeneralAD paper presents a promising step forward in the field of anomaly detection, with the potential for significant impact across a wide range of industries and applications.

Conclusion

The GeneralAD paper introduces an anomaly detection framework that works across semantic, near-distribution, and industrial settings by training an attention-based discriminator on distorted patch features. The authors report state-of-the-art results on six of ten datasets and on-par performance on the rest, for both detection and localization.

This research contributes to the ongoing efforts to develop anomaly detection models that can generalize well across different datasets and contexts, a crucial capability for real-world deployment in industries like manufacturing, finance, and healthcare. The attention mechanism's ability to identify the most informative features for anomaly detection represents a significant advancement in the field and may inspire further innovations in self-supervised learning and anomaly localization.

While the paper leaves room for additional exploration, such as the analysis of distorted features and the extension to multi-modal data, the GeneralAD approach represents an important step forward in building robust and adaptable anomaly detection systems.

Related Papers

GeneralAD: Anomaly Detection Across Domains by Attending to Distorted Features

Luc P. J. Strater, Mohammadreza Salehi, Efstratios Gavves, Cees G. M. Snoek, Yuki M. Asano

In the domain of anomaly detection, methods often excel in either high-level semantic or low-level industrial benchmarks, rarely achieving cross-domain proficiency. Semantic anomalies are novelties that differ in meaning from the training set, like unseen objects in self-driving cars. In contrast, industrial anomalies are subtle defects that preserve semantic meaning, such as cracks in airplane components. In this paper, we present GeneralAD, an anomaly detection framework designed to operate in semantic, near-distribution, and industrial settings with minimal per-task adjustments. In our approach, we capitalize on the inherent design of Vision Transformers, which are trained on image patches, thereby ensuring that the last hidden states retain a patch-based structure. We propose a novel self-supervised anomaly generation module that employs straightforward operations like noise addition and shuffling to patch features to construct pseudo-abnormal samples. These features are fed to an attention-based discriminator, which is trained to score every patch in the image. With this, our method can both accurately identify anomalies at the image level and also generate interpretable anomaly maps. We extensively evaluated our approach on ten datasets, achieving state-of-the-art results in six and on-par performance in the remaining for both localization and detection tasks.

7/18/2024

Domain-independent detection of known anomalies

Jonas Buhler, Jonas Fehrenbach, Lucas Steinmann, Christian Nauck, Marios Koulakis

One persistent obstacle in industrial quality inspection is the detection of anomalies. In real-world use cases, two problems must be addressed: anomalous data is sparse and the same types of anomalies need to be detected on previously unseen objects. Current anomaly detection approaches can be trained with sparse nominal data, whereas domain generalization approaches enable detecting objects in previously unseen domains. Utilizing those two observations, we introduce the hybrid task of domain generalization on sparse classes. To introduce an accompanying dataset for this task, we present a modification of the well-established MVTec AD dataset by generating three new datasets. In addition to applying existing methods for benchmark, we design two embedding-based approaches, Spatial Embedding MLP (SEMLP) and Labeled PatchCore. Overall, SEMLP achieves the best performance with an average image-level AUROC of 87.2 % vs. 80.4 % by MIRO. The new and openly available datasets allow for further research to improve industrial anomaly detection.

7/4/2024

Multimodal Industrial Anomaly Detection by Crossmodal Feature Mapping

Alex Costanzino, Pierluigi Zama Ramirez, Giuseppe Lisanti, Luigi Di Stefano

The paper explores the industrial multimodal Anomaly Detection (AD) task, which exploits point clouds and RGB images to localize anomalies. We introduce a novel light and fast framework that learns to map features from one modality to the other on nominal samples. At test time, anomalies are detected by pinpointing inconsistencies between observed and mapped features. Extensive experiments show that our approach achieves state-of-the-art detection and segmentation performance in both the standard and few-shot settings on the MVTec 3D-AD dataset while achieving faster inference and occupying less memory than previous multimodal AD methods. Moreover, we propose a layer-pruning technique to improve memory and time efficiency with a marginal sacrifice in performance.

7/9/2024

ToCoAD: Two-Stage Contrastive Learning for Industrial Anomaly Detection

Yun Liang, Zhiguang Hu, Junjie Huang, Donglin Di, Anyang Su, Lei Fan

Current unsupervised anomaly detection approaches perform well on public datasets but struggle with specific anomaly types due to the domain gap between pre-trained feature extractors and target-specific domains. To tackle this issue, this paper presents a two-stage training strategy, called ToCoAD. In the first stage, a discriminative network is trained by using synthetic anomalies in a self-supervised learning manner. This network is then utilized in the second stage to provide a negative feature guide, aiding in the training of the feature extractor through bootstrap contrastive learning. This approach enables the model to progressively learn the distribution of anomalies specific to industrial datasets, effectively enhancing its generalizability to various types of anomalies. Extensive experiments are conducted to demonstrate the effectiveness of our proposed two-stage training strategy, and our model produces competitive performance, achieving pixel-level AUROC scores of 98.21%, 98.43% and 97.70% on MVTec AD, VisA and BTAD respectively.

7/2/2024