ADer: A Comprehensive Benchmark for Multi-class Visual Anomaly Detection

Read original: arXiv:2406.03262 - Published 6/7/2024 by Jiangning Zhang, Haoyang He, Zhenye Gan, Qingdong He, Yuxuan Cai, Zhucun Xue, Yabiao Wang, Chengjie Wang, Lei Xie, Yong Liu

ADer: A Comprehensive Benchmark for Multi-class Visual Anomaly Detection

Overview

This paper introduces ADer, a comprehensive benchmark for multi-class visual anomaly detection.
Visual anomaly detection is the task of identifying abnormal or unusual objects or patterns in images, which is crucial for applications like quality control, medical imaging, and autonomous systems.
ADer provides a diverse dataset, evaluation protocols, and baseline models to advance research in this important field.

Plain English Explanation

The paper presents a new benchmark called ADer for evaluating algorithms that can detect unusual or abnormal objects in images. Detecting visual anomalies is an important problem with many real-world applications, like checking the quality of manufactured products, analyzing medical scans for signs of disease, and helping self-driving cars identify unusual obstacles.

The ADer benchmark includes a large, diverse dataset of images that contain different types of anomalies. It also defines ways to rigorously test how well anomaly detection algorithms perform across multiple classes of anomalies. By providing this standardized benchmark, the researchers hope to drive progress in this area of computer vision and make it easier to compare the capabilities of different anomaly detection approaches.

Technical Explanation

The paper introduces the ADer benchmark for evaluating multi-class visual anomaly detection algorithms. Visual anomaly detection is the task of identifying abnormal or unusual objects or patterns in images, which is crucial for applications like quality control, medical imaging, and autonomous systems.

ADer provides a diverse dataset of over 100,000 images across 15 object categories, with anomalies spanning multiple classes. The benchmark defines evaluation protocols to assess an algorithm's ability to detect anomalies of different types and severities. The paper also presents several baseline anomaly detection models based on popular deep learning architectures.

Experiments show that existing state-of-the-art anomaly detection methods struggle to generalize well across the varied anomalies in the ADer dataset. The authors hope that this benchmark will spur further research and innovation in this important but challenging computer vision problem.

Critical Analysis

The ADer benchmark represents an important step forward in visual anomaly detection research. By providing a large, diverse dataset and standardized evaluation protocols, the authors have created a valuable tool for comparing the capabilities of different anomaly detection algorithms.

However, the paper acknowledges that the benchmark has some limitations. The dataset, while extensive, may not capture the full breadth of anomalies that could occur in real-world scenarios. Additionally, the evaluation metrics used may not always align perfectly with the practical needs of end-users in different application domains.

Further research is needed to develop anomaly detection methods that can effectively handle the wide range of anomalies and variations present in the ADer dataset. Incorporating more advanced techniques, such as link to "Learning Feature Inversion for Multi-class Anomaly Detection" or link to "Supervised Anomaly Detection for Complex Industrial Images", may be a promising direction.

Conclusion

The ADer benchmark represents a significant advancement in the field of multi-class visual anomaly detection. By providing a standardized dataset, evaluation protocols, and baseline models, the authors have created a valuable resource for researchers and practitioners in this important area of computer vision.

While the benchmark has some limitations, it is an important step toward driving progress and ensuring a more rigorous and reproducible approach to developing anomaly detection algorithms. Ultimately, the success of ADer will be measured by its ability to spur the development of more robust and versatile anomaly detection systems, with the potential to impact a wide range of real-world applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

ADer: A Comprehensive Benchmark for Multi-class Visual Anomaly Detection

Jiangning Zhang, Haoyang He, Zhenye Gan, Qingdong He, Yuxuan Cai, Zhucun Xue, Yabiao Wang, Chengjie Wang, Lei Xie, Yong Liu

Visual anomaly detection aims to identify anomalous regions in images through unsupervised learning paradigms, with increasing application demand and value in fields such as industrial inspection and medical lesion detection. Despite significant progress in recent years, there is a lack of comprehensive benchmarks to adequately evaluate the performance of various mainstream methods across different datasets under the practical multi-class setting. The absence of standardized experimental setups can lead to potential biases in training epochs, resolution, and metric results, resulting in erroneous conclusions. This paper addresses this issue by proposing a comprehensive visual anomaly detection benchmark, textbf{textit{ADer}}, which is a modular framework that is highly extensible for new methods. The benchmark includes multiple datasets from industrial and medical domains, implementing fifteen state-of-the-art methods and nine comprehensive metrics. Additionally, we have open-sourced the GPU-assisted href{https://pypi.org/project/ADEval}{ADEval} package to address the slow evaluation problem of metrics like time-consuming mAU-PRO on large-scale data, significantly reducing evaluation time by more than textit{1000-fold}. Through extensive experimental results, we objectively reveal the strengths and weaknesses of different methods and provide insights into the challenges and future directions of multi-class visual anomaly detection. We hope that textbf{textit{ADer}} will become a valuable resource for researchers and practitioners in the field, promoting the development of more robust and generalizable anomaly detection systems. Full codes have been attached in Appendix and open-sourced at url{https://github.com/zhangzjn/ader}.

6/7/2024

BMAD: Benchmarks for Medical Anomaly Detection

Jinan Bao, Hanshi Sun, Hanqiu Deng, Yinsheng He, Zhaoxiang Zhang, Xingyu Li

Anomaly detection (AD) is a fundamental research problem in machine learning and computer vision, with practical applications in industrial inspection, video surveillance, and medical diagnosis. In medical imaging, AD is especially vital for detecting and diagnosing anomalies that may indicate rare diseases or conditions. However, there is a lack of a universal and fair benchmark for evaluating AD methods on medical images, which hinders the development of more generalized and robust AD methods in this specific domain. To bridge this gap, we introduce a comprehensive evaluation benchmark for assessing anomaly detection methods on medical images. This benchmark encompasses six reorganized datasets from five medical domains (i.e. brain MRI, liver CT, retinal OCT, chest X-ray, and digital histopathology) and three key evaluation metrics, and includes a total of fourteen state-of-the-art AD algorithms. This standardized and well-curated medical benchmark with the well-structured codebase enables comprehensive comparisons among recently proposed anomaly detection methods. It will facilitate the community to conduct a fair comparison and advance the field of AD on medical imaging. More information on BMAD is available in our GitHub repository: https://github.com/DorisBao/BMAD

4/30/2024

MedIAnomaly: A comparative study of anomaly detection in medical images

Yu Cai, Weiwen Zhang, Hao Chen, Kwang-Ting Cheng

Anomaly detection (AD) aims at detecting abnormal samples that deviate from the expected normal patterns. Generally, it can be trained merely on normal data, without a requirement for abnormal samples, and thereby plays an important role in the recognition of rare diseases and health screening in the medical domain. Despite the emergence of numerous methods for medical AD, we observe a lack of a fair and comprehensive evaluation, which causes ambiguous conclusions and hinders the development of this field. To address this problem, this paper builds a benchmark with unified comparison. Seven medical datasets with five image modalities, including chest X-rays, brain MRIs, retinal fundus images, dermatoscopic images, and histopathology whole slide images, are curated for extensive evaluation. Thirty typical AD methods, including reconstruction and self-supervised learning-based methods, are involved in comparison of image-level anomaly classification and pixel-level anomaly segmentation. Furthermore, for the first time, we formally explore the effect of key components in existing methods, clearly revealing unresolved challenges and potential future directions. The datasets and code are available at url{https://github.com/caiyu6666/MedIAnomaly}.

7/23/2024

✨

Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark

Jiangning Zhang, Chengjie Wang, Xiangtai Li, Guanzhong Tian, Zhucun Xue, Yong Liu, Guansong Pang, Dacheng Tao

Anomaly detection (AD) is often focused on detecting anomaly areas for industrial quality inspection and medical lesion examination. However, due to the specific scenario targets, the data scale for AD is relatively small, and evaluation metrics are still deficient compared to classic vision tasks, such as object detection and semantic segmentation. To fill these gaps, this work first constructs a large-scale and general-purpose COCO-AD dataset by extending COCO to the AD field. This enables fair evaluation and sustainable development for different methods on this challenging benchmark. Moreover, current metrics such as AU-ROC have nearly reached saturation on simple datasets, which prevents a comprehensive evaluation of different methods. Inspired by the metrics in the segmentation field, we further propose several more practical threshold-dependent AD-specific metrics, ie, m$F_1$$^{.2}_{.8}$, mAcc$^{.2}_{.8}$, mIoU$^{.2}_{.8}$, and mIoU-max. Motivated by GAN inversion's high-quality reconstruction capability, we propose a simple but more powerful InvAD framework to achieve high-quality feature reconstruction. Our method improves the effectiveness of reconstruction-based methods on popular MVTec AD, VisA, and our newly proposed COCO-AD datasets under a multi-class unsupervised setting, where only a single detection model is trained to detect anomalies from different classes. Extensive ablation experiments have demonstrated the effectiveness of each component of our InvAD. Full codes and models are available at https://github.com/zhangzjn/ader.

4/17/2024