Few-shot Defect Image Generation based on Consistency Modeling

Read original: arXiv:2408.00372 - Published 8/2/2024 by Qingfeng Shi, Jing Wei, Fei Shen, Zhengtao Zhang

Few-shot Defect Image Generation based on Consistency Modeling

Overview

The paper proposes a novel few-shot defect image generation method based on consistency modeling
It aims to address the challenge of limited defect data availability in industrial settings
The method leverages a consistency-based diffusion model to generate high-quality defect images from a small number of seed samples

Plain English Explanation

The paper presents a new way to generate realistic defect images from just a few example images. This is an important problem in industrial settings, where there is often a lack of defect data to train machine learning models for defect detection and monitoring.

The key idea is to use a consistency modeling approach, which ensures that the generated images are visually consistent with the example defect images. This is done by training a diffusion model - a type of generative model - to learn the underlying distribution of the defect images.

Once trained, the diffusion model can then be used to generate new defect images that have a similar appearance and characteristics to the original examples, even with just a small number of samples. This allows companies to augment their defect datasets and improve the performance of their defect detection systems.

Technical Explanation

The paper introduces a few-shot defect image generation method based on a consistency-based diffusion model. The key technical components are:

Consistency Modeling: The diffusion model is trained to generate images that are visually consistent with the input defect examples, by incorporating a consistency loss that encourages the generated images to maintain the spatial and structural patterns of the seed samples.
Architecture: The diffusion model uses a U-Net-based architecture with self-attention layers to capture long-range dependencies in the defect patterns. This helps the model generate realistic and coherent defect images.
Training: The model is trained in a few-shot setting, using only a small number of defect examples. This is enabled by the consistency modeling approach, which allows the model to generalize from the limited data.
Evaluation: The generated defect images are evaluated both qualitatively and quantitatively, demonstrating their visual fidelity and diversity compared to existing few-shot generation methods.

Critical Analysis

The paper makes a valuable contribution by introducing a novel few-shot defect image generation approach that addresses the data scarcity challenge in industrial settings. The consistency modeling technique is a clever way to leverage limited data and generate diverse, high-quality defect images.

However, the paper does not provide a thorough discussion of the limitations or potential drawbacks of the proposed method. For example, it is unclear how the method would perform on more complex or diverse defect types, or how sensitive it is to the quality and representativeness of the seed examples.

Additionally, while the quantitative evaluation demonstrates the effectiveness of the approach, it would be helpful to see more real-world case studies or comparisons to alternative few-shot generation techniques to fully assess the practical impact of the method.

Conclusion

This paper presents a promising approach for few-shot defect image generation based on consistency modeling. By training a diffusion model to generate images that are visually consistent with a small set of seed defect samples, the method can effectively augment limited defect datasets and support the development of robust defect detection systems.

The technical innovation and experimental results showcase the potential of this approach to address a crucial problem in industrial machine vision applications. Further research to explore the method's scalability, robustness, and real-world deployment would be valuable next steps to build on this work.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Few-shot Defect Image Generation based on Consistency Modeling

Qingfeng Shi, Jing Wei, Fei Shen, Zhengtao Zhang

Image generation can solve insufficient labeled data issues in defect detection. Most defect generation methods are only trained on a single product without considering the consistencies among multiple products, leading to poor quality and diversity of generated results. To address these issues, we propose DefectDiffu, a novel text-guided diffusion method to model both intra-product background consistency and inter-product defect consistency across multiple products and modulate the consistency perturbation directions to control product type and defect strength, achieving diversified defect image generation. Firstly, we leverage a text encoder to separately provide consistency prompts for background, defect, and fusion parts of the disentangled integrated architecture, thereby disentangling defects and normal backgrounds. Secondly, we propose the double-free strategy to generate defect images through two-stage perturbation of consistency direction, thereby controlling product type and defect strength by adjusting the perturbation scale. Besides, DefectDiffu can generate defect mask annotations utilizing cross-attention maps from the defect part. Finally, to improve the generation quality of small defects and masks, we propose the adaptive attention-enhance loss to increase the attention to defects. Experimental results demonstrate that DefectDiffu surpasses state-of-the-art methods in terms of generation quality and diversity, thus effectively improving downstream defection performance. Moreover, defect perturbation directions can be transferred among various products to achieve zero-shot defect generation, which is highly beneficial for addressing insufficient data issues. The code are available at https://github.com/FFDD-diffusion/DefectDiffu.

8/2/2024

Diffusion-based Image Generation for In-distribution Data Augmentation in Surface Defect Detection

Luigi Capogrosso, Federico Girella, Francesco Taioli, Michele Dalla Chiara, Muhammad Aqeel, Franco Fummi, Francesco Setti, Marco Cristani

In this study, we show that diffusion models can be used in industrial scenarios to improve the data augmentation procedure in the context of surface defect detection. In general, defect detection classifiers are trained on ground-truth data formed by normal samples (negative data) and samples with defects (positive data), where the latter are consistently fewer than normal samples. For these reasons, state-of-the-art data augmentation procedures add synthetic defect data by superimposing artifacts to normal samples. This leads to out-of-distribution augmented data so that the classification system learns what is not a normal sample but does not know what a defect really is. We show that diffusion models overcome this situation, providing more realistic in-distribution defects so that the model can learn the defect's genuine appearance. We propose a novel approach for data augmentation that mixes out-of-distribution with in-distribution samples, which we call In&Out. The approach can deal with two data augmentation setups: i) when no defects are available (zero-shot data augmentation) and ii) when defects are available, which can be in a small number (few-shot) or a large one (full-shot). We focus the experimental part on the most challenging benchmark in the state-of-the-art, i.e., the Kolektor Surface-Defect Dataset 2, defining the new state-of-the-art classification AP score under weak supervision of .782. The code is available at https://github.com/intelligolabs/in_and_out.

6/4/2024

DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation

Ying Jin, Jinlong Peng, Qingdong He, Teng Hu, Hao Chen, Jiafu Wu, Wenbing Zhu, Mingmin Chi, Jun Liu, Yabiao Wang, Chengjie Wang

The performance of anomaly inspection in industrial manufacturing is constrained by the scarcity of anomaly data. To overcome this challenge, researchers have started employing anomaly generation approaches to augment the anomaly dataset. However, existing anomaly generation methods suffer from limited diversity in the generated anomalies and struggle to achieve a seamless blending of this anomaly with the original image. In this paper, we overcome these challenges from a new perspective, simultaneously generating a pair of the overall image and the corresponding anomaly part. We propose DualAnoDiff, a novel diffusion-based few-shot anomaly image generation model, which can generate diverse and realistic anomaly images by using a dual-interrelated diffusion model, where one of them is employed to generate the whole image while the other one generates the anomaly part. Moreover, we extract background and shape information to mitigate the distortion and blurriness phenomenon in few-shot image generation. Extensive experiments demonstrate the superiority of our proposed model over state-of-the-art methods in terms of both realism and diversity. Overall, our approach significantly improves the performance of downstream anomaly detection tasks, including anomaly detection, anomaly localization, and anomaly classification tasks.

8/29/2024

Defect Image Sample Generation With Diffusion Prior for Steel Surface Defect Recognition

Yichun Tai, Kun Yang, Tao Peng, Zhenzhen Huang, Zhijiang Zhang

The task of steel surface defect recognition is an industrial problem with great industry values. The data insufficiency is the major challenge in training a robust defect recognition network. Existing methods have investigated to enlarge the dataset by generating samples with generative models. However, their generation quality is still limited by the insufficiency of defect image samples. To this end, we propose Stable Surface Defect Generation (StableSDG), which transfers the vast generation distribution embedded in Stable Diffusion model for steel surface defect image generation. To tackle with the distinctive distribution gap between steel surface images and generated images of the diffusion model, we propose two processes. First, we align the distribution by adapting parameters of the diffusion model, adopted both in the token embedding space and network parameter space. Besides, in the generation process, we propose image-oriented generation rather than from pure Gaussian noises. We conduct extensive experiments on steel surface defect dataset, demonstrating state-of-the-art performance on generating high-quality samples and training recognition models, and both designed processes are significant for the performance.

5/6/2024