MINet: Multi-scale Interactive Network for Real-time Salient Object Detection of Strip Steel Surface Defects

Read original: arXiv:2405.16096 - Published 5/28/2024 by Kunye Shen, Xiaofei Zhou, Zhi Liu

Overview

This paper presents a novel multi-scale interactive network (MINet) for real-time detection of surface defects on strip steel.
The network leverages multi-scale feature extraction and interactive fusion to achieve high accuracy and computational efficiency.
The proposed approach achieves detection accuracy comparable to state-of-the-art methods while running at 721 FPS on a GPU and 6.3 FPS on a CPU for 368×368 images, with only 0.28M parameters.

Plain English Explanation

The paper describes a new deep learning model called MINet that is designed to quickly and accurately identify defects on the surface of steel strips. Surface defects can be a major problem in steel production, leading to waste and quality issues, so being able to detect them efficiently is important.

MINet works by extracting features from the steel surface image at multiple scales - looking at both the big picture and the fine details. It then intelligently combines these multi-scale features to make the final defect detection. This multi-scale, interactive approach allows MINet to achieve high accuracy while also being computationally efficient, enabling real-time processing.

Compared with other state-of-the-art defect detection methods, MINet reaches comparable detection quality while being lightweight and fast enough for real-time use. This makes it a promising solution for practical steel surface inspection applications.

Technical Explanation

The key innovations of the MINet architecture include:

Multi-scale Feature Extraction: The multi-scale interactive (MI) module uses depthwise convolution (DWConv) to extract features at different scales independently, capturing both coarse and fine-grained details of the steel surface. This helps the model recognize defects of different types and sizes.
Interactive Feature Fusion: The MI module uses pointwise convolution (PWConv) to fuse the multi-scale features interactively, which lets the network characterize defect regions well with few parameters.
Efficient Design: Building the network on depthwise and pointwise convolutions keeps the parameter count and computational cost low, which is what enables real-time defect detection.
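
To see why pairing depthwise and pointwise convolutions saves parameters, the following Python sketch compares the weight count of a standard convolution with that of a depthwise convolution followed by a pointwise one. The layer sizes (64 channels, 3×3 kernel) and the function names are assumptions chosen for illustration; they are not taken from the paper.

```python
def conv_params(c_in, c_out, k):
    """Weights in a standard k x k convolution (bias omitted)."""
    return c_in * c_out * k * k


def depthwise_separable_params(c_in, c_out, k):
    """Weights in a k x k depthwise conv followed by a 1 x 1 pointwise conv (bias omitted)."""
    depthwise = c_in * k * k   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 conv that mixes information across channels
    return depthwise + pointwise


# Hypothetical layer sizes, for illustration only (not from the paper).
c_in, c_out, k = 64, 64, 3
standard = conv_params(c_in, c_out, k)
separable = depthwise_separable_params(c_in, c_out, k)

print(f"standard conv:       {standard} weights")
print(f"depthwise+pointwise: {separable} weights")
print(f"reduction factor:    {standard / separable:.2f}x")
```

For these assumed sizes the separable pair needs 4,672 weights against 36,864 for the standard convolution, roughly a 7.9x reduction. The saving grows with kernel size and channel count.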

The authors evaluate MINet on the SD-Saliency-900 dataset, which contains three kinds of strip steel surface defects: inclusion, patches, and scratches. MINet reaches detection accuracy comparable to state-of-the-art methods while running at 721 FPS on a GPU and 6.3 FPS on a CPU for 368×368 images, with only 0.28M parameters.
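
The paper's abstract reports throughput in frames per second (721 FPS on GPU, 6.3 FPS on CPU). As a quick sanity check, the Python snippet below converts those figures into per-frame latency. The helper name `ms_per_frame` is hypothetical and is not part of the authors' code.

```python
def ms_per_frame(fps):
    """Convert throughput in frames per second to latency in milliseconds per frame."""
    return 1000.0 / fps


# Throughput figures as reported in the paper's abstract.
gpu_fps = 721.0
cpu_fps = 6.3

print(f"GPU: {ms_per_frame(gpu_fps):.2f} ms/frame")  # about 1.39 ms
print(f"CPU: {ms_per_frame(cpu_fps):.1f} ms/frame")  # about 158.7 ms
```

The gap of about two orders of magnitude suggests that, at the reported speeds, high-frame-rate inspection would rely on a GPU, while CPU-only inference may fit slower inspection rates.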

Critical Analysis

The authors acknowledge that while MINet demonstrates strong results, there is still room for improvement in certain areas. For example, the model may struggle with detecting very small or hard-to-distinguish defects, especially in complex or noisy steel surface images.

Additionally, the paper does not provide a detailed analysis of the model's sensitivity to various hyperparameters or its robustness to dataset shifts. Further research would be needed to fully characterize the model's strengths and limitations.

That said, the core ideas behind MINet, such as the multi-scale interactive feature fusion, seem well-justified and could potentially be applied to other computer vision tasks beyond steel surface inspection. The authors' focus on balancing accuracy and efficiency is also commendable and aligns well with real-world industrial needs.

Conclusion

In summary, the MINet architecture presented in this paper is a promising approach for real-time detection of surface defects on strip steel. By leveraging multi-scale feature extraction and interactive fusion, the model achieves accuracy comparable to state-of-the-art methods while maintaining high computational efficiency.

This research contributes to the ongoing efforts to improve quality control and automation in steel manufacturing. As the authors note, the MINet framework could potentially be extended to other industrial inspection tasks beyond just steel surfaces. Overall, this work represents a valuable step forward in the field of computer vision-based defect detection.

Related Papers

MINet: Multi-scale Interactive Network for Real-time Salient Object Detection of Strip Steel Surface Defects

Kunye Shen, Xiaofei Zhou, Zhi Liu

Automated surface defect detection is a fundamental task in industrial production, and the existing saliency-based works overcome the challenging scenes and give promising detection results. However, the cutting-edge efforts often suffer from large parameter size, heavy computational cost, and slow inference speed, which heavily limit practical applications. To this end, we devise a multi-scale interactive (MI) module, which employs depthwise convolution (DWConv) and pointwise convolution (PWConv) to independently extract and interactively fuse features of different scales, respectively. Particularly, the MI module can provide satisfactory characterization for defect regions with fewer parameters. Embarking on this module, we propose a lightweight Multi-scale Interactive Network (MINet) to conduct real-time salient object detection of strip steel surface defects. Comprehensive experimental results on the SD-Saliency-900 dataset, which contains three kinds of strip steel surface defect detection images (i.e., inclusion, patches, and scratches), demonstrate that the proposed MINet presents comparable detection accuracy with the state-of-the-art methods while running at a GPU speed of 721 FPS and a CPU speed of 6.3 FPS for 368×368 images with only 0.28M parameters. The code is available at https://github.com/Kunye-Shen/MINet.

5/28/2024

Faster Metallic Surface Defect Detection Using Deep Learning with Channel Shuffling

Siddiqui Muhammad Yasir, Hyunsik Ahn

Deep learning has been constantly improving in recent years, and a significant number of researchers have devoted themselves to the research of defect detection algorithms. Detection and recognition of small and complex targets is still a problem that needs to be solved. The authors of this research would like to present an improved defect detection model for detecting small and complex defect targets in steel surfaces. During steel strip production, mechanical forces and environmental factors cause surface defects of the steel strip. Therefore, the detection of such defects is key to the production of high-quality products. Moreover, surface defects of the steel strip cause great economic losses to the high-tech industry. So far, few studies have explored methods of identifying the defects, and most of the currently available algorithms are not sufficiently effective. Therefore, this study presents an improved real-time metallic surface defect detection model based on You Only Look Once (YOLOv5) specially designed for small networks. For the smaller features of the target, the conventional part is replaced with a depth-wise convolution and channel shuffle mechanism. Then, assigning weights to Feature Pyramid Networks (FPN) output features and fusing them increases feature propagation and the network's characterization ability. The experimental results reveal that the improved proposed model outperforms other comparable models in terms of accuracy and detection time. The mAP achieved by the proposed model is 77.5% on the Northeastern University dataset NEU-DET and 70.18% on the GC10-DET dataset.

6/24/2024

Modeling & Evaluating the Performance of Convolutional Neural Networks for Classifying Steel Surface Defects

Nadeem Jabbar Chaudhry, M. Bilal Khan, M. Javaid Iqbal, Siddiqui Muhammad Yasir

Recently, outstanding identification rates in image classification tasks were achieved by convolutional neural networks (CNNs). To use such skills, selective CNNs were trained on a dataset of well-known images of metal surface defects captured with an RGB camera. Defects must be detected early to take timely corrective action due to production concerns. For image classification up till now, a model-based method has been utilized, which indicated the predicted reflection characteristics of surface defects in comparison to flaw-free surfaces. The problem of detecting steel surface defects has grown in importance as a result of the vast range of steel applications in end-product sectors such as automobiles, households, construction, etc. The manual processes for detection are time-consuming, labor-intensive, and expensive. Different strategies have been used to automate manual processes, but CNN models have proven to be more effective than image processing and machine learning techniques. By fine-tuning different CNN models, one can easily compare their performance and select the best-performing model for the same kinds of tasks. However, it is important to note that using different CNN models, even with fine-tuning, can be computationally expensive and time-consuming. Therefore, our study helps upcoming researchers to choose a CNN without considering the issues of model complexity, performance, and computational resources. In this article, the performance of various CNN models with transfer learning techniques is evaluated. These models were chosen based on their popularity and impact in the field of computer vision research, as well as their performance on benchmark datasets. According to the outcomes, DenseNet201 outperformed the other CNN models and had the greatest detection rate on the NEU dataset, falling in at 98.37 percent.

6/24/2024

Msmsfnet: a multi-stream and multi-scale fusion net for edge detection

Chenguang Liu, Chisheng Wang, Feifei Dong, Xin Su, Chuanhua Zhu, Dejin Zhang, Qingquan Li

Edge detection is a long-standing problem in computer vision. Recent deep learning based algorithms achieve state-of-the-art performance in publicly available datasets. Despite the efficiency of these algorithms, their performance, however, relies heavily on the pretrained weights of the backbone network on the ImageNet dataset. This heavily limits the design space of deep learning based edge detectors. Whenever we want to devise a new model, we have to train this new model on the ImageNet dataset first, and then fine-tune the model using the edge detection datasets. The comparison would be unfair otherwise. However, it is usually not feasible for many researchers to train a model on the ImageNet dataset due to the limited computation resources. In this work, we study the performance that can be achieved by state-of-the-art deep learning based edge detectors in publicly available datasets when they are trained from scratch, and devise a new network architecture, the multi-stream and multi-scale fusion net (msmsfnet), for edge detection. We show in our experiments that by training all models from scratch to ensure the fairness of comparison, our model outperforms state-of-the-art deep learning based edge detectors in three publicly available datasets.

4/9/2024