Density-Guided Dense Pseudo Label Selection For Semi-supervised Oriented Object Detection

Read original: arXiv:2311.12608 - Published 5/16/2024 by Tong Zhao, Qiang Fang, Shuohao Shi, Xin Xu

🔎

Overview

The paper proposes a new method called Density-Guided Dense Pseudo Label Selection (DDPLS) for semi-supervised oriented object detection.
This method addresses the limitations of existing dense pseudo-label selection methods, which ignore the significant density difference in aerial scenes with multi-oriented and dense objects.
The proposed DDPLS method uses a simple but effective adaptive mechanism to guide the selection of dense pseudo labels, by introducing a Pseudo Density Score (PDS) to estimate the density of potential objects.
Experiments on the DOTA-v1.5 benchmark show that DDPLS outperforms previous methods, especially when labeled data is scarce.

Plain English Explanation

In the field of computer vision, object detection is a crucial task that involves identifying and locating objects within an image. Semi-supervised object detection is a technique that aims to improve object detection performance by using both labeled and unlabeled data.

One approach to semi-supervised object detection is "dense pseudo-labeling," which directly selects pseudo-labels (predictions) from the original output of a teacher model, without any complicated post-processing steps. However, for aerial scenes with multi-oriented and dense objects, existing dense pseudo-label selection methods are inefficient because they ignore the significant density difference between objects.

To address this, the researchers propose a new method called Density-Guided Dense Pseudo Label Selection (DDPLS). DDPLS uses a simple but effective adaptive mechanism to guide the selection of dense pseudo labels. Specifically, it introduces a Pseudo Density Score (PDS) to estimate the density of potential objects, and then uses this score to select reliable dense pseudo labels.

The researchers tested DDPLS on the DOTA-v1.5 benchmark, a dataset of aerial images, and found that it outperforms previous methods, especially when labeled data is scarce. For example, DDPLS achieves 49.78 mAP (a performance metric) with just 5% of the annotated data, which is better than the previous state-of-the-art method with 10% of the annotated data.

Technical Explanation

The paper proposes a new method called Density-Guided Dense Pseudo Label Selection (DDPLS) for semi-supervised oriented object detection. This method addresses the limitations of existing dense pseudo-label selection methods, which ignore the significant density difference in aerial scenes with multi-oriented and dense objects.

In DDPLS, the researchers design a simple but effective adaptive mechanism to guide the selection of dense pseudo labels. Specifically, they introduce a Pseudo Density Score (PDS) to estimate the density of potential objects, and then use this score to select reliable dense pseudo labels.

The PDS is calculated based on the confidence scores and the distance between neighboring bounding boxes predicted by the teacher model. Objects with a higher PDS are more likely to be selected as pseudo labels.

The researchers evaluate DDPLS on the DOTA-v1.5 benchmark, a dataset of aerial images with multi-oriented and dense objects. The results show that DDPLS outperforms previous semi-supervised object detection methods, especially when labeled data is scarce.

For example, DDPLS achieves 49.78 mAP (mean Average Precision) with only 5% of the annotated data, which surpasses the previous state-of-the-art method given 10% of annotated data by 1.15 mAP. This demonstrates the effectiveness of the density-guided pseudo label selection approach in handling the challenges of aerial object detection.

Critical Analysis

The paper presents a novel and effective approach to semi-supervised object detection in aerial scenes, which are characterized by multi-oriented and dense objects. By introducing the Pseudo Density Score (PDS) to guide the selection of dense pseudo labels, DDPLS addresses the limitations of existing methods that ignore the significant density differences in these types of scenes.

One potential limitation of the DDPLS method is that it relies on the accuracy of the teacher model's predictions, which may not always be reliable, especially when the labeled data is scarce. [The researchers acknowledge this issue and suggest exploring ways to improve the robustness of the teacher model, such as using dynamic loss decay or reliable student techniques.]

Additionally, the paper focuses on oriented object detection in aerial scenes, but the DDPLS method could potentially be extended to other types of object detection tasks with dense and multi-oriented objects, such as open-vocabulary object detection. Further research could explore the applicability of DDPLS to these broader object detection scenarios.

Overall, the DDPLS method presents a promising approach to addressing the challenges of semi-supervised object detection in aerial scenes, and the researchers have made their code publicly available for further exploration and development.

Conclusion

The paper proposes a new method called Density-Guided Dense Pseudo Label Selection (DDPLS) for semi-supervised oriented object detection in aerial scenes. DDPLS addresses the limitations of existing dense pseudo-label selection methods by introducing a Pseudo Density Score (PDS) to guide the selection of reliable dense pseudo labels.

Experiments on the DOTA-v1.5 benchmark show that DDPLS outperforms previous semi-supervised object detection methods, especially when labeled data is scarce. This demonstrates the effectiveness of the density-guided pseudo label selection approach in handling the challenges of multi-oriented and dense objects in aerial scenes.

The DDPLS method represents a significant contribution to the field of semi-supervised object detection, and its potential applications extend beyond aerial scenes to other object detection tasks with dense and multi-oriented objects. The researchers have made their code publicly available, allowing for further exploration and development of this promising approach.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🔎

Density-Guided Dense Pseudo Label Selection For Semi-supervised Oriented Object Detection

Tong Zhao, Qiang Fang, Shuohao Shi, Xin Xu

Recently, dense pseudo-label, which directly selects pseudo labels from the original output of the teacher model without any complicated post-processing steps, has received considerable attention in semi-supervised object detection (SSOD). However, for the multi-oriented and dense objects that are common in aerial scenes, existing dense pseudo-label selection methods are inefficient because they ignore the significant density difference. Therefore, we propose Density-Guided Dense Pseudo Label Selection (DDPLS) for semi-supervised oriented object detection. In DDPLS, we design a simple but effective adaptive mechanism to guide the selection of dense pseudo labels. Specifically, we propose the Pseudo Density Score (PDS) to estimate the density of potential objects and use this score to select reliable dense pseudo labels. On the DOTA-v1.5 benchmark, the proposed method outperforms previous methods especially when labeled data are scarce. For example, it achieves 49.78 mAP given only 5% of annotated data, which surpasses previous state-of-the-art method given 10% of annotated data by 1.15 mAP. Our codes is available at https://github.com/Haru-zt/DDPLS.

5/16/2024

Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection

Jiacheng Zhang, Jiaming Li, Xiangru Lin, Wei Zhang, Xiao Tan, Junyu Han, Errui Ding, Jingdong Wang, Guanbin Li

We delve into pseudo-labeling for semi-supervised monocular 3D object detection (SSM3OD) and discover two primary issues: a misalignment between the prediction quality of 3D and 2D attributes and the tendency of depth supervision derived from pseudo-labels to be noisy, leading to significant optimization conflicts with other reliable forms of supervision. We introduce a novel decoupled pseudo-labeling (DPL) approach for SSM3OD. Our approach features a Decoupled Pseudo-label Generation (DPG) module, designed to efficiently generate pseudo-labels by separately processing 2D and 3D attributes. This module incorporates a unique homography-based method for identifying dependable pseudo-labels in BEV space, specifically for 3D attributes. Additionally, we present a DepthGradient Projection (DGP) module to mitigate optimization conflicts caused by noisy depth supervision of pseudo-labels, effectively decoupling the depth gradient and removing conflicting gradients. This dual decoupling strategy-at both the pseudo-label generation and gradient levels-significantly improves the utilization of pseudo-labels in SSM3OD. Our comprehensive experiments on the KITTI benchmark demonstrate the superiority of our method over existing approaches.

4/24/2024

SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection

Dingkang Liang, Wei Hua, Chunsheng Shi, Zhikang Zou, Xiaoqing Ye, Xiang Bai

Semi-supervised object detection (SSOD), leveraging unlabeled data to boost object detectors, has become a hot topic recently. However, existing SSOD approaches mainly focus on horizontal objects, leaving multi-oriented objects common in aerial images unexplored. At the same time, the annotation cost of multi-oriented objects is significantly higher than that of their horizontal counterparts. Therefore, in this paper, we propose a simple yet effective Semi-supervised Oriented Object Detection method termed SOOD++. Specifically, we observe that objects from aerial images are usually arbitrary orientations, small scales, and aggregation, which inspires the following core designs: a Simple Instance-aware Dense Sampling (SIDS) strategy is used to generate comprehensive dense pseudo-labels; the Geometry-aware Adaptive Weighting (GAW) loss dynamically modulates the importance of each pair between pseudo-label and corresponding prediction by leveraging the intricate geometric information of aerial objects; we treat aerial images as global layouts and explicitly build the many-to-many relationship between the sets of pseudo-labels and predictions via the proposed Noise-driven Global Consistency (NGC). Extensive experiments conducted on various multi-oriented object datasets under various labeled settings demonstrate the effectiveness of our method. For example, on the DOTA-V1.5 benchmark, the proposed method outperforms previous state-of-the-art (SOTA) by a large margin (+2.92, +2.39, and +2.57 mAP under 10%, 20%, and 30% labeled data settings, respectively) with single-scale training and testing. More importantly, it still improves upon a strong supervised baseline with 70.66 mAP, trained using the full DOTA-V1.5 train-val set, by +1.82 mAP, resulting in a 72.48 mAP, pushing the new state-of-the-art. The code will be made available.

7/2/2024

Multi-clue Consistency Learning to Bridge Gaps Between General and Oriented Object in Semi-supervised Detection

Chenxu Wang, Chunyan Xu, Ziqi Gu, Zhen Cui

While existing semi-supervised object detection (SSOD) methods perform well in general scenes, they encounter challenges in handling oriented objects in aerial images. We experimentally find three gaps between general and oriented object detection in semi-supervised learning: 1) Sampling inconsistency: the common center sampling is not suitable for oriented objects with larger aspect ratios when selecting positive labels from labeled data. 2) Assignment inconsistency: balancing the precision and localization quality of oriented pseudo-boxes poses greater challenges which introduces more noise when selecting positive labels from unlabeled data. 3) Confidence inconsistency: there exists more mismatch between the predicted classification and localization qualities when considering oriented objects, affecting the selection of pseudo-labels. Therefore, we propose a Multi-clue Consistency Learning (MCL) framework to bridge gaps between general and oriented objects in semi-supervised detection. Specifically, considering various shapes of rotated objects, the Gaussian Center Assignment is specially designed to select the pixel-level positive labels from labeled data. We then introduce the Scale-aware Label Assignment to select pixel-level pseudo-labels instead of unreliable pseudo-boxes, which is a divide-and-rule strategy suited for objects with various scales. The Consistent Confidence Soft Label is adopted to further boost the detector by maintaining the alignment of the predicted results. Comprehensive experiments on DOTA-v1.5 and DOTA-v1.0 benchmarks demonstrate that our proposed MCL can achieve state-of-the-art performance in the semi-supervised oriented object detection task.

7/9/2024