Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation

2406.16776

Published 6/26/2024 by Yizheng Wu, Zhiyu Pan, Kewei Wang, Xingyi Li, Jiahao Cui, Liwen Xiao, Guosheng Lin, Zhiguo Cao

Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation

Abstract

Large-scale datasets with point-wise semantic and instance labels are crucial to 3D instance segmentation but also expensive. To leverage unlabeled data, previous semi-supervised 3D instance segmentation approaches have explored self-training frameworks, which rely on high-quality pseudo labels for consistency regularization. They intuitively utilize both instance and semantic pseudo labels in a joint learning manner. However, semantic pseudo labels contain numerous noise derived from the imbalanced category distribution and natural confusion of similar but distinct categories, which leads to severe collapses in self-training. Motivated by the observation that 3D instances are non-overlapping and spatially separable, we ask whether we can solely rely on instance consistency regularization for improved semi-supervised segmentation. To this end, we propose a novel self-training network InsTeacher3D to explore and exploit pure instance knowledge from unlabeled data. We first build a parallel base 3D instance segmentation model DKNet, which distinguishes each instance from the others via discriminative instance kernels without reliance on semantic segmentation. Based on DKNet, we further design a novel instance consistency regularization framework to generate and leverage high-quality instance pseudo labels. Experimental results on multiple large-scale datasets show that the InsTeacher3D significantly outperforms prior state-of-the-art semi-supervised approaches. Code is available: https://github.com/W1zheng/InsTeacher3D.

Create account to get full access

Introduction

This post provides a detailed guide on how to use the IEEEtran LaTeX templates. The IEEEtran templates are a set of LaTeX document classes developed by Michael Shell for formatting papers to be published in IEEE (Institute of Electrical and Electronics Engineers) journals and conferences. These templates ensure that your paper adheres to the IEEE's formatting requirements, making the submission process smoother.

The Design, Intent and Limitations of the Templates

The IEEEtran templates are designed to handle the complex formatting requirements of IEEE publications, including specific margins, font sizes, and section layouts. The templates are intended to save authors time and effort by automating these formatting tasks, allowing them to focus on the content of their paper.

However, the templates do have some limitations. They may not be suitable for all types of publications or for authors who require a high degree of customization. Additionally, the templates may not be compatible with all LaTeX packages or configurations, and users may need to troubleshoot compatibility issues.

Using the IEEEtran Templates

To use the IEEEtran templates, authors need to download the necessary files from the IEEE website or other online repositories. The templates typically include a LaTeX class file (e.g., IEEEtran.cls) and supporting files, such as bibliography style files.

Once the templates are downloaded, authors can incorporate them into their LaTeX project by adding the appropriate \documentclass command at the beginning of their document. The templates provide various options and commands that authors can use to customize the appearance and layout of their paper, such as adjusting the font size, column layout, and bibliography style.

Conclusion

The IEEEtran LaTeX templates are a valuable resource for authors who need to publish their work in IEEE journals and conferences. By automating the formatting process, the templates can save time and help ensure that the paper adheres to the IEEE's strict requirements. However, users should be aware of the templates' limitations and be prepared to troubleshoot any compatibility issues that may arise.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🖼️

Semi-supervised Medical Image Segmentation via Geometry-aware Consistency Training

Zihang Liu, Chunhui Zhao

The performance of supervised deep learning methods for medical image segmentation is often limited by the scarcity of labeled data. As a promising research direction, semi-supervised learning addresses this dilemma by leveraging unlabeled data information to assist the learning process. In this paper, a novel geometry-aware semi-supervised learning framework is proposed for medical image segmentation, which is a consistency-based method. Considering that the hard-to-segment regions are mainly located around the object boundary, we introduce an auxiliary prediction task to learn the global geometric information. Based on the geometric constraint, the ambiguous boundary regions are emphasized through an exponentially weighted strategy for the model training to better exploit both labeled and unlabeled data. In addition, a dual-view network is designed to perform segmentation from different perspectives and reduce the prediction uncertainty. The proposed method is evaluated on the public left atrium benchmark dataset and improves fully supervised method by 8.7% in Dice with 10% labeled images, while 4.3% with 20% labeled images. Meanwhile, our framework outperforms six state-of-the-art semi-supervised segmentation methods.

5/13/2024

eess.IV cs.CV

🤷

UnScene3D: Unsupervised 3D Instance Segmentation for Indoor Scenes

David Rozenberszki, Or Litany, Angela Dai

3D instance segmentation is fundamental to geometric understanding of the world around us. Existing methods for instance segmentation of 3D scenes rely on supervision from expensive, manual 3D annotations. We propose UnScene3D, the first fully unsupervised 3D learning approach for class-agnostic 3D instance segmentation of indoor scans. UnScene3D first generates pseudo masks by leveraging self-supervised color and geometry features to find potential object regions. We operate on a basis of geometric oversegmentation, enabling efficient representation and learning on high-resolution 3D data. The coarse proposals are then refined through self-training our model on its predictions. Our approach improves over state-of-the-art unsupervised 3D instance segmentation methods by more than 300% Average Precision score, demonstrating effective instance segmentation even in challenging, cluttered 3D scenes.

5/1/2024

cs.CV

🔎

Reliable Student: Addressing Noise in Semi-Supervised 3D Object Detection

Farzad Nozarian, Shashank Agarwal, Farzaneh Rezaeianaran, Danish Shahzad, Atanas Poibrenski, Christian Muller, Philipp Slusallek

Semi-supervised 3D object detection can benefit from the promising pseudo-labeling technique when labeled data is limited. However, recent approaches have overlooked the impact of noisy pseudo-labels during training, despite efforts to enhance pseudo-label quality through confidence-based filtering. In this paper, we examine the impact of noisy pseudo-labels on IoU-based target assignment and propose the Reliable Student framework, which incorporates two complementary approaches to mitigate errors. First, it involves a class-aware target assignment strategy that reduces false negative assignments in difficult classes. Second, it includes a reliability weighting strategy that suppresses false positive assignment errors while also addressing remaining false negatives from the first step. The reliability weights are determined by querying the teacher network for confidence scores of the student-generated proposals. Our work surpasses the previous state-of-the-art on KITTI 3D object detection benchmark on point clouds in the semi-supervised setting. On 1% labeled data, our approach achieves a 6.2% AP improvement for the pedestrian class, despite having only 37 labeled samples available. The improvements become significant for the 2% setting, achieving 6.0% AP and 5.7% AP improvements for the pedestrian and cyclist classes, respectively.

4/30/2024

cs.CV cs.LG

Consistency Regularisation for Unsupervised Domain Adaptation in Monocular Depth Estimation

Amir El-Ghoussani, Julia Hornauer, Gustavo Carneiro, Vasileios Belagiannis

In monocular depth estimation, unsupervised domain adaptation has recently been explored to relax the dependence on large annotated image-based depth datasets. However, this comes at the cost of training multiple models or requiring complex training protocols. We formulate unsupervised domain adaptation for monocular depth estimation as a consistency-based semi-supervised learning problem by assuming access only to the source domain ground truth labels. To this end, we introduce a pairwise loss function that regularises predictions on the source domain while enforcing perturbation consistency across multiple augmented views of the unlabelled target samples. Importantly, our approach is simple and effective, requiring only training of a single model in contrast to the prior work. In our experiments, we rely on the standard depth estimation benchmarks KITTI and NYUv2 to demonstrate state-of-the-art results compared to related approaches. Furthermore, we analyse the simplicity and effectiveness of our approach in a series of ablation studies. The code is available at url{https://github.com/AmirMaEl/SemiSupMDE}.

5/29/2024

cs.CV