Edge-guided and Class-balanced Active Learning for Semantic Segmentation of Aerial Images

2405.18078

Published 5/29/2024 by Lianlei Shan, Weiqiang Wang, Ke Lv, Bin Luo

Abstract

Semantic segmentation requires pixel-level annotation, which is time-consuming. Active Learning (AL) is a promising method for reducing data annotation costs. Because of the gap between aerial and natural images, previous AL methods are not ideal for aerial imagery, mainly because of unreasonable labeling units and the neglect of class imbalance. Previous labeling units are based on images or regions, which ignores two characteristics of segmentation tasks and aerial images: the segmentation network often makes mistakes in edge regions, and the edges in aerial images are often interlaced and irregular. Therefore, an edge-guided labeling unit is proposed to supplement the existing units. In addition, class imbalance is severe and shows up in two ways: aerial images themselves are seriously imbalanced, and the AL strategy does not fully consider class balance. Both seriously affect the performance of AL on aerial images. We ensure class balance at every step where imbalance may arise, including the initial labeled data, subsequently labeled data, and pseudo-labels. With these two improvements, our method achieves gains of more than 11.2% over state-of-the-art methods on three benchmark datasets, DeepGlobe, Potsdam, and Vaihingen, and gains of more than 18.6% over the baseline. Extensive ablation studies show that every module is indispensable. Furthermore, we establish a fair and strong benchmark for future research on AL for aerial image segmentation.

Overview

This paper proposes an edge-guided and class-balanced active learning approach for semantic segmentation of aerial images.
The method selects the most informative samples based on edge information and class distribution to efficiently annotate the dataset.
The proposed approach aims to improve the annotation efficiency and model performance compared to random sampling and existing active learning techniques.

Plain English Explanation

Aerial images, such as those captured by drones or satellites, often contain complex scenes with various objects like buildings, roads, vegetation, and more. Semantic segmentation is the process of automatically labeling each pixel in the image with its corresponding object class, which is crucial for applications like urban planning, agriculture, and disaster response.

However, annotating these aerial images can be time-consuming and expensive, as it requires human experts to carefully label every pixel. Active learning is a technique that aims to reduce the annotation effort by selectively choosing the most informative samples for annotation, rather than randomly sampling the entire dataset.

This paper introduces an "edge-guided and class-balanced" active learning approach, which means it focuses on selecting samples that have important edge information (like the boundaries between objects) and ensures a balanced representation of all object classes. By prioritizing these informative and diverse samples, the model can learn more efficiently and achieve better performance with fewer annotated samples.

The key idea is to use the model's predictions and confidence to identify the most valuable samples for annotation, rather than randomly selecting them. The authors show that this approach outperforms traditional random sampling and other active learning methods, leading to higher predictive accuracy with less annotation effort.
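To make the idea of ranking samples by model confidence concrete, here is a minimal sketch that scores each image by the mean per-pixel entropy of its softmax output. Entropy is a common uncertainty measure in active learning, but it is an assumption here: the summary does not state which uncertainty score the paper uses, and the function names `pixel_entropy` and `rank_by_uncertainty` are hypothetical.

```python
import numpy as np

def pixel_entropy(probs: np.ndarray) -> np.ndarray:
    """Per-pixel Shannon entropy of softmax outputs with shape (C, H, W)."""
    eps = 1e-12  # avoids log(0)
    return -np.sum(probs * np.log(probs + eps), axis=0)

def rank_by_uncertainty(prob_maps: list) -> list:
    """Return image indices sorted from most to least uncertain (mean entropy)."""
    scores = [float(pixel_entropy(p).mean()) for p in prob_maps]
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)

# Toy example: two 2-class "images" of size 2x2 (made-up probabilities).
confident = np.stack([np.full((2, 2), 0.99), np.full((2, 2), 0.01)])
uncertain = np.stack([np.full((2, 2), 0.5), np.full((2, 2), 0.5)])

# The uncertain image (index 1) is ranked ahead of the confident one (index 0).
order = rank_by_uncertainty([confident, uncertain])
```

In this toy case the image with 50/50 predictions would be sent to annotators first, because the model learns little from labels it can already predict with high confidence.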

Technical Explanation

The proposed method, called Edge-guided and Class-balanced Active Learning (EC-AL), consists of three main components:

Edge-guided Sample Selection: The method uses edge information in the images to decide what to label. Edges mark the boundaries between objects, which is where the segmentation network most often makes mistakes, so edge regions are the most likely to contain valuable information for the segmentation task.
Class-balanced Sampling: To ensure a balanced representation of all object classes, the method also considers the class distribution of the samples. It selects samples that can help improve the model's performance on underrepresented classes.
Iterative Annotation and Model Retraining: The selected samples are annotated by human experts, and the model is retrained on the updated dataset. This process is repeated iteratively until the desired performance is achieved or the annotation budget is exhausted.
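One selection round of the three components above might look like the following sketch. It is an illustration under stated assumptions, not the authors' implementation: edges are approximated from changes in the predicted label map, uncertainty is measured with entropy, and class balance is approximated with inverse-frequency weights computed from the labeled pool. All function names (`edge_mask`, `class_weights`, `score_candidate`, `select`) are hypothetical.

```python
import numpy as np

def edge_mask(labels: np.ndarray) -> np.ndarray:
    """Mark pixels that have a horizontal or vertical neighbour of a different class."""
    mask = np.zeros(labels.shape, dtype=bool)
    diff_h = labels[:, :-1] != labels[:, 1:]   # class change between columns
    diff_v = labels[:-1, :] != labels[1:, :]   # class change between rows
    mask[:, :-1] |= diff_h
    mask[:, 1:] |= diff_h
    mask[:-1, :] |= diff_v
    mask[1:, :] |= diff_v
    return mask

def class_weights(labeled_counts: np.ndarray) -> np.ndarray:
    """Inverse-frequency weights: classes that are rare in the labeled pool weigh more."""
    freq = labeled_counts / labeled_counts.sum()
    w = 1.0 / (freq + 1e-6)
    return w / w.sum()

def score_candidate(probs: np.ndarray, weights: np.ndarray) -> float:
    """Class-weighted mean entropy over the predicted edge pixels of one tile."""
    labels = probs.argmax(axis=0)
    entropy = -np.sum(probs * np.log(probs + 1e-12), axis=0)
    edges = edge_mask(labels)
    if not edges.any():
        return 0.0  # assumption: tiles without predicted edges get the lowest priority
    return float((entropy * weights[labels])[edges].mean())

def select(prob_maps, labeled_counts, budget):
    """Pick the `budget` highest-scoring tiles to send for annotation."""
    w = class_weights(np.asarray(labeled_counts, dtype=float))
    scores = [score_candidate(p, w) for p in prob_maps]
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:budget]

# Toy pool: tile 0 is uniformly class 0; tile 1 has a boundary between class 0 and class 1.
tile0 = np.stack([np.full((4, 4), 0.9), np.full((4, 4), 0.1)])
p0 = np.full((4, 4), 0.6)
p0[:, 2:] = 0.4
tile1 = np.stack([p0, 1.0 - p0])

# Class 1 is rare in the (made-up) labeled pool, so tile 1 is chosen.
chosen = select([tile0, tile1], labeled_counts=[900, 100], budget=1)
```

In a full loop, the chosen tiles would be annotated, added to the labeled pool, and the model retrained before the next round, until the annotation budget is exhausted.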

The authors evaluate their approach on three aerial image benchmark datasets (DeepGlobe, Potsdam, and Vaihingen) and compare it to random sampling and other active learning techniques, such as uncertainty-based and diversity-based methods. The results show that the proposed EC-AL method consistently outperforms the baselines: the abstract reports gains of more than 11.2% over state-of-the-art methods and more than 18.6% over the baseline.
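Segmentation accuracy in comparisons like this is commonly reported as mean intersection-over-union (mIoU). The summary does not name the metric used in the paper, so the following is only a sketch of how mIoU is typically computed from a confusion matrix; `mean_iou` is a hypothetical helper.

```python
import numpy as np

def mean_iou(pred: np.ndarray, target: np.ndarray, num_classes: int) -> float:
    """Mean IoU over the classes that appear in the prediction or the ground truth."""
    # Confusion matrix: rows are ground-truth classes, columns are predicted classes.
    idx = target.ravel() * num_classes + pred.ravel()
    conf = np.bincount(idx, minlength=num_classes ** 2).reshape(num_classes, num_classes)
    inter = np.diag(conf).astype(float)
    union = conf.sum(axis=0) + conf.sum(axis=1) - inter
    valid = union > 0  # skip classes absent from both maps
    return float((inter[valid] / union[valid]).mean())

# Toy 2x2 example with one mislabeled pixel.
target = np.array([[0, 0], [1, 1]])
pred = np.array([[0, 1], [1, 1]])
miou = mean_iou(pred, target, num_classes=2)  # IoU is 1/2 for class 0 and 2/3 for class 1
```

Because every class contributes equally to the average, mIoU penalizes poor performance on rare classes, which is one reason class balance matters for aerial imagery.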

Critical Analysis

The paper presents a well-designed and comprehensive approach to active learning for semantic segmentation of aerial images. The combination of edge-guided and class-balanced sampling is a novel and promising strategy that addresses the limitations of existing active learning techniques.

One potential limitation of the study is that it evaluates the method on only three aerial image datasets (DeepGlobe, Potsdam, and Vaihingen). It would be beneficial to test the approach on a wider range of datasets and scenarios to assess its generalizability and robustness.

Additionally, the paper does not provide much insight into the computational complexity and runtime performance of the proposed method. This information would be valuable for practitioners who need to consider the practical implementation and deployment of the technique.

Overall, the research makes a valuable contribution to the field of active learning for semantic segmentation, and the findings could have important implications for efficient annotation and model training in various applications, such as urban planning, agriculture, and disaster response.

Conclusion

This paper introduces an edge-guided and class-balanced active learning approach for semantic segmentation of aerial images. By prioritizing informative samples based on edge information and class distribution, the method can achieve higher segmentation accuracy with fewer annotated samples compared to random sampling and other active learning techniques.

The proposed approach has the potential to significantly streamline the annotation process and improve the efficiency of model training for a wide range of aerial image applications. The findings of this research could inspire further developments in active learning and efficient annotation strategies for complex computer vision tasks.

Related Papers

Active learning for efficient annotation in precision agriculture: a use-case on crop-weed semantic segmentation

Bart M. van Marrewijk, Charbel Dandjinou, Dan Jeric Arcega Rustia, Nicolas Franco Gonzalez, Boubacar Diallo, Jérôme Dias, Paul Melki, Pieter M. Blok

Optimizing deep learning models requires large amounts of annotated images, a process that is both time-intensive and costly, especially for semantic segmentation models in which every pixel must be annotated. A potential strategy to mitigate annotation effort is active learning. Active learning facilitates the identification and selection of the most informative images from a large unlabelled pool. The underlying premise is that these selected images can improve the model's performance faster than random selection, thereby reducing annotation effort. While active learning has demonstrated promising results on benchmark datasets like Cityscapes, its performance in the agricultural domain remains largely unexplored. This study addresses this research gap by conducting a comparative study of three active learning-based acquisition functions: Bayesian Active Learning by Disagreement (BALD), stochastic-based BALD (PowerBALD), and Random. The acquisition functions were tested on two agricultural datasets: Sugarbeet and Corn-Weed, both containing three semantic classes: background, crop and weed. Our results indicated that active learning, especially PowerBALD, yields a higher performance than Random sampling on both datasets. However, due to the relatively large standard deviations, the differences observed were minimal; this was partly caused by high image redundancy and imbalanced classes. Specifically, more than 89% of the pixels belonged to the background class on both datasets. The absence of significant results on both datasets indicates that further research is required for applying active learning on agricultural datasets, especially if they contain high class imbalance and redundant images. Recommendations and insights are provided in this paper to potentially resolve such issues.

4/4/2024

cs.CV cs.AI

An Active Learning Framework with a Class Balancing Strategy for Time Series Classification

Shemonto Das

Training machine learning models for classification tasks often requires labeling numerous samples, which is costly and time-consuming, especially in time series analysis. This research investigates Active Learning (AL) strategies to reduce the amount of labeled data needed for effective time series classification. Traditional AL techniques cannot control the selection of instances per class for labeling, leading to potential bias in classification performance and instance selection, particularly in imbalanced time series datasets. To address this, we propose a novel class-balancing instance selection algorithm integrated with standard AL strategies. Our approach aims to select more instances from classes with fewer labeled examples, thereby addressing imbalance in time series datasets. We demonstrate the effectiveness of our AL framework in selecting informative data samples for two distinct domains of tactile texture recognition and industrial fault detection. In robotics, our method achieves high-performance texture categorization while significantly reducing labeled training data requirements to 70%. We also evaluate the impact of different sliding window time intervals on robotic texture classification using AL strategies. In synthetic fiber manufacturing, we adapt AL techniques to address the challenge of fault classification, aiming to minimize data annotation cost and time for industries. We also address real-life class imbalances in the multiclass industrial anomalous dataset using our class-balancing instance algorithm integrated with AL strategies. Overall, this thesis highlights the potential of our AL framework across these two distinct domains.

5/21/2024

cs.LG

Predictive Accuracy-Based Active Learning for Medical Image Segmentation

Jun Shi, Shulan Ruan, Ziqi Zhu, Minfan Zhao, Hong An, Xudong Xue, Bing Yan

Active learning is considered a viable solution to alleviate the contradiction between the high dependency of deep learning-based segmentation methods on annotated data and the expensive pixel-level annotation cost of medical images. However, most existing methods suffer from unreliable uncertainty assessment and the struggle to balance diversity and informativeness, leading to poor performance in segmentation tasks. In response, we propose an efficient Predictive Accuracy-based Active Learning (PAAL) method for medical image segmentation, first introducing predictive accuracy to define uncertainty. Specifically, PAAL mainly consists of an Accuracy Predictor (AP) and a Weighted Polling Strategy (WPS). The former is an attached learnable module that can accurately predict the segmentation accuracy of unlabeled samples relative to the target model with the predicted posterior probability. The latter provides an efficient hybrid querying scheme by combining predicted accuracy and feature representation, aiming to ensure the uncertainty and diversity of the acquired samples. Extensive experiment results on multiple datasets demonstrate the superiority of PAAL. PAAL achieves comparable accuracy to fully annotated data while reducing annotation costs by approximately 50% to 80%, showcasing significant potential in clinical applications. The code is available at https://github.com/shijun18/PAAL-MedSeg.

5/2/2024

cs.CV

Enhancing Active Learning for Sentinel 2 Imagery through Contrastive Learning and Uncertainty Estimation

David Pogorzelski, Peter Arlinghaus

In this paper, we introduce a novel method designed to enhance label efficiency in satellite imagery analysis by integrating semi-supervised learning (SSL) with active learning strategies. Our approach utilizes contrastive learning together with uncertainty estimations via Monte Carlo Dropout (MC Dropout), with a particular focus on Sentinel-2 imagery analyzed using the Eurosat dataset. We explore the effectiveness of our method in scenarios featuring both balanced and unbalanced class distributions. Our results show that for unbalanced classes, our method is superior to the random approach, enabling significant savings in labeling effort while maintaining high classification accuracy. These findings highlight the potential of our approach to facilitate scalable and cost-effective satellite image analysis, particularly advantageous for extensive environmental monitoring and land use classification tasks. Note on preliminary results: This paper presents a new method for active learning and includes results from an initial experiment comparing random selection with our proposed method. We acknowledge that these results are preliminary. We are currently conducting further experiments and will update this paper with additional findings, including comparisons with other methods, in the coming weeks.

5/24/2024

cs.CV cs.LG