CurbNet: Curb Detection Framework Based on LiDAR Point Cloud Segmentation

Read original: arXiv:2403.16794 - Published 5/31/2024 by Guoyang Zhao, Fulong Ma, Weiqing Qi, Yuxuan Liu, Ming Liu

CurbNet: Curb Detection Framework Based on LiDAR Point Cloud Segmentation

Overview

The paper presents CurbNet, a framework for detecting curbs from LiDAR point cloud data, which is an important task for autonomous driving and urban planning.
The framework uses a deep learning-based segmentation approach to classify points as curb or non-curb, leveraging both spatial and semantic features.
The authors evaluate CurbNet on several public datasets and demonstrate its superior performance compared to other state-of-the-art methods.

Plain English Explanation

In the world of autonomous driving and smart city planning, being able to accurately detect curbs is a crucial task. Curbs are the raised edges that separate sidewalks from roads, and they provide important information about the layout and structure of urban environments.

The researchers behind CurbNet have developed a new framework that uses advanced machine learning techniques to identify curbs from 3D LiDAR point cloud data. LiDAR is a technology that uses laser beams to create detailed 3D maps of the surrounding environment, and it's a crucial sensor for autonomous vehicles.

CurbNet works by taking the raw LiDAR data and using a deep learning-based segmentation approach to classify each individual point as either belonging to a curb or not. This allows the framework to accurately identify the exact location and shape of curbs in the environment.

The researchers tested CurbNet on several different public datasets and found that it outperformed other state-of-the-art methods for curb detection. This means CurbNet is better able to identify curbs accurately, which is an important capability for applications like autonomous navigation and urban traffic management.

Technical Explanation

The CurbNet framework uses a deep learning-based segmentation approach to detect curbs from LiDAR point cloud data. The key components of the framework are:

Spatial Feature Extraction: The first step is to extract spatial features from the raw LiDAR point cloud data. This involves analyzing the 3D geometry and distribution of the points to identify characteristics that are indicative of curbs.
Semantic Feature Extraction: In addition to the spatial features, the framework also extracts semantic features from the point cloud. This includes information about the type of surface (e.g., road, sidewalk) and the surrounding environment.
Point-wise Classification: The spatial and semantic features are then fed into a deep neural network that classifies each individual point as either curb or non-curb. This segmentation approach allows CurbNet to precisely locate the boundaries of curbs in the environment.
Curb Extraction: After the point-wise classification, the framework applies post-processing techniques to extract the final curb locations and shapes from the segmented point cloud.

The authors evaluated CurbNet on several public datasets, including the nuScenes and KITTI datasets. They compared its performance to other state-of-the-art curb detection methods and demonstrated that CurbNet achieves significantly higher accuracy in identifying curbs.

Critical Analysis

The researchers acknowledge several limitations of their work. First, the performance of CurbNet may be affected by the quality and density of the input LiDAR data, as lower-quality or sparser point clouds could make it more challenging to accurately extract the necessary spatial and semantic features.

Additionally, the framework currently relies on a single-frame approach, which may not be able to effectively leverage temporal information from sequential LiDAR scans. Incorporating temporal consistency could potentially improve the robustness and accuracy of curb detection.

Another area for potential improvement is the post-processing step for extracting the final curb locations. The authors note that this step could be made more efficient and accurate by incorporating more advanced techniques, such as curve fitting or boundary tracing algorithms.

Overall, the CurbNet framework represents a promising advance in the field of curb detection, but there are still opportunities for further research and refinement to address the limitations mentioned above.

Conclusion

The CurbNet framework presented in this paper offers a novel approach to detecting curbs from LiDAR point cloud data, a crucial task for autonomous driving and urban planning applications. By leveraging both spatial and semantic features, the framework demonstrates superior performance compared to other state-of-the-art methods.

While the current implementation has some limitations, the researchers have shown the potential of deep learning-based segmentation techniques for accurate curb detection. Continued advancements in this area could lead to more robust and reliable systems for understanding the complex 3D environments in which autonomous vehicles and smart city infrastructure operate.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

CurbNet: Curb Detection Framework Based on LiDAR Point Cloud Segmentation

Guoyang Zhao, Fulong Ma, Weiqing Qi, Yuxuan Liu, Ming Liu

Curb detection is a crucial function in intelligent driving, essential for determining drivable areas on the road. However, the complexity of road environments makes curb detection challenging. This paper introduces CurbNet, a novel framework for curb detection utilizing point cloud segmentation. To address the lack of comprehensive curb datasets with 3D annotations, we have developed the 3D-Curb dataset based on SemanticKITTI, currently the largest and most diverse collection of curb point clouds. Recognizing that the primary characteristic of curbs is height variation, our approach leverages spatially rich 3D point clouds for training. To tackle the challenges posed by the uneven distribution of curb features on the xy-plane and their dependence on high-frequency features along the z-axis, we introduce the Multi-Scale and Channel Attention (MSCA) module, a customized solution designed to optimize detection performance. Additionally, we propose an adaptive weighted loss function group specifically formulated to counteract the imbalance in the distribution of curb point clouds relative to other categories. Extensive experiments conducted on 2 major datasets demonstrate that our method surpasses existing benchmarks set by leading curb detection and point cloud segmentation models. Through the post-processing refinement of the detection results, we have significantly reduced noise in curb detection, thereby improving precision by 4.5 points. Similarly, our tolerance experiments also achieved state-of-the-art results. Furthermore, real-world experiments and dataset analyses mutually validate each other, reinforcing CurbNet's superior detection capability and robust generalizability. The project website is available at: https://github.com/guoyangzhao/CurbNet/.

5/31/2024

👀

Can Foundation Models Reliably Identify Spatial Hazards? A Case Study on Curb Segmentation

Diwei Sheng, Giles Hamilton-Fletcher, Mahya Beheshti, Chen Feng, John-Ross Rizzo

Curbs serve as vital borders that delineate safe pedestrian zones from potential vehicular traffic hazards. Curbs also represent a primary spatial hazard during dynamic navigation with significant stumbling potential. Such vulnerabilities are particularly exacerbated for persons with blindness and low vision (PBLV). Accurate visual-based discrimination of curbs is paramount for assistive technologies that aid PBLV with safe navigation in urban environments. Herein, we investigate the efficacy of curb segmentation for foundation models. We introduce the largest curb segmentation dataset to-date to benchmark leading foundation models. Our results show that state-of-the-art foundation models face significant challenges in curb segmentation. This is due to their high false-positive rates (up to 95%) with poor performance distinguishing curbs from curb-like objects or non-curb areas, such as sidewalks. In addition, the best-performing model averaged a 3.70-second inference time, underscoring problems in providing real-time assistance. In response, we propose solutions including filtered bounding box selections to achieve more accurate curb segmentation. Overall, despite the immediate flexibility of foundation models, their application for practical assistive technology applications still requires refinement. This research highlights the critical need for specialized datasets and tailored model training to address navigation challenges for PBLV and underscores implicit weaknesses in foundation models.

6/12/2024

👀

Multi-faceted Sensory Substitution for Curb Alerting: A Pilot Investigation in Persons with Blindness and Low Vision

Ligao Ruan, Giles Hamilton-Fletcher, Mahya Beheshti, Todd E Hudson, Maurizio Porfiri, JR Rizzo

Curbs -- the edge of a raised sidewalk at the point where it meets a street -- crucial in urban environments where they help delineate safe pedestrian zones, from dangerous vehicular lanes. However, curbs themselves are significant navigation hazards, particularly for people who are blind or have low vision (pBLV). The challenges faced by pBLV in detecting and properly orientating themselves for these abrupt elevation changes can lead to falls and serious injuries. Despite recent advancements in assistive technologies, the detection and early warning of curbs remains a largely unsolved challenge. This paper aims to tackle this gap by introducing a novel, multi-faceted sensory substitution approach hosted on a smart wearable; the platform leverages an RGB camera and an embedded system to capture and segment curbs in real time and provide early warning and orientation information. The system utilizes YOLO (You Only Look Once) v8 segmentation model, trained on our custom curb dataset for the camera input. The output of the system consists of adaptive auditory beeps, abstract sonification, and speech, conveying information about the relative distance and orientation of curbs. Through human-subjects experimentation, we demonstrate the effectiveness of the system as compared to the white cane. Results show that our system can provide advanced warning through a larger safety window than the cane, while offering nearly identical curb orientation information.

8/29/2024

GroundGrid:LiDAR Point Cloud Ground Segmentation and Terrain Estimation

Nicolai Steinke, Daniel Gohring, Ra`ul Rojas

The precise point cloud ground segmentation is a crucial prerequisite of virtually all perception tasks for LiDAR sensors in autonomous vehicles. Especially the clustering and extraction of objects from a point cloud usually relies on an accurate removal of ground points. The correct estimation of the surrounding terrain is important for aspects of the drivability of a surface, path planning, and obstacle prediction. In this article, we propose our system GroundGrid which relies on 2D elevation maps to solve the terrain estimation and point cloud ground segmentation problems. We evaluate the ground segmentation and terrain estimation performance of GroundGrid and compare it to other state-of-the-art methods using the SemanticKITTI dataset and a novel evaluation method relying on airborne LiDAR scanning. The results show that GroundGrid is capable of outperforming other state-of-the-art systems with an average IoU of 94.78% while maintaining a high run-time performance of 171Hz. The source code is available at https://github.com/dcmlr/groundgrid

5/27/2024