StreetSurfaceVis: a dataset of crowdsourced street-level imagery with semi-automated annotations of road surface type and quality

Read original: arXiv:2407.21454 - Published 9/26/2024 by Alexandra Kapp, Edith Hoffmann, Esther Weigmann, Helena Mihaljevi'c

StreetSurfaceVis: a dataset of crowdsourced street-level imagery with semi-automated annotations of road surface type and quality

Overview

The paper presents a dataset called StreetSurfaceVis, which contains crowdsourced street-level imagery with semi-automated annotations of road surface type and quality.
The dataset was created to enable research on computer vision techniques for assessing road conditions, which has applications in urban planning, transportation, and autonomous driving.
The paper describes the process of constructing the dataset, including data collection, annotation, and quality control.

Plain English Explanation

The StreetSurfaceVis dataset is a collection of street-level photos that have been labeled with information about the type and condition of the road surface. This dataset was created to help researchers develop computer vision techniques that can automatically assess the state of roads, which could be useful for things like urban planning, transportation management, and self-driving cars.

To build the dataset, the researchers collected a large number of street-level photos from crowdsourced sources. They then used a semi-automated process to label each photo with details about the road surface, such as whether it's made of concrete, asphalt, or something else, and how well-maintained it is. This labeling process involved both human annotators and automated algorithms to ensure the accuracy of the annotations.

The dataset construction process involved several key steps:

Collecting the street-level imagery from various crowdsourced sources
Developing a set of guidelines and training data for annotating the road surface type and quality
Recruiting human annotators to label the images, with quality control measures in place
Applying automated computer vision models to further refine the annotations

By creating this comprehensive dataset, the researchers hope to enable other scientists and engineers to develop more advanced computer vision systems for assessing road conditions. This could lead to improvements in urban planning, transportation infrastructure maintenance, and the capabilities of autonomous vehicles.

Technical Explanation

The dataset construction process for StreetSurfaceVis involved several key steps:

Data Collection: The researchers collected street-level imagery from various crowdsourced sources, including Google Street View, Mapillary, and their own custom data collection efforts. This resulted in a diverse set of images representing different road types, locations, and environmental conditions.
Annotation Guidelines and Training Data: The team developed a set of detailed guidelines for annotating the road surface type (e.g., asphalt, concrete, gravel) and quality (e.g., smooth, rough, potholed). They also created a training dataset of manually annotated images to serve as a reference for the human annotators.
Human Annotation: The researchers recruited a team of human annotators to label the road surface type and quality for each image in the dataset. To ensure high-quality annotations, they implemented quality control measures, such as having multiple annotators review each image and providing feedback and additional training as needed.
Automated Refinement: After the initial human annotation, the researchers applied computer vision models to further refine the annotations. This semi-automated approach allowed them to leverage the strengths of both human and machine intelligence to produce a high-quality dataset.

The resulting StreetSurfaceVis dataset contains over 100,000 street-level images with detailed annotations of road surface type and quality. This comprehensive dataset is intended to enable researchers to develop more advanced computer vision techniques for assessing road conditions, which could have a range of applications in urban planning, transportation infrastructure management, and autonomous driving.

Critical Analysis

The StreetSurfaceVis dataset represents a valuable contribution to the field of computer vision for road assessment. The authors have thoughtfully addressed several key challenges in creating a high-quality, annotated dataset, including the need for comprehensive data collection, clear annotation guidelines, and a robust quality control process.

One potential limitation of the dataset is the reliance on semi-automated annotation, which may introduce some degree of inconsistency or bias compared to a fully manual approach. However, the authors have acknowledged this challenge and have taken steps to mitigate it through the use of multiple human annotators and automated refinement.

Additionally, the dataset may not capture the full range of road conditions and environments, as it is primarily focused on urban and suburban areas. Expanding the dataset to include more diverse road types and locations, such as rural or unpaved roads, could further enhance its utility for a wider range of applications.

Overall, the StreetSurfaceVis dataset represents an important step forward in enabling research on computer vision techniques for assessing road conditions. By making this dataset publicly available, the authors have opened up new opportunities for collaboration and advancement in this critical field.

Conclusion

The StreetSurfaceVis dataset presents a comprehensive collection of street-level imagery with semi-automated annotations of road surface type and quality. This dataset has the potential to enable significant progress in the development of computer vision techniques for assessing road conditions, which could have far-reaching implications for urban planning, transportation infrastructure management, and the advancement of autonomous driving technology.

By carefully addressing the challenges of data collection, annotation, and quality control, the researchers have created a valuable resource for the research community. As the field continues to evolve, the StreetSurfaceVis dataset may serve as a foundation for further innovation and discovery, ultimately contributing to the betterment of our transportation systems and the built environment.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

StreetSurfaceVis: a dataset of crowdsourced street-level imagery with semi-automated annotations of road surface type and quality

Alexandra Kapp, Edith Hoffmann, Esther Weigmann, Helena Mihaljevi'c

Road unevenness significantly impacts the safety and comfort of traffic participants, especially vulnerable groups such as cyclists and wheelchair users. To train models for comprehensive road surface assessments, we introduce StreetSurfaceVis, a novel dataset comprising 9,122 street-level images mostly from Germany collected from a crowdsourcing platform and manually annotated by road surface type and quality. By crafting a heterogeneous dataset, we aim to enable robust models that maintain high accuracy across diverse image sources. As the frequency distribution of road surface types and qualities is highly imbalanced, we propose a sampling strategy incorporating various external label prediction resources to ensure sufficient images per class while reducing manual annotation. More precisely, we estimate the impact of (1) enriching the image data with OpenStreetMap tags, (2) iterative training and application of a custom surface type classification model, (3) amplifying underrepresented classes through prompt-based classification with GPT-4o and (4) similarity search using image embeddings. Combining these strategies effectively reduces manual annotation workload while ensuring sufficient class representation.

9/26/2024

📉

SurfaceAI: Automated creation of cohesive road surface quality datasets based on open street-level imagery

Alexandra Kapp, Edith Hoffmann, Esther Weigmann, Helena Mihaljevi'c

This paper introduces SurfaceAI, a pipeline designed to generate comprehensive georeferenced datasets on road surface type and quality from openly available street-level imagery. The motivation stems from the significant impact of road unevenness on the safety and comfort of traffic participants, especially vulnerable road users, emphasizing the need for detailed road surface data in infrastructure modeling and analysis. SurfaceAI addresses this gap by leveraging crowdsourced Mapillary data to train models that predict the type and quality of road surfaces visible in street-level images, which are then aggregated to provide cohesive information on entire road segment conditions.

9/30/2024

🏷️

Improving classification of road surface conditions via road area extraction and contrastive learning

Linh Trinh, Ali Anwar, Siegfried Mercelis

Maintaining roads is crucial to economic growth and citizen well-being because roads are a vital means of transportation. In various countries, the inspection of road surfaces is still done manually, however, to automate it, research interest is now focused on detecting the road surface defects via the visual data. While, previous research has been focused on deep learning methods which tend to process the entire image and leads to heavy computational cost. In this study, we focus our attention on improving the classification performance while keeping the computational cost of our solution low. Instead of processing the whole image, we introduce a segmentation model to only focus the downstream classification model to the road surface in the image. Furthermore, we employ contrastive learning during model training to improve the road surface condition classification. Our experiments on the public RTK dataset demonstrate a significant improvement in our proposed method when compared to previous works.

7/22/2024

A citizen science toolkit to collect human perceptions of urban environments using open street view images

Matthew Danish, SM Labib, Britta Ricker, Marco Helbich

Street View Imagery (SVI) is a valuable data source for studies (e.g., environmental assessments, green space identification or land cover classification). While commercial SVI is available, such providers commonly restrict copying or reuse in ways necessary for research. Open SVI datasets are readily available from less restrictive sources, such as Mapillary, but due to the heterogeneity of the images, these require substantial preprocessing, filtering, and careful quality checks. We present an efficient method for automated downloading, processing, cropping, and filtering open SVI, to be used in a survey of human perceptions of the streets portrayed in these images. We demonstrate our open-source reusable SVI preparation and smartphone-friendly perception-survey software with Amsterdam (Netherlands) as the case study. Using a citizen science approach, we collected from 331 people 22,637 ratings about their perceptions for various criteria. We have published our software in a public repository for future re-use and reproducibility.

6/4/2024