Automating Urban Soundscape Enhancements with AI: In-situ Assessment of Quality and Restorativeness in Traffic-Exposed Residential Areas

Read original: arXiv:2407.05744 - Published 7/9/2024 by Bhan Lam, Zhen-Ting Ong, Kenneth Ooi, Wen-Hui Ong, Trevor Wong, Karn N. Watcharasupat, Vanessa Boey, Irene Lee, Joo Young Hong, Jian Kang and 3 others

Automating Urban Soundscape Enhancements with AI: In-situ Assessment of Quality and Restorativeness in Traffic-Exposed Residential Areas

Overview

This paper explores the use of AI-powered techniques to enhance urban soundscapes and assess their quality and restorative potential in traffic-exposed residential areas.
The researchers developed a framework to automatically identify, classify, and modify environmental sounds, with the goal of improving the overall acoustic experience for residents.
The study involved in-situ evaluations of the modified soundscapes, measuring factors like perceived quality, restorativeness, and emotional responses.

Plain English Explanation

The paper looks at using AI to improve the sounds in urban neighborhoods that are exposed to a lot of traffic noise. The researchers created a system that can automatically identify, categorize, and change the environmental sounds around these areas. The goal was to enhance the overall acoustic experience for people living there.

To test their approach, the team conducted on-site evaluations of the modified soundscapes. They measured things like how much people enjoyed the new sounds, whether the sounds helped people relax and recover from stress, and how the sounds made people feel emotionally. This helped them understand if their AI-powered changes were actually improving the sound quality and having a positive impact on the residents.

Technical Explanation

The researchers developed a framework to automate urban soundscape enhancements using AI. Their system was designed to identify, classify, and modify environmental sounds in traffic-exposed residential areas, with the goal of improving the overall acoustic experience for residents.

The approach involved several key components:

Sound recognition and classification to identify different sound sources in the environment
Sound quality and affective modeling to assess how pleasing and restorative the sounds were perceived to be
Sound enhancement and synthesis to modify the soundscape by adding, removing, or transforming specific sounds

To evaluate the effectiveness of their system, the researchers conducted in-situ assessments in real-world traffic-exposed residential areas. They measured factors like perceived sound quality, restorativeness, and emotional responses to the modified soundscapes.

Critical Analysis

The paper presents a comprehensive approach to automating urban soundscape enhancements using AI, which could have significant implications for improving the quality of life in noise-polluted residential areas. However, the research also acknowledges several caveats and limitations that warrant further exploration.

For example, the in-situ evaluations were conducted over a relatively short time frame, and the long-term effects of the modified soundscapes on residents' well-being and behavior are still unclear. Additionally, the study focused on a limited number of residential areas, and the findings may not generalize to all urban environments with varying physical and cultural characteristics.

Furthermore, the paper does not address potential ethical concerns, such as the privacy implications of using AI-powered sound monitoring and modification in public spaces, or the potential for unintended consequences if the system is not carefully designed and implemented. Understanding pedestrian movement and behavior using urban sensing technologies could be an important consideration in this context.

Overall, the research presents an innovative approach to improving urban soundscapes, but further studies are needed to fully understand the long-term impacts and address potential ethical and practical challenges.

Conclusion

This paper demonstrates the potential of AI-powered techniques to enhance the soundscapes of traffic-exposed residential areas, with the goal of improving the overall acoustic experience and well-being of residents. The researchers developed a comprehensive framework to automatically identify, classify, and modify environmental sounds, and conducted in-situ evaluations to assess the quality and restorativeness of the modified soundscapes.

The findings suggest that this AI-powered approach can have a positive impact on residents' perceptions and emotional responses to their acoustic environment. However, the long-term effects, ethical considerations, and practical challenges of implementing such a system in diverse urban settings require further investigation.

As cities continue to grapple with the challenges of noise pollution and its impact on public health, the techniques outlined in this paper could serve as a valuable starting point for developing more sustainable and livable urban soundscapes that prioritize the needs and well-being of residents.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Automating Urban Soundscape Enhancements with AI: In-situ Assessment of Quality and Restorativeness in Traffic-Exposed Residential Areas

Bhan Lam, Zhen-Ting Ong, Kenneth Ooi, Wen-Hui Ong, Trevor Wong, Karn N. Watcharasupat, Vanessa Boey, Irene Lee, Joo Young Hong, Jian Kang, Kar Fye Alvin Lee, Georgios Christopoulos, Woon-Seng Gan

Formalized in ISO 12913, the soundscape approach is a paradigmatic shift towards perception-based urban sound management, aiming to alleviate the substantial socioeconomic costs of noise pollution to advance the United Nations Sustainable Development Goals. Focusing on traffic-exposed outdoor residential sites, we implemented an automatic masker selection system (AMSS) utilizing natural sounds to mask (or augment) traffic soundscapes. We employed a pre-trained AI model to automatically select the optimal masker and adjust its playback level, adapting to changes over time in the ambient environment to maximize Pleasantness, a perceptual dimension of soundscape quality in ISO 12913. Our validation study involving ($N=68$) residents revealed a significant 14.6 % enhancement in Pleasantness after intervention, correlating with increased restorativeness and positive affect. Perceptual enhancements at the traffic-exposed site matched those at a quieter control site with 6 dB(A) lower $L_text{A,eq}$ and road traffic noise dominance, affirming the efficacy of AMSS as a soundscape intervention, while streamlining the labour-intensive assessment of Pleasantness with probabilistic AI prediction.

7/9/2024

Soundscape Captioning using Sound Affective Quality Network and Large Language Model

Yuanbo Hou, Qiaoqiao Ren, Andrew Mitchell, Wenwu Wang, Jian Kang, Tony Belpaeme, Dick Botteldooren

We live in a rich and varied acoustic world, which is experienced by individuals or communities as a soundscape. Computational auditory scene analysis, disentangling acoustic scenes by detecting and classifying events, focuses on objective attributes of sounds, such as their category and temporal characteristics, ignoring the effect of sounds on people and failing to explore the relationship between sounds and the emotions they evoke within a context. To fill this gap and to automate soundscape analysis, which traditionally relies on labour-intensive subjective ratings and surveys, we propose the soundscape captioning (SoundSCap) task. SoundSCap generates context-aware soundscape descriptions by capturing the acoustic scene, event information, and the corresponding human affective qualities. To this end, we propose an automatic soundscape captioner (SoundSCaper) composed of an acoustic model, SoundAQnet, and a general large language model (LLM). SoundAQnet simultaneously models multi-scale information about acoustic scenes, events, and perceived affective qualities, while LLM generates soundscape captions by parsing the information captured by SoundAQnet to a common language. The soundscape caption's quality is assessed by a jury of 16 audio/soundscape experts. The average score (out of 5) of SoundSCaper-generated captions is lower than the score of captions generated by two soundscape experts by 0.21 and 0.25, respectively, on the evaluation set and the model-unknown mixed external dataset with varying lengths and acoustic properties, but the differences are not statistically significant. Overall, SoundSCaper-generated captions show promising performance compared to captions annotated by soundscape experts. The models' code, LLM scripts, human assessment data and instructions, and expert evaluation statistics are all publicly available.

6/11/2024

🧠

Extracting Urban Sound Information for Residential Areas in Smart Cities Using an End-to-End IoT System

Ee-Leng Tan, Furi Andi Karnapi, Linus Junjia Ng, Kenneth Ooi, Woon-Seng Gan

With rapid urbanization comes the increase of community, construction, and transportation noise in residential areas. The conventional approach of solely relying on sound pressure level (SPL) information to decide on the noise environment and to plan out noise control and mitigation strategies is inadequate. This paper presents an end-to-end IoT system that extracts real-time urban sound metadata using edge devices, providing information on the sound type, location and duration, rate of occurrence, loudness, and azimuth of a dominant noise in nine residential areas. The collected metadata on environmental sound is transmitted to and aggregated in a cloud-based platform to produce detailed descriptive analytics and visualization. Our approach to integrating different building blocks, namely, hardware, software, cloud technologies, and signal processing algorithms to form our real-time IoT system is outlined. We demonstrate how some of the sound metadata extracted by our system are used to provide insights into the noise in residential areas. A scalable workflow to collect and prepare audio recordings from nine residential areas to construct our urban sound dataset for training and evaluating a location-agnostic model is discussed. Some practical challenges of managing and maintaining a sensor network deployed at numerous locations are also addressed.

8/13/2024

🔄

Towards better visualizations of urban sound environments: insights from interviews

Modan Tailleur (LS2N), Pierre Aumond (UMRAE), Vincent Tourre (AAU), Mathieu Lagrange (LS2N)

Urban noise maps and noise visualizations traditionally provide macroscopic representations of noise levels across cities. However, those representations fail at accurately gauging the sound perception associated with these sound environments, as perception highly depends on the sound sources involved. This paper aims at analyzing the need for the representations of sound sources, by identifying the urban stakeholders for whom such representations are assumed to be of importance. Through spoken interviews with various urban stakeholders, we have gained insight into current practices, the strengths and weaknesses of existing tools and the relevance of incorporating sound sources into existing urban sound environment representations. Three distinct use of sound source representations emerged in this study: 1) noise-related complaints for industrials and specialized citizens, 2) soundscape quality assessment for citizens, and 3) guidance for urban planners. Findings also reveal diverse perspectives for the use of visualizations, which should use indicators adapted to the target audience, and enable data accessibility.

7/25/2024