Design Considerations for Automatic Musical Soundscapes of Visual Art for People with Blindness or Low Vision

Read original: arXiv:2405.14188 - Published 5/24/2024 by Stephen James Krol, Maria Teresa Llano, Matthew Butler, Cagatay Goncu

Overview

  • This paper explores the use of automated soundscapes to enhance the accessibility of visual art for people who are blind or have low vision (BLV).
  • Composing music and designing soundscapes for visual art by hand is time-consuming and resource-intensive, which limits how well the approach scales to large exhibitions.
  • The researchers built a prototype system and conducted a qualitative study to evaluate the aesthetic experience provided by the automated soundscapes with 10 BLV participants.
  • The study identified a set of design considerations that reveal requirements from BLV people for the development of automated soundscape systems, setting new directions for enriching the aesthetic experience conveyed by these systems.

Plain English Explanation

The paper investigates using automated soundscapes to make visual art more accessible for people who are blind or have low vision (BLV). Music has been identified as a promising medium for enhancing how BLV individuals experience visual art. However, composing music and designing soundscapes manually is time-consuming and resource-intensive, which limits their use in large art exhibitions.

The researchers built a prototype system that can automatically generate soundscapes to accompany visual art. They then conducted a study with 10 BLV participants to understand their aesthetic experience with these automated soundscapes. From the study, the researchers identified key design considerations that BLV people have for these types of systems. This provides guidance on how creative systems could be developed to better enrich the aesthetic experience of BLV individuals engaging with visual art.

Technical Explanation

The prototype system generates soundscapes for visual artworks automatically, without a human composer or sound designer in the loop. The researchers evaluated it in a qualitative study with 10 BLV participants, focusing on the aesthetic experience the automated soundscapes provided.

The study involved having the participants interact with the prototype system and provide feedback on their experience. The researchers used this feedback to identify a set of design considerations that reveal requirements from BLV people for the development of automated soundscape systems. These design considerations include factors like the use of spatial audio to enhance the sense of immersion, the incorporation of meaningful, context-aware sounds that connect with the visual art, and the ability to customize the soundscapes to individual preferences.
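
As a purely illustrative sketch of two of these considerations (spatial audio and per-listener customization), the snippet below maps the horizontal position of a region in an artwork to left and right speaker gains using constant-power panning. It is not the authors' prototype, which this summary does not describe at the implementation level; the names `Preferences`, `master_volume`, `spatial_width`, and `stereo_gains` are hypothetical and introduced only for this example.

```python
import math
from dataclasses import dataclass


@dataclass
class Preferences:
    """Hypothetical listener settings (an assumption, not from the paper)."""
    master_volume: float = 1.0  # 0.0 (silent) to 1.0 (full level)
    spatial_width: float = 1.0  # 0.0 (everything centered) to 1.0 (full spread)


def stereo_gains(x_norm: float, prefs: Preferences) -> tuple[float, float]:
    """Return (left, right) gains for a sound tied to an image region.

    x_norm is the region's horizontal center, from 0.0 (left edge of the
    artwork) to 1.0 (right edge). Constant-power panning keeps
    left**2 + right**2 equal to master_volume**2 at every position.
    """
    if not 0.0 <= x_norm <= 1.0:
        raise ValueError("x_norm must be between 0.0 and 1.0")
    # Pull the position toward the center when the listener narrows the width.
    position = 0.5 + (x_norm - 0.5) * prefs.spatial_width
    angle = position * math.pi / 2  # 0 = hard left, pi/2 = hard right
    return (prefs.master_volume * math.cos(angle),
            prefs.master_volume * math.sin(angle))
```

With default preferences, a region at the far left edge (`x_norm=0.0`) is heard almost entirely in the left channel, while setting `spatial_width=0.0` places every sound in the center regardless of where it sits in the image. A real system would likely need richer spatialization (for example, vertical position or depth), which this sketch leaves out.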

The insights from this study provide valuable guidance on how creative systems could be developed to enrich the aesthetic experience conveyed by automated soundscapes for BLV individuals engaging with visual art.

Critical Analysis

The paper presents a promising approach to enhancing the accessibility of visual art for BLV individuals. However, the study was limited to a small sample size of 10 participants, and the researchers acknowledge that further research is needed to validate the findings and explore the long-term effects of using automated soundscapes.

Additionally, the paper does not address the potential challenges of achieving high-quality, contextually relevant soundscapes through automation. There may be limitations in the ability of current AI and generative music systems to create truly meaningful and immersive soundscapes that seamlessly integrate with the visual art. Further research may be needed to explore the capabilities and limitations of these technologies.

Overall, the paper offers valuable insights and sets the stage for continued exploration of automated soundscapes as a means of improving accessibility and aesthetic experience for BLV individuals engaging with visual art.

Conclusion

This paper investigates the use of automated soundscapes to enhance the accessibility and experience of visual art for people who are blind or have low vision. The researchers built a prototype system and conducted a qualitative study to understand the design considerations for these types of systems from the perspective of BLV individuals.

The key insights from the study provide guidance on how creative systems could be developed to enrich the aesthetic experience conveyed by automated soundscapes, such as the importance of spatial audio, context-aware sounds, and customization. While further research is needed, this work is a step towards making visual art more inclusive and accessible for BLV individuals through automatically generated soundscapes.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Design Considerations for Automatic Musical Soundscapes of Visual Art for People with Blindness or Low Vision

Stephen James Krol, Maria Teresa Llano, Matthew Butler, Cagatay Goncu

Music has been identified as a promising medium to enhance the accessibility and experience of visual art for people who are blind or have low vision (BLV). However, composing music and designing soundscapes for visual art is a time-consuming, resource intensive process - limiting its scalability for large exhibitions. In this paper, we investigate the use of automated soundscapes to increase the accessibility of visual art. We built a prototype system and ran a qualitative study to evaluate the aesthetic experience provided by the automated soundscapes with 10 BLV participants. From the study, we identified a set of design considerations that reveal requirements from BLV people for the development of automated soundscape systems, setting new directions in which creative systems could enrich the aesthetic experience conveyed by these.

5/24/2024

Engaging with Children's Artwork in Mixed Visual-Ability Families

Arnavi Chheda-Kothary, Jacob O. Wobbrock, Jon E. Froehlich

We present two studies exploring how blind or low-vision (BLV) family members engage with their sighted children's artwork, strategies to support understanding and interpretation, and the potential role of technology, such as AI, therein. Our first study involved 14 BLV individuals, and the second included five groups of BLV individuals with their children. Through semi-structured interviews with AI descriptions of children's artwork and multi-sensory design probes, we found that BLV family members value artwork engagement as a bonding opportunity, preferring the child's storytelling and interpretation over other nonvisual representations. Additionally, despite some inaccuracies, BLV family members felt that AI-generated descriptions could facilitate dialogue with their children and aid self-guided art discovery. We close with specific design considerations for supporting artwork engagement in mixed visual-ability families, including enabling artwork access through various methods, supporting children's corrections of AI output, and distinctions in context vs. content and interpretation vs. description of children's artwork.

7/31/2024

Soundscape Captioning using Sound Affective Quality Network and Large Language Model

Yuanbo Hou, Qiaoqiao Ren, Andrew Mitchell, Wenwu Wang, Jian Kang, Tony Belpaeme, Dick Botteldooren

We live in a rich and varied acoustic world, which is experienced by individuals or communities as a soundscape. Computational auditory scene analysis, disentangling acoustic scenes by detecting and classifying events, focuses on objective attributes of sounds, such as their category and temporal characteristics, ignoring the effect of sounds on people and failing to explore the relationship between sounds and the emotions they evoke within a context. To fill this gap and to automate soundscape analysis, which traditionally relies on labour-intensive subjective ratings and surveys, we propose the soundscape captioning (SoundSCap) task. SoundSCap generates context-aware soundscape descriptions by capturing the acoustic scene, event information, and the corresponding human affective qualities. To this end, we propose an automatic soundscape captioner (SoundSCaper) composed of an acoustic model, SoundAQnet, and a general large language model (LLM). SoundAQnet simultaneously models multi-scale information about acoustic scenes, events, and perceived affective qualities, while LLM generates soundscape captions by parsing the information captured by SoundAQnet to a common language. The soundscape caption's quality is assessed by a jury of 16 audio/soundscape experts. The average score (out of 5) of SoundSCaper-generated captions is lower than the score of captions generated by two soundscape experts by 0.21 and 0.25, respectively, on the evaluation set and the model-unknown mixed external dataset with varying lengths and acoustic properties, but the differences are not statistically significant. Overall, SoundSCaper-generated captions show promising performance compared to captions annotated by soundscape experts. The models' code, LLM scripts, human assessment data and instructions, and expert evaluation statistics are all publicly available.

6/11/2024

Automating Urban Soundscape Enhancements with AI: In-situ Assessment of Quality and Restorativeness in Traffic-Exposed Residential Areas

Bhan Lam, Zhen-Ting Ong, Kenneth Ooi, Wen-Hui Ong, Trevor Wong, Karn N. Watcharasupat, Vanessa Boey, Irene Lee, Joo Young Hong, Jian Kang, Kar Fye Alvin Lee, Georgios Christopoulos, Woon-Seng Gan

Formalized in ISO 12913, the soundscape approach is a paradigmatic shift towards perception-based urban sound management, aiming to alleviate the substantial socioeconomic costs of noise pollution to advance the United Nations Sustainable Development Goals. Focusing on traffic-exposed outdoor residential sites, we implemented an automatic masker selection system (AMSS) utilizing natural sounds to mask (or augment) traffic soundscapes. We employed a pre-trained AI model to automatically select the optimal masker and adjust its playback level, adapting to changes over time in the ambient environment to maximize Pleasantness, a perceptual dimension of soundscape quality in ISO 12913. Our validation study involving ($N=68$) residents revealed a significant 14.6 % enhancement in Pleasantness after intervention, correlating with increased restorativeness and positive affect. Perceptual enhancements at the traffic-exposed site matched those at a quieter control site with 6 dB(A) lower $L_\text{A,eq}$ and road traffic noise dominance, affirming the efficacy of AMSS as a soundscape intervention, while streamlining the labour-intensive assessment of Pleasantness with probabilistic AI prediction.

7/9/2024