Disability Representations: Finding Biases in Automatic Image Generation

Read original: arXiv:2406.14993 - Published 6/24/2024 by Yannis Tevissen

🖼️

Overview

Recent advancements in image generation technology have enabled widespread access to AI-generated imagery, used in many industries.
These technologies often perpetuate societal biases, including towards people with disabilities (PWD).
This study investigates the representation biases in popular text-to-image models towards PWD.

Plain English Explanation

Powerful AI models can now generate realistic images from text descriptions. These AI-generated images are being used more and more in advertising, entertainment, and other visual content. However, there is a concern that these AI models may be perpetuating harmful biases and stereotypes that exist in society.

This research looks specifically at how AI models depict people with disabilities. The researchers conducted a comprehensive experiment using several popular text-to-image models. They analyzed the generated images to see how disabilities were represented.

The results showed a significant bias, with most of the generated images portraying disabled individuals as old, sad, and predominantly using manual wheelchairs. This suggests the AI models have learned and replicated societal stereotypes about people with disabilities.

These findings highlight the urgent need for more inclusive AI development. The AI systems need to be trained on diverse data to ensure they can generate images that accurately and fairly represent people with disabilities. Addressing these biases is crucial for fostering equitable and realistic representations in AI-generated content.

Technical Explanation

The researchers conducted a comprehensive experiment to investigate representation biases towards people with disabilities (PWD) in popular text-to-image generation models. They analyzed the depiction of disability across several state-of-the-art text-to-image models.

The experiment involved prompting the models with text descriptions related to PWD, such as "a person in a wheelchair" or "a disabled person." The generated images were then carefully examined and categorized based on factors like body language, appearance, and assistive devices depicted.

The results revealed significant biases, with the majority of generated images portraying disabled individuals as old, sad, and predominantly using manual wheelchairs. This suggests the models have learned and reproduced societal stereotypes and biases towards PWD, rather than generating diverse and accurate representations.

These findings align with previous research on bias in text-based AI models and image-text retrieval systems. They also build upon studies on gender and racial biases in AI-generated faces.

Critical Analysis

The researchers acknowledge several limitations in their study. They note that the analysis was limited to a subset of popular text-to-image models, and that further research is needed to explore biases in a broader range of AI systems.

Additionally, the researchers did not investigate the underlying causes of the observed biases, such as the composition of the training data or the model architectures. Examining these factors could provide valuable insights for developing more inclusive AI systems.

While the study highlights the pervasive nature of disability biases in current text-to-image models, it remains to be seen whether these findings can be generalized to other types of AI-generated content, such as video or 3D models. Further research is needed to understand the broader implications of these biases.

Conclusion

This study's findings underscore the urgent need for more inclusive AI development practices. The significant biases towards people with disabilities observed in popular text-to-image models demonstrate the need to address these issues for the AI industry to foster equitable and realistic representations.

By focusing on diverse and accurate data collection, model training, and evaluation, AI developers can work towards mitigating these biases and ensuring that AI-generated content reflects the true diversity of our society. Addressing representation biases in AI is a crucial step towards building more inclusive and responsible technology.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🖼️

Disability Representations: Finding Biases in Automatic Image Generation

Yannis Tevissen

Recent advancements in image generation technology have enabled widespread access to AI-generated imagery, prominently used in advertising, entertainment, and progressively in every form of visual content. However, these technologies often perpetuate societal biases. This study investigates the representation biases in popular image generation models towards people with disabilities (PWD). Through a comprehensive experiment involving several popular text-to-image models, we analyzed the depiction of disability. The results indicate a significant bias, with most generated images portraying disabled individuals as old, sad, and predominantly using manual wheelchairs. These findings highlight the urgent need for more inclusive AI development, ensuring diverse and accurate representation of PWD in generated images. This research underscores the importance of addressing and mitigating biases in AI models to foster equitable and realistic representations.

6/24/2024

🤯

Survey of Bias In Text-to-Image Generation: Definition, Evaluation, and Mitigation

Yixin Wan, Arjun Subramonian, Anaelia Ovalle, Zongyu Lin, Ashima Suvarna, Christina Chance, Hritik Bansal, Rebecca Pattichis, Kai-Wei Chang

The recent advancement of large and powerful models with Text-to-Image (T2I) generation abilities -- such as OpenAI's DALLE-3 and Google's Gemini -- enables users to generate high-quality images from textual prompts. However, it has become increasingly evident that even simple prompts could cause T2I models to exhibit conspicuous social bias in generated images. Such bias might lead to both allocational and representational harms in society, further marginalizing minority groups. Noting this problem, a large body of recent works has been dedicated to investigating different dimensions of bias in T2I systems. However, an extensive review of these studies is lacking, hindering a systematic understanding of current progress and research gaps. We present the first extensive survey on bias in T2I generative models. In this survey, we review prior studies on dimensions of bias: Gender, Skintone, and Geo-Culture. Specifically, we discuss how these works define, evaluate, and mitigate different aspects of bias. We found that: (1) while gender and skintone biases are widely studied, geo-cultural bias remains under-explored; (2) most works on gender and skintone bias investigated occupational association, while other aspects are less frequently studied; (3) almost all gender bias works overlook non-binary identities in their studies; (4) evaluation datasets and metrics are scattered, with no unified framework for measuring biases; and (5) current mitigation methods fail to resolve biases comprehensively. Based on current limitations, we point out future research directions that contribute to human-centric definitions, evaluations, and mitigation of biases. We hope to highlight the importance of studying biases in T2I systems, as well as encourage future efforts to holistically understand and tackle biases, building fair and trustworthy T2I technologies for everyone.

5/3/2024

🛸

Gender Bias Evaluation in Text-to-image Generation: A Survey

Yankun Wu, Yuta Nakashima, Noa Garcia

The rapid development of text-to-image generation has brought rising ethical considerations, especially regarding gender bias. Given a text prompt as input, text-to-image models generate images according to the prompt. Pioneering models such as Stable Diffusion and DALL-E 2 have demonstrated remarkable capabilities in producing high-fidelity images from natural language prompts. However, these models often exhibit gender bias, as studied by the tendency of generating man from prompts such as a photo of a software developer. Given the widespread application and increasing accessibility of these models, bias evaluation is crucial for regulating the development of text-to-image generation. Unlike well-established metrics for evaluating image quality or fidelity, the evaluation of bias presents challenges and lacks standard approaches. Although biases related to other factors, such as skin tone, have been explored, gender bias remains the most extensively studied. In this paper, we review recent work on gender bias evaluation in text-to-image generation, involving bias evaluation setup, bias evaluation metrics, and findings and trends. We primarily focus on the evaluation of recent popular models such as Stable Diffusion, a diffusion model operating in the latent space and using CLIP text embedding, and DALL-E 2, a diffusion model leveraging Seq2Seq architectures like BART. By analyzing recent work and discussing trends, we aim to provide insights for future work.

8/22/2024

Identifying and Improving Disability Bias in GPT-Based Resume Screening

Kate Glazko, Yusuf Mohammed, Ben Kosa, Venkatesh Potluri, Jennifer Mankoff

As Generative AI rises in adoption, its use has expanded to include domains such as hiring and recruiting. However, without examining the potential of bias, this may negatively impact marginalized populations, including people with disabilities. To address this important concern, we present a resume audit study, in which we ask ChatGPT (specifically, GPT-4) to rank a resume against the same resume enhanced with an additional leadership award, scholarship, panel presentation, and membership that are disability related. We find that GPT-4 exhibits prejudice towards these enhanced CVs. Further, we show that this prejudice can be quantifiably reduced by training a custom GPTs on principles of DEI and disability justice. Our study also includes a unique qualitative analysis of the types of direct and indirect ableism GPT-4 uses to justify its biased decisions and suggest directions for additional bias mitigation work. Additionally, since these justifications are presumably drawn from training data containing real-world biased statements made by humans, our analysis suggests additional avenues for understanding and addressing human bias.

5/24/2024