Generative AI Search Engines as Arbiters of Public Knowledge: An Audit of Bias and Authority

Read original: arXiv:2405.14034 - Published 5/24/2024 by Alice Li, Luanne Sinnamon

🤖

Overview

This paper examines how new generative AI search engines like ChatGPT, Bing Chat, and Perplexity construct responses and establish authority on topics of public importance.
The researchers collected system responses to 48 authentic queries across 4 topics over a 7-day period, then analyzed the data using sentiment analysis, coding, and source classification.
The study provides insights into the nature of responses from these AI systems, including evidence of sentiment, commercial, and geographic biases, as well as uneven quality of sources used to support claims.

Plain English Explanation

The paper looks at how the latest AI-powered search engines, like ChatGPT, Bing Chat, and Perplexity, construct their responses and establish authority on important public topics. The researchers collected responses from these systems over a week, using real-world questions, and then analyzed the data to understand patterns and potential biases.

The key findings suggest that the AI systems can show biases in their responses, both in terms of sentiment (positive or negative) and in the types of sources they use to back up their claims. For example, the systems tended to rely heavily on news, business, and digital media websites, rather than academic or government sources. This uneven quality of sources could be problematic, especially when people are using these AI systems to make important decisions about public issues or their personal wellbeing.

The researchers recommend that users of these generative AI search engines should critically examine the outputs and not blindly accept them as authoritative. They need to be aware of the potential biases and limitations of these systems, especially when the information is related to matters of public interest.

Technical Explanation

The researchers conducted an audit study to investigate how new generative AI search engines like ChatGPT, Bing Chat, and Perplexity construct responses and establish authority on topics of public importance.

They collected system responses to a set of 48 authentic queries across 4 topics (politics, health, science, and current events) over a 7-day period. The data was then analyzed using sentiment analysis, inductive coding, and source classification techniques.

The results provide an overview of the nature of system responses across these AI models. The researchers found evidence of sentiment bias based on the queries and topics, as well as commercial and geographic biases in the sources used to support claims. The quality of sources relied upon was uneven, with a heavy emphasis on news, media, business, and digital media websites, rather than academic or government sources.

Critical Analysis

The paper acknowledges some limitations of the study, such as the relatively small sample size and the focus on only 4 specific topics. The researchers also note that their analysis of source quality is subjective and may not capture all nuances.

One potential issue not addressed in the paper is the evolving nature of these generative AI systems. As the technology continues to develop, the patterns and biases observed in this study may change over time, requiring ongoing monitoring and evaluation.

Additionally, the paper does not delve deeply into the potential societal implications of these biases in AI-generated content, especially when it comes to the impact on news production and consumption. Further research is needed to understand the broader ramifications of generative AI on information access and decision-making.

Conclusion

This study provides valuable insights into the behavior and potential biases of new generative AI search engines, such as ChatGPT, Bing Chat, and Perplexity. The findings highlight the importance of users critically examining the outputs of these systems, especially when the information is related to matters of public interest or personal well-being.

As the use of generative AI in information access continues to grow, it will be crucial to further investigate the sociotechnical implications and develop strategies to ensure these powerful tools are used responsibly and equitably.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🤖

Generative AI Search Engines as Arbiters of Public Knowledge: An Audit of Bias and Authority

Alice Li, Luanne Sinnamon

This paper reports on an audit study of generative AI systems (ChatGPT, Bing Chat, and Perplexity) which investigates how these new search engines construct responses and establish authority for topics of public importance. We collected system responses using a set of 48 authentic queries for 4 topics over a 7-day period and analyzed the data using sentiment analysis, inductive coding and source classification. Results provide an overview of the nature of system responses across these systems and provide evidence of sentiment bias based on the queries and topics, and commercial and geographic bias in sources. The quality of sources used to support claims is uneven, relying heavily on News and Media, Business and Digital Media websites. Implications for system users emphasize the need to critically examine Generative AI system outputs when making decisions related to public interest and personal well-being.

5/24/2024

🤿

Potential Societal Biases of ChatGPT in Higher Education: A Scoping Review

Ming Li, Ariunaa Enkhtur, Beverley Anne Yamamoto, Fei Cheng, Lilan Chen

Purpose:Generative Artificial Intelligence (GAI) models, such as ChatGPT, may inherit or amplify societal biases due to their training on extensive datasets. With the increasing usage of GAI by students, faculty, and staff in higher education institutions (HEIs), it is urgent to examine the ethical issues and potential biases associated with these technologies. Design/Approach/Methods:This scoping review aims to elucidate how biases related to GAI in HEIs have been researched and discussed in recent academic publications. We categorized the potential societal biases that GAI might cause in the field of higher education. Our review includes articles written in English, Chinese, and Japanese across four main databases, focusing on GAI usage in higher education and bias. Findings:Our findings reveal that while there is meaningful scholarly discussion around bias and discrimination concerning LLMs in the AI field, most articles addressing higher education approach the issue superficially. Few articles identify specific types of bias under different circumstances, and there is a notable lack of empirical research. Most papers in our review focus primarily on educational and research fields related to medicine and engineering, with some addressing English education. However, there is almost no discussion regarding the humanities and social sciences. Additionally, a significant portion of the current discourse is in English and primarily addresses English-speaking contexts. Originality/Value:To the best of our knowledge, our study is the first to summarize the potential societal biases in higher education. This review highlights the need for more in-depth studies and empirical work to understand the specific biases that GAI might introduce or amplify in educational settings, guiding the development of more ethical AI applications in higher education.

7/12/2024

🖼️

The Use of Generative Search Engines for Knowledge Work and Complex Tasks

Siddharth Suri, Scott Counts, Leijie Wang, Chacha Chen, Mengting Wan, Tara Safavi, Jennifer Neville, Chirag Shah, Ryen W. White, Reid Andersen, Georg Buscher, Sathish Manivannan, Nagu Rangan, Longqi Yang

Until recently, search engines were the predominant method for people to access online information. The recent emergence of large language models (LLMs) has given machines new capabilities such as the ability to generate new digital artifacts like text, images, code etc., resulting in a new tool, a generative search engine, which combines the capabilities of LLMs with a traditional search engine. Through the empirical analysis of Bing Copilot (Bing Chat), one of the first publicly available generative search engines, we analyze the types and complexity of tasks that people use Bing Copilot for compared to Bing Search. Findings indicate that people use the generative search engine for more knowledge work tasks that are higher in cognitive complexity than were commonly done with a traditional search engine.

4/9/2024

Sociotechnical Implications of Generative Artificial Intelligence for Information Access

Bhaskar Mitra, Henriette Cramer, Olya Gurevich

Robust access to trustworthy information is a critical need for society with implications for knowledge production, public health education, and promoting informed citizenry in democratic societies. Generative AI technologies may enable new ways to access information and improve effectiveness of existing information retrieval systems but we are only starting to understand and grapple with their long-term social implications. In this chapter, we present an overview of some of the systemic consequences and risks of employing generative AI in the context of information access. We also provide recommendations for evaluation and mitigation, and discuss challenges for future research.

7/17/2024