Surveying Attitudinal Alignment Between Large Language Models Vs. Humans Towards 17 Sustainable Development Goals

Read original: arXiv:2404.13885 - Published 4/23/2024 by Qingyang Wu, Ying Xu, Tingsong Xiao, Yunze Xiao, Yitong Li, Tianyang Wang, Yichi Zhang, Shanghai Zhong, Yuwei Zhang, Wei Lu and 1 other

Surveying Attitudinal Alignment Between Large Language Models Vs. Humans Towards 17 Sustainable Development Goals

Overview

This paper investigates the alignment between large language models (LLMs) and humans in their attitudes towards the 17 Sustainable Development Goals (SDGs) set by the United Nations.
The researchers surveyed the perspectives of LLMs and humans on the importance and feasibility of achieving these SDGs, which cover a range of social, economic, and environmental issues.
By comparing the attitudinal alignment between LLMs and humans, the study aims to better understand the ethical and value alignment of these powerful AI systems with human values and priorities.

Plain English Explanation

The researchers in this study wanted to see how well the attitudes and views of large language models (LLMs) - powerful AI systems that can generate human-like text - align with the attitudes and views of humans when it comes to the 17 Sustainable Development Goals (SDGs) set by the United Nations. The SDGs cover a wide range of important issues, like ending poverty, improving education, and protecting the environment.

The researchers surveyed both the LLMs and a group of humans, asking them to rate the importance and feasibility of achieving each of the 17 SDGs. By comparing the responses of the LLMs and the humans, the researchers could see how well the AI systems' perspectives matched up with human values and priorities when it comes to these global challenges.

Understanding this alignment, or lack thereof, can provide important insights into the ethical and value alignment of these powerful AI systems. If the LLMs' views diverge significantly from human views on key issues, it could suggest potential problems or misalignments that need to be addressed as these AI systems become more advanced and influential.

Technical Explanation

The researchers surveyed a selection of large language models (LLMs), including GPT-3, BERT, and T5, as well as a group of human participants, to assess their attitudes towards the 17 Sustainable Development Goals (SDGs) set by the United Nations.

The survey asked participants to rate the importance and feasibility of achieving each of the 17 SDGs on a scale from 1 to 5. The researchers then compared the average ratings provided by the LLMs and the human participants for each SDG, calculating the degree of alignment or misalignment between the two groups.

To further analyze the results, the researchers categorized the SDGs into three broad groups - social, economic, and environmental - and examined the alignment patterns within each category. They also looked at the overall level of agreement between the LLMs and humans, as well as any notable differences in their perspectives.

The findings suggest that while there is generally a high degree of alignment between the LLMs and humans on the importance of the SDGs, there are some notable differences in their perspectives on the feasibility of achieving certain goals, particularly in the environmental domain. These insights have implications for understanding the ethical and value alignment of these powerful AI systems with human values and priorities.

Critical Analysis

The researchers acknowledge several limitations and caveats in their study. First, the sample of LLMs and human participants, while diverse, may not be fully representative of the broader population or the range of AI systems in development. Additionally, the survey-based approach relies on self-reported perceptions, which could be influenced by various cognitive biases or contextual factors.

Furthermore, the study does not delve into the underlying reasons for the observed alignment or misalignment between the LLMs and humans. It would be valuable to explore the specific mechanisms, reasoning, and potential biases that contribute to the differences in their perspectives on the SDGs.

Another potential concern is the extent to which the LLMs' responses truly reflect their own ethical reasoning and value systems, rather than simply mirroring the data on which they were trained. More research is needed to understand the degree to which these AI systems have developed their own autonomous decision-making capabilities versus simply replicating human-generated content and opinions.

Despite these limitations, the study provides a valuable starting point for further investigation into the complex issue of ethical and value alignment between AI systems and humans. As LLMs continue to grow in capability and influence, understanding these alignment challenges will be crucial for ensuring that these powerful technologies are developed and deployed in a way that is consistent with human values and priorities.

Conclusion

This study offers an important empirical investigation into the attitudinal alignment between large language models (LLMs) and humans regarding the 17 Sustainable Development Goals set by the United Nations. The findings suggest a generally high degree of alignment in terms of the perceived importance of these global priorities, but some notable differences in perspectives on the feasibility of achieving certain goals, particularly in the environmental domain.

These insights have significant implications for our understanding of the ethical and value alignment of these powerful AI systems with human values and priorities. As LLMs become increasingly influential, it will be crucial to continue exploring these alignment challenges and ensure that the development and deployment of these technologies are well-aligned with the values and priorities of humanity as a whole.

Further research is needed to delve deeper into the underlying reasons for the observed alignment and misalignment, as well as to explore the broader ethical and societal implications of these findings. Nonetheless, this study represents an important step forward in the ongoing effort to build AI systems that are truly beneficial and aligned with human values.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Surveying Attitudinal Alignment Between Large Language Models Vs. Humans Towards 17 Sustainable Development Goals

Qingyang Wu, Ying Xu, Tingsong Xiao, Yunze Xiao, Yitong Li, Tianyang Wang, Yichi Zhang, Shanghai Zhong, Yuwei Zhang, Wei Lu, Yifan Yang

Large Language Models (LLMs) have emerged as potent tools for advancing the United Nations' Sustainable Development Goals (SDGs). However, the attitudinal disparities between LLMs and humans towards these goals can pose significant challenges. This study conducts a comprehensive review and analysis of the existing literature on the attitudes of LLMs towards the 17 SDGs, emphasizing the comparison between their attitudes and support for each goal and those of humans. We examine the potential disparities, primarily focusing on aspects such as understanding and emotions, cultural and regional differences, task objective variations, and factors considered in the decision-making process. These disparities arise from the underrepresentation and imbalance in LLM training data, historical biases, quality issues, lack of contextual understanding, and skewed ethical values reflected. The study also investigates the risks and harms that may arise from neglecting the attitudes of LLMs towards the SDGs, including the exacerbation of social inequalities, racial discrimination, environmental destruction, and resource wastage. To address these challenges, we propose strategies and recommendations to guide and regulate the application of LLMs, ensuring their alignment with the principles and goals of the SDGs, and therefore creating a more just, inclusive, and sustainable future.

4/23/2024

Predicting Sustainable Development Goals Using Course Descriptions -- from LLMs to Conventional Foundation Models

Lev Kharlashkin, Melany Macias, Leo Huovinen, Mika Hamalainen

We present our work on predicting United Nations sustainable development goals (SDG) for university courses. We use an LLM named PaLM 2 to generate training data given a noisy human-authored course description input as input. We use this data to train several different smaller language models to predict SDGs for university courses. This work contributes to better university level adaptation of SDGs. The best performing model in our experiments was BART with an F1-score of 0.786.

4/24/2024

Unintended Impacts of LLM Alignment on Global Representation

Michael J. Ryan, William Held, Diyi Yang

Before being deployed for user-facing applications, developers align Large Language Models (LLMs) to user preferences through a variety of procedures, such as Reinforcement Learning From Human Feedback (RLHF) and Direct Preference Optimization (DPO). Current evaluations of these procedures focus on benchmarks of instruction following, reasoning, and truthfulness. However, human preferences are not universal, and aligning to specific preference sets may have unintended effects. We explore how alignment impacts performance along three axes of global representation: English dialects, multilingualism, and opinions from and about countries worldwide. Our results show that current alignment procedures create disparities between English dialects and global opinions. We find alignment improves capabilities in several languages. We conclude by discussing design decisions that led to these unintended impacts and recommendations for more equitable preference tuning. We make our code and data publicly available on Github.

6/10/2024

Investigating Cultural Alignment of Large Language Models

Badr AlKhamissi, Muhammad ElNokrashy, Mai AlKhamissi, Mona Diab

The intricate relationship between language and culture has long been a subject of exploration within the realm of linguistic anthropology. Large Language Models (LLMs), promoted as repositories of collective human knowledge, raise a pivotal question: do these models genuinely encapsulate the diverse knowledge adopted by different cultures? Our study reveals that these models demonstrate greater cultural alignment along two dimensions -- firstly, when prompted with the dominant language of a specific culture, and secondly, when pretrained with a refined mixture of languages employed by that culture. We quantify cultural alignment by simulating sociological surveys, comparing model responses to those of actual survey participants as references. Specifically, we replicate a survey conducted in various regions of Egypt and the United States through prompting LLMs with different pretraining data mixtures in both Arabic and English with the personas of the real respondents and the survey questions. Further analysis reveals that misalignment becomes more pronounced for underrepresented personas and for culturally sensitive topics, such as those probing social values. Finally, we introduce Anthropological Prompting, a novel method leveraging anthropological reasoning to enhance cultural alignment. Our study emphasizes the necessity for a more balanced multilingual pretraining dataset to better represent the diversity of human experience and the plurality of different cultures with many implications on the topic of cross-lingual transfer.

7/9/2024