Values That Are Explicitly Present in Fairy Tales: Comparing Samples from German, Italian and Portuguese Traditions

2402.08318

Published 5/7/2024 by Alba Morollon Diaz-Faes, Carla Sofia Ribeiro Murteira, Martin Ruskov

Abstract

Looking at how social values are represented in fairy tales can give insights about the variations in communication of values across cultures. We study how values are communicated in fairy tales from Portugal, Italy and Germany using a technique called word embedding with a compass to quantify vocabulary differences and commonalities. We study how these three national traditions differ in their explicit references to values. To do this, we specify a list of value-charged tokens, consider their word stems and analyse the distance between these in a bespoke pre-trained Word2Vec model. We triangulate and critically discuss the validity of the resulting hypotheses emerging from this quantitative model. Our claim is that this is a reusable and reproducible method for the study of the values explicitly referenced in historical corpora. Finally, our preliminary findings hint at a shared cultural understanding and the expression of values such as Benevolence, Conformity, and Universalism across the studied cultures, suggesting the potential existence of a pan-European cultural memory.

Overview

  • The study explores how social values are represented in fairy tales from different cultural traditions, using a technique called word embedding with a compass to quantify vocabulary differences and commonalities.
  • The researchers analyze fairy tales from Portugal, Italy, and Germany to understand how these national traditions differ in their explicit references to values.
  • The study claims this is a reusable and reproducible method for studying the values explicitly referenced in historical corpora.
  • The preliminary findings suggest a shared cultural understanding and the expression of values such as Benevolence, Conformity, and Universalism across the studied cultures, hinting at a potential pan-European cultural memory.

Plain English Explanation

Fairy tales can provide insights into the values and beliefs of different cultures. In this study, the researchers use a technique called word embedding with a compass to analyze the values represented in fairy tales from Portugal, Italy, and Germany. They look at how these three national traditions differ in their explicit references to values, such as Benevolence, Conformity, and Universalism.

The researchers claim this is a reusable and reproducible method for studying the values in historical texts. Their preliminary findings suggest that despite some differences, there may be a shared cultural understanding and expression of certain values across these European cultures, hinting at a potential pan-European cultural memory.

Technical Explanation

The researchers used a technique called word embedding with a compass to analyze the values represented in fairy tales from Portugal, Italy, and Germany. They specified a list of value-charged tokens, considered their word stems, and analyzed the distance between these in a bespoke pre-trained Word2Vec model.
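As a rough illustration of the distance step described above, the sketch below measures how far apart a few value-charged stems sit in an embedding space. The stems, the three-dimensional vectors, and the helper names are all hypothetical stand-ins for a trained Word2Vec model, and cosine distance is assumed here as a common choice for word embeddings; the summary does not say which metric the authors used.

```python
import math

# Hypothetical toy embedding: stem -> vector. In the study these would come
# from a Word2Vec model trained on the fairy-tale corpora; the numbers below
# are invented purely for illustration.
TOY_VECTORS = {
    "kind": [0.9, 0.1, 0.0],
    "generos": [0.8, 0.2, 0.1],
    "obey": [0.1, 0.9, 0.2],
}


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def cosine_distance(u, v):
    """Assumed distance measure: 0 for same direction, 1 for orthogonal."""
    return 1.0 - cosine_similarity(u, v)


def pairwise_distances(vectors):
    """Distance between every unordered pair of stems, keyed alphabetically."""
    stems = sorted(vectors)
    return {
        (a, b): cosine_distance(vectors[a], vectors[b])
        for i, a in enumerate(stems)
        for b in stems[i + 1:]
    }


distances = pairwise_distances(TOY_VECTORS)
for pair, dist in sorted(distances.items(), key=lambda item: item[1]):
    print(pair, round(dist, 3))
```

With these invented vectors, "kind" and "generos" end up much closer to each other than either is to "obey", which is the kind of pattern one would then have to check against the texts themselves.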

This allowed them to quantify the vocabulary differences and commonalities across the three national traditions in their explicit references to values. The researchers claim this is a reusable and reproducible method for studying the values explicitly referenced in historical corpora.
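To make the cross-tradition comparison concrete, here is a minimal sketch that assumes each tradition already has its own embedding and that all three share one vector space, which is what the compass alignment is meant to provide. The two-dimensional vectors, the stems, and the function names are hypothetical and are not taken from the paper.

```python
import math

# Hypothetical per-tradition vectors for the same stems, assumed to be
# already aligned to a common space. All numbers are invented.
ALIGNED = {
    "german": {"kind": [0.9, 0.1], "obey": [0.2, 0.9]},
    "italian": {"kind": [0.8, 0.3], "obey": [0.9, 0.2]},
    "portuguese": {"kind": [0.85, 0.2], "obey": [0.3, 0.85]},
}


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def cross_tradition_similarity(aligned, stem):
    """Similarity of one stem's vector between every pair of traditions."""
    names = sorted(name for name, vecs in aligned.items() if stem in vecs)
    return {
        (a, b): cosine_similarity(aligned[a][stem], aligned[b][stem])
        for i, a in enumerate(names)
        for b in names[i + 1:]
    }


kind_sims = cross_tradition_similarity(ALIGNED, "kind")
obey_sims = cross_tradition_similarity(ALIGNED, "obey")
for pair in kind_sims:
    print(pair, round(kind_sims[pair], 3), round(obey_sims[pair], 3))
```

In this toy data the stem "kind" is used similarly in all three traditions, while "obey" diverges between the German and Italian samples. A high similarity would count as a commonality and a low one as a difference worth inspecting qualitatively.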

Their preliminary findings suggest a shared cultural understanding and expression of values such as Benevolence, Conformity, and Universalism across the studied cultures, hinting at the potential existence of a pan-European cultural memory. However, the study also acknowledges the limitations of this one-size-fits-all approach and the need to engage with the pluralistic nature of human values.

Critical Analysis

The study presents a novel and potentially valuable approach to analyzing the values represented in historical texts, such as fairy tales. The researchers make a compelling case for the reusability and reproducibility of their method, which could be applied to other cultural corpora.

However, the study acknowledges the limitations of this quantitative approach, as it may not fully capture the nuances and complexities of how values are communicated in these narratives. There is a need to triangulate and critically discuss the validity of the resulting hypotheses, as the researchers have done.

Additionally, the study's focus on a pan-European cultural memory raises questions about the diversity and particularities of the different national traditions. Further research may be needed to better understand the interplay between shared values and cultural differences.

Conclusion

This study demonstrates the potential of using computational techniques, such as word embedding with a compass, to gain insights into the values represented in historical texts across different cultural traditions. The researchers' preliminary findings suggest the existence of a shared cultural understanding and expression of certain values, such as Benevolence, Conformity, and Universalism, among the fairy tales from Portugal, Italy, and Germany.

While the study's methodology is promising, it also highlights the need to engage with the pluralistic nature of human values and to critically examine the limitations of a one-size-fits-all approach. Further research in this area could shed light on the complex interplay between cultural commonalities and differences, ultimately contributing to a deeper understanding of the role of values in shaping our shared heritage.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

The Echoes of Multilinguality: Tracing Cultural Value Shifts during LM Fine-tuning

Rochelle Choenni, Anne Lauscher, Ekaterina Shutova

Texts written in different languages reflect different culturally-dependent beliefs of their writers. Thus, we expect multilingual LMs (MLMs), that are jointly trained on a concatenation of text in multiple languages, to encode different cultural values for each language. Yet, as the 'multilinguality' of these LMs is driven by cross-lingual sharing, we also have reason to believe that cultural values bleed over from one language into another. This limits the use of MLMs in practice, as apart from being proficient in generating text in multiple languages, creating language technology that can serve a community also requires the output of LMs to be sensitive to their biases (Naous et al., 2023). Yet, little is known about how cultural values emerge and evolve in MLMs (Hershcovich et al., 2022a). We are the first to study how languages can exert influence on the cultural values encoded for different test languages, by studying how such values are revised during fine-tuning. Focusing on the fine-tuning stage allows us to study the interplay between value shifts when exposed to new linguistic experience from different data sources and languages. Lastly, we use a training data attribution method to find patterns in the fine-tuning examples, and the languages that they come from, that tend to instigate value shifts.

5/22/2024

FairytaleQA Translated: Enabling Educational Question and Answer Generation in Less-Resourced Languages

Bernardo Leite, Tomás Freitas Osório, Henrique Lopes Cardoso

Question Answering (QA) datasets are crucial in assessing reading comprehension skills for both machines and humans. While numerous datasets have been developed in English for this purpose, a noticeable void exists in less-resourced languages. To alleviate this gap, our paper introduces machine-translated versions of FairytaleQA, a renowned QA dataset designed to assess and enhance narrative comprehension skills in young children. By employing fine-tuned, modest-scale models, we establish benchmarks for both Question Generation (QG) and QA tasks within the translated datasets. In addition, we present a case study proposing a model for generating question-answer pairs, with an evaluation incorporating quality metrics such as question well-formedness, answerability, relevance, and children suitability. Our evaluation prioritizes quantifying and describing error cases, along with providing directions for future work. This paper contributes to the advancement of QA and QG research in less-resourced languages, promoting accessibility and inclusivity in the development of these models for reading comprehension. The code and data are publicly available at github.com/bernardoleite/fairytaleqa-translated.

6/26/2024

Exploring Multilingual Concepts of Human Value in Large Language Models: Is Value Alignment Consistent, Transferable and Controllable across Languages?

Shaoyang Xu, Weilong Dong, Zishan Guo, Xinwei Wu, Deyi Xiong

Prior research in representation engineering has revealed that LLMs encode concepts within their representation spaces, predominantly centered around English. In this study, we extend this philosophy to a multilingual scenario, delving into multilingual human value concepts in LLMs. Through our comprehensive exploration covering 7 types of human values, 16 languages and 3 LLM series with distinct multilinguality, we empirically substantiate the existence of multilingual human values in LLMs. Further cross-lingual analysis on these concepts discloses 3 traits arising from language resource disparities: cross-lingual inconsistency, distorted linguistic relationships, and unidirectional cross-lingual transfer between high- and low-resource languages, all in terms of human value concepts. Additionally, we validate the feasibility of cross-lingual control over value alignment capabilities of LLMs, leveraging the dominant language as a source language. Drawing from our findings on multilingual value alignment, we prudently provide suggestions on the composition of multilingual data for LLMs pre-training: including a limited number of dominant languages for cross-lingual alignment transfer while avoiding their excessive prevalence, and keeping a balanced distribution of non-dominant languages. We aspire that our findings would contribute to enhancing the safety and utility of multilingual AI.

4/17/2024

WorldValuesBench: A Large-Scale Benchmark Dataset for Multi-Cultural Value Awareness of Language Models

Wenlong Zhao, Debanjan Mondal, Niket Tandon, Danica Dillion, Kurt Gray, Yuling Gu

The awareness of multi-cultural human values is critical to the ability of language models (LMs) to generate safe and personalized responses. However, this awareness of LMs has been insufficiently studied, since the computer science community lacks access to the large-scale real-world data about multi-cultural values. In this paper, we present WorldValuesBench, a globally diverse, large-scale benchmark dataset for the multi-cultural value prediction task, which requires a model to generate a rating response to a value question based on demographic contexts. Our dataset is derived from an influential social science project, World Values Survey (WVS), that has collected answers to hundreds of value questions (e.g., social, economic, ethical) from 94,728 participants worldwide. We have constructed more than 20 million examples of the type (demographic attributes, value question) $\rightarrow$ answer from the WVS responses. We perform a case study using our dataset and show that the task is challenging for strong open and closed-source models. On merely $11.1\%$, $25.0\%$, $72.2\%$, and $75.0\%$ of the questions, Alpaca-7B, Vicuna-7B-v1.5, Mixtral-8x7B-Instruct-v0.1, and GPT-3.5 Turbo can respectively achieve $<0.2$ Wasserstein 1-distance from the human normalized answer distributions. WorldValuesBench opens up new research avenues in studying limitations and opportunities in multi-cultural value awareness of LMs.

4/26/2024