Evaluating Short-Term Temporal Fluctuations of Social Biases in Social Media Data and Masked Language Models

Read original: arXiv:2406.13556 - Published 6/21/2024 by Yi Zhou, Danushka Bollegala, Jose Camacho-Collados

Evaluating Short-Term Temporal Fluctuations of Social Biases in Social Media Data and Masked Language Models

Overview

This paper investigates short-term temporal fluctuations of social biases in social media data and masked language models.
The researchers examine how social biases, such as gender and racial biases, can change over time in online discussions and language models.
They aim to provide a better understanding of the dynamic nature of social biases and their implications for AI systems.

Plain English Explanation

Social media and language models, like the ones used in chatbots and digital assistants, can sometimes reflect and amplify biases against certain social groups. These biases can be based on gender, race, or other factors. This paper explores how these biases can change over short periods of time in both social media data and the language models themselves.

The researchers wanted to understand how social biases fluctuate and evolve, rather than just looking at them as static or fixed. They analyzed data from social media platforms and also tested different language models to see how the biases shifted over time. This could help us better understand how biases form and change, which is important for developing AI systems that are more fair and inclusive.

Technical Explanation

The paper examines short-term temporal fluctuations of social biases in two key areas: social media data and masked language models.

For the social media analysis, the researchers collected data from online discussions and measured changes in biases related to gender, race, and other social categories over time. They used various bias measurement techniques to track how these biases evolved on a weekly or monthly basis.

In the language model experiments, the team tested several popular masked language models, such as BERT and RoBERTa, to see how the biases exhibited by these models fluctuated over short time periods. They probed the models with targeted prompts and measured biases using established evaluation metrics.

The results show that social biases can indeed vary considerably even over short time frames, both in social media discourse and in the internal representations of language models. This suggests that social biases are dynamic and highlights the importance of studying their temporal nature to better understand and mitigate these issues in AI systems. The findings build on prior research exploring the presence of social biases in large language models.

Critical Analysis

The paper provides a valuable contribution by shedding light on the temporal dynamics of social biases, a topic that has received relatively little attention compared to studying biases in a static manner. The researchers' approach of analyzing both social media data and language models offers a comprehensive view of how biases can fluctuate in different contexts.

However, the study is limited to relatively short time periods, typically on the scale of weeks or months. It would be interesting to see how social biases evolve over longer time frames and whether there are any seasonal or cyclical patterns. Additionally, the paper does not delve into the specific mechanisms or drivers behind the observed bias fluctuations, which could be an area for future research.

More broadly, while this work highlights the dynamic nature of social biases, it is important to note that simply understanding bias fluctuations does not necessarily solve the problem. Developing effective strategies to mitigate biases in AI systems remains a significant challenge that requires continued research and innovation.

Conclusion

This paper provides valuable insights into the temporal dynamics of social biases in both social media data and masked language models. The findings demonstrate that biases are not static but can fluctuate significantly even over short time periods. This underscores the importance of studying bias as a dynamic phenomenon rather than a fixed characteristic.

The implications of this research extend beyond the academic realm, as it has direct relevance for the development of responsible and inclusive AI systems. By better understanding how social biases evolve, researchers and practitioners can work towards designing AI models and applications that are more equitable and less susceptible to the amplification of harmful biases over time.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Evaluating Short-Term Temporal Fluctuations of Social Biases in Social Media Data and Masked Language Models

Yi Zhou, Danushka Bollegala, Jose Camacho-Collados

Social biases such as gender or racial biases have been reported in language models (LMs), including Masked Language Models (MLMs). Given that MLMs are continuously trained with increasing amounts of additional data collected over time, an important yet unanswered question is how the social biases encoded with MLMs vary over time. In particular, the number of social media users continues to grow at an exponential rate, and it is a valid concern for the MLMs trained specifically on social media data whether their social biases (if any) would also amplify over time. To empirically analyse this problem, we use a series of MLMs pretrained on chronologically ordered temporal snapshots of corpora. Our analysis reveals that, although social biases are present in all MLMs, most types of social bias remain relatively stable over time (with a few exceptions). To further understand the mechanisms that influence social biases in MLMs, we analyse the temporal corpora used to train the MLMs. Our findings show that some demographic groups, such as male, obtain higher preference over the other, such as female on the training corpora constantly.

6/21/2024

Ask LLMs Directly, What shapes your bias?: Measuring Social Bias in Large Language Models

Jisu Shin, Hoyun Song, Huije Lee, Soyeong Jeong, Jong C. Park

Social bias is shaped by the accumulation of social perceptions towards targets across various demographic identities. To fully understand such social bias in large language models (LLMs), it is essential to consider the composite of social perceptions from diverse perspectives among identities. Previous studies have either evaluated biases in LLMs by indirectly assessing the presence of sentiments towards demographic identities in the generated text or measuring the degree of alignment with given stereotypes. These methods have limitations in directly quantifying social biases at the level of distinct perspectives among identities. In this paper, we aim to investigate how social perceptions from various viewpoints contribute to the development of social bias in LLMs. To this end, we propose a novel strategy to intuitively quantify these social perceptions and suggest metrics that can evaluate the social biases within LLMs by aggregating diverse social perceptions. The experimental results show the quantitative demonstration of the social attitude in LLMs by examining social perception. The analysis we conducted shows that our proposed metrics capture the multi-dimensional aspects of social bias, enabling a fine-grained and comprehensive investigation of bias in LLMs.

6/7/2024

What is Your Favorite Gender, MLM? Gender Bias Evaluation in Multilingual Masked Language Models

Jeongrok Yu, Seong Ug Kim, Jacob Choi, Jinho D. Choi

Bias is a disproportionate prejudice in favor of one side against another. Due to the success of transformer-based Masked Language Models (MLMs) and their impact on many NLP tasks, a systematic evaluation of bias in these models is needed more than ever. While many studies have evaluated gender bias in English MLMs, only a few works have been conducted for the task in other languages. This paper proposes a multilingual approach to estimate gender bias in MLMs from 5 languages: Chinese, English, German, Portuguese, and Spanish. Unlike previous work, our approach does not depend on parallel corpora coupled with English to detect gender bias in other languages using multilingual lexicons. Moreover, a novel model-based method is presented to generate sentence pairs for a more robust analysis of gender bias, compared to the traditional lexicon-based method. For each language, both the lexicon-based and model-based methods are applied to create two datasets respectively, which are used to evaluate gender bias in an MLM specifically trained for that language using one existing and 3 new scoring metrics. Our results show that the previous approach is data-sensitive and not stable as it does not remove contextual dependencies irrelevant to gender. In fact, the results often flip when different scoring metrics are used on the same dataset, suggesting that gender bias should be studied on a large dataset using multiple evaluation metrics for best practice.

4/11/2024

💬

A Systematic Analysis on the Temporal Generalization of Language Models in Social Media

Asahi Ushio, Jose Camacho-Collados

In machine learning, temporal shifts occur when there are differences between training and test splits in terms of time. For streaming data such as news or social media, models are commonly trained on a fixed corpus from a certain period of time, and they can become obsolete due to the dynamism and evolving nature of online content. This paper focuses on temporal shifts in social media and, in particular, Twitter. We propose a unified evaluation scheme to assess the performance of language models (LMs) under temporal shift on standard social media tasks. LMs are tested on five diverse social media NLP tasks under different temporal settings, which revealed two important findings: (i) the decrease in performance under temporal shift is consistent across different models for entity-focused tasks such as named entity recognition or disambiguation, and hate speech detection, but not significant in the other tasks analysed (i.e., topic and sentiment classification); and (ii) continuous pre-training on the test period does not improve the temporal adaptability of LMs.

5/24/2024