Paired Completion: Flexible Quantification of Issue-framing at Scale with LLMs

Read original: arXiv:2408.09742 - Published 8/20/2024 by Simon D Angus, Lachlan O'Neill

Paired Completion: Flexible Quantification of Issue-framing at Scale with LLMs

Overview

Presents a new method called "Paired Completion" for quantifying issue framing in large language models (LLMs) at scale
Demonstrates the method on political news articles to measure framing of political issues
Provides a flexible, scalable approach to analyze how language models portray different perspectives on complex topics

Plain English Explanation

The paper introduces a new technique called "Paired Completion" that allows researchers to quantify how large language models [like GPT-3] frame complex issues, such as political topics, at a large scale.

The key idea is to present the language model with a prompt that frames an issue in a particular way, and then measure how the model completes that prompt. By comparing completions across different framing prompts, the researchers can gain insights into how the model portrays different perspectives on the issue.

For example, the researchers might give the model a prompt like "The issue of immigration is primarily about [BLANK]." By measuring how the model completes that prompt when the blank is filled in with different words (e.g. "border security" vs. "humanitarian concerns"), they can assess how the model's responses reflect different framings of the immigration issue.

This approach provides a flexible and scalable way to analyze how language models represent complex, multi-faceted topics. Rather than relying on manual analysis of model outputs, the Paired Completion method allows researchers to systematically quantify framing at a large scale.

Technical Explanation

The core of the Paired Completion method is to create "framing prompts" that represent different perspectives on an issue, and then measure how a language model completes those prompts.

For example, to analyze framing of the immigration issue, the researchers might create prompts like:

"The issue of immigration is primarily about [BLANK]"
"Immigration policy should focus on [BLANK]"
"The main challenge with immigration is [BLANK]"

By filling in the blank with different words/phrases, they can create prompts that reflect different frames, such as "border security," "humanitarianism," "cultural identity," etc. They then have the language model complete each of these prompts and analyze the differences in the completions.

The researchers demonstrate this approach on a corpus of political news articles, showing how it can uncover systematic differences in how language models portray issues like immigration, gun control, and abortion based on the framing of the prompts.

Importantly, the Paired Completion method is flexible - it can be applied to analyze framing on any complex, multi-faceted topic, not just political issues. And it scales easily to large language models and text corpora, providing a powerful tool for quantifying issue framing at scale.

Critical Analysis

The Paired Completion method represents an innovative approach to studying how language models portray complex topics. By systematically varying the framing of prompts, it provides a flexible and scalable way to uncover biases and differences in model outputs.

That said, the paper acknowledges some key limitations. First, the method relies on the researcher's ability to create appropriate framing prompts - the results will only be as good as the prompts themselves. There is also the potential for model responses to be influenced by factors beyond just the prompt framing, such as the model's training data and internal biases.

Additionally, while the paper demonstrates the method on political issues, it remains to be seen how well it will generalize to other domains. Analyzing framing for more technical or scientific topics may require different strategies for creating effective prompts.

Overall, the Paired Completion technique is a promising new tool for studying language model biases and issue framing. However, as with any analysis method, its results should be interpreted cautiously and in conjunction with other forms of model evaluation and validation.

Conclusion

The Paired Completion method introduced in this paper provides a novel approach for quantifying how large language models frame complex, multi-faceted issues. By systematically varying the prompts given to the models, researchers can gain insights into the different perspectives and biases reflected in the model outputs.

This flexible, scalable technique has the potential to unlock new ways of studying and understanding the knowledge and decision-making capabilities of large language models. As these models become increasingly influential, tools like Paired Completion will be crucial for unpacking their implicit biases and ensuring they are deployed responsibly.

While the method has some limitations, it represents an important step forward in the ongoing effort to build more transparent and accountable AI systems. By continuing to develop innovative analysis techniques, the research community can work towards realizing the full potential of large language models while mitigating their risks.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Paired Completion: Flexible Quantification of Issue-framing at Scale with LLMs

Simon D Angus, Lachlan O'Neill

Detecting and quantifying issue framing in textual discourse - the perspective one takes to a given topic (e.g. climate science vs. denialism, misogyny vs. gender equality) - is highly valuable to a range of end-users from social and political scientists to program evaluators and policy analysts. However, conceptual framing is notoriously challenging for automated natural language processing (NLP) methods since the words and phrases used by either `side' of an issue are often held in common, with only subtle stylistic flourishes separating their use. Here we develop and rigorously evaluate new detection methods for issue framing and narrative analysis within large text datasets. By introducing a novel application of next-token log probabilities derived from generative large language models (LLMs) we show that issue framing can be reliably and efficiently detected in large corpora with only a few examples of either perspective on a given issue, a method we call `paired completion'. Through 192 independent experiments over three novel, synthetic datasets, we evaluate paired completion against prompt-based LLM methods and labelled methods using traditional NLP and recent LLM contextual embeddings. We additionally conduct a cost-based analysis to mark out the feasible set of performant methods at production-level scales, and a model bias analysis. Together, our work demonstrates a feasible path to scalable, accurate and low-bias issue-framing in large corpora.

8/20/2024

A Study on Scaling Up Multilingual News Framing Analysis

Syeda Sabrina Akter, Antonios Anastasopoulos

Media framing is the study of strategically selecting and presenting specific aspects of political issues to shape public opinion. Despite its relevance to almost all societies around the world, research has been limited due to the lack of available datasets and other resources. This study explores the possibility of dataset creation through crowdsourcing, utilizing non-expert annotators to develop training corpora. We first extend framing analysis beyond English news to a multilingual context (12 typologically diverse languages) through automatic translation. We also present a novel benchmark in Bengali and Portuguese on the immigration and same-sex marriage domains. Additionally, we show that a system trained on our crowd-sourced dataset, combined with other existing ones, leads to a 5.32 percentage point increase from the baseline, showing that crowdsourcing is a viable option. Last, we study the performance of large language models (LLMs) for this task, finding that task-specific fine-tuning is a better approach than employing bigger non-specialized models.

4/3/2024

Evaluating the Ability of Computationally Extracted Narrative Maps to Encode Media Framing

Sebasti'an Concha Mac'ias, Brian Keith Norambuena

Narratives serve as fundamental frameworks in our understanding of the world and play a crucial role in collaborative sensemaking, providing a versatile foundation for sensemaking. Framing is a subtle yet potent mechanism that influences public perception through specific word choices, shaping interpretations of reported news events. Despite the recognized importance of narratives and framing, a significant gap exists in the literature with regard to the explicit consideration of framing within the context of computational extraction and representation. This article explores the capabilities of a specific narrative extraction and representation approach -- narrative maps -- to capture framing information from news data. The research addresses two key questions: (1) Does the narrative extraction method capture the framing distribution of the data set? (2) Does it produce a representation with consistent framing? Our results indicate that while the algorithm captures framing distributions, achieving consistent framing across various starting and ending events poses challenges. Our results highlight the potential of narrative maps to provide users with insights into the intricate framing dynamics within news narratives. However, we note that directly leveraging framing information in the computational narrative extraction process remains an open challenge.

5/7/2024

Narratives at Conflict: Computational Analysis of News Framing in Multilingual Disinformation Campaigns

Antonina Sinelnik, Dirk Hovy

Any report frames issues to favor a particular interpretation by highlighting or excluding certain aspects of a story. Despite the widespread use of framing in disinformation, framing properties and detection methods remain underexplored outside the English-speaking world. We explore how multilingual framing of the same issue differs systematically. We use eight years of Russia-backed disinformation campaigns, spanning 8k news articles in 4 languages targeting 15 countries. We find that disinformation campaigns consistently and intentionally favor specific framing, depending on the target language of the audience. We further discover how Russian-language articles consistently highlight selected frames depending on the region of the media coverage. We find that the two most prominent models for automatic frame analysis underperform and show high disagreement, highlighting the need for further research.

8/27/2024