Detection of Conspiracy Theories Beyond Keyword Bias in German-Language Telegram Using Large Language Models

2404.17985

Published 4/30/2024 by Milena Pustet, Elisabeth Steffen, Helena Mihaljevi'c

🔎

Abstract

The automated detection of conspiracy theories online typically relies on supervised learning. However, creating respective training data requires expertise, time and mental resilience, given the often harmful content. Moreover, available datasets are predominantly in English and often keyword-based, introducing a token-level bias into the models. Our work addresses the task of detecting conspiracy theories in German Telegram messages. We compare the performance of supervised fine-tuning approaches using BERT-like models with prompt-based approaches using Llama2, GPT-3.5, and GPT-4 which require little or no additional training data. We use a dataset of $sim!! 4,000$ messages collected during the COVID-19 pandemic, without the use of keyword filters. Our findings demonstrate that both approaches can be leveraged effectively: For supervised fine-tuning, we report an F1 score of $sim!! 0.8$ for the positive class, making our model comparable to recent models trained on keyword-focused English corpora. We demonstrate our model's adaptability to intra-domain temporal shifts, achieving F1 scores of $sim!! 0.7$. Among prompting variants, the best model is GPT-4, achieving an F1 score of $sim!! 0.8$ for the positive class in a zero-shot setting and equipped with a custom conspiracy theory definition.

Create account to get full access

Overview

The paper addresses the challenge of detecting conspiracy theories in German Telegram messages during the COVID-19 pandemic.
It compares the performance of supervised fine-tuning approaches using BERT-like models with prompt-based approaches using large language models like LLaMA2, GPT-3.5, and GPT-4.
The researchers used a dataset of around 4,000 messages without relying on keyword filters, which can introduce bias into the models.

Plain English Explanation

The paper looks at ways to automatically detect conspiracy theories in German messages on the Telegram messaging app during the COVID-19 pandemic. [https://aimodels.fyi/papers/arxiv/rumour-evaluation-very-large-language-models] Typically, this kind of detection relies on supervised machine learning, where the model is trained on a set of labeled examples. However, creating that training data can be difficult and time-consuming, as the content is often harmful.

The researchers tried two different approaches: [https://aimodels.fyi/papers/arxiv/surprising-efficacy-fine-tuned-transformers-fact-checking]

Supervised fine-tuning: They took a BERT-like language model and fine-tuned it on their dataset of German conspiracy theory messages. This achieved an F1 score of around 0.8, which is comparable to recent models trained on English data.
Prompt-based approaches: They also tested using large language models like LLaMA2, GPT-3.5, and GPT-4 in a zero-shot setting, where the model is given a custom definition of conspiracy theories and asked to classify the messages. The best-performing model was GPT-4, which achieved an F1 score of around 0.8 without any additional training.

The researchers found that both approaches can be effective at detecting conspiracy theories in this context. [https://aimodels.fyi/papers/arxiv/adapting-fake-news-detection-to-era-large] They also showed that their supervised model could adapt to changes in the data over time, maintaining an F1 score of around 0.7.

Technical Explanation

The paper compares two approaches for detecting conspiracy theories in German Telegram messages:

Supervised fine-tuning: The researchers fine-tuned a BERT-like model (specifically, a German version of DistilBERT) on a dataset of around 4,000 Telegram messages collected during the COVID-19 pandemic. They did not use any keyword filters, which can introduce bias. The fine-tuned model achieved an F1 score of around 0.8 for the positive (conspiracy theory) class, comparable to recent models trained on English datasets.
Prompt-based approaches: The researchers also tested using large language models like LLaMA2, GPT-3.5, and GPT-4 in a zero-shot setting. These models were given a custom definition of conspiracy theories and asked to classify the messages without any additional training. The best-performing model was GPT-4, which achieved an F1 score of around 0.8 for the positive class.

[https://aimodels.fyi/papers/arxiv/chatgpt-vs-media-bias-comparative-study-gpt] The researchers also evaluated their supervised model's ability to adapt to temporal shifts in the data, simulating a real-world scenario where the model needs to handle changes in the distribution of conspiracy theories over time. They report an F1 score of around 0.7 in this setting, demonstrating the model's robustness.

Critical Analysis

The paper presents a valuable contribution to the challenge of detecting conspiracy theories in non-English social media content. [https://aimodels.fyi/papers/arxiv/stance-detection-social-media-fine-tuned-large] However, there are a few potential limitations and areas for further research:

The dataset, while larger than many previous studies, is still relatively small, with only around 4,000 messages. Expanding the dataset size and diversity could help improve the models' performance and generalization.
The paper does not provide much insight into the specific types of conspiracy theories present in the dataset or how the models perform on different subtypes. Analyzing the model's strengths and weaknesses across various conspiracy theory categories could yield useful insights.
The prompt-based approaches rely on the language models' ability to understand and apply the provided conspiracy theory definition. Further research could explore ways to optimize the prompts or incorporate additional contextual information to improve the models' performance.

Overall, the paper presents a solid methodological approach and promising results, but there is still room for further exploration and refinement of the techniques for detecting conspiracy theories in diverse, multilingual social media settings.

Conclusion

This paper addresses the important challenge of automatically detecting conspiracy theories in German Telegram messages during the COVID-19 pandemic. The researchers compared the performance of supervised fine-tuning approaches and prompt-based approaches using large language models, demonstrating the effectiveness of both methods.

The supervised fine-tuning approach achieved an F1 score of around 0.8 for the positive (conspiracy theory) class, comparable to recent models trained on English datasets. The prompt-based approach, using GPT-4 with a custom conspiracy theory definition, also achieved an impressive F1 score of around 0.8 in a zero-shot setting, without any additional training.

These findings suggest that both supervised and prompt-based techniques can be leveraged effectively for detecting conspiracy theories in non-English social media content. The researchers also showed that their supervised model could adapt to temporal shifts in the data, maintaining robust performance over time.

Overall, this work contributes valuable insights and techniques for addressing the challenge of conspiracy theory detection, particularly in multilingual and evolving social media contexts.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

ConspEmoLLM: Conspiracy Theory Detection Using an Emotion-Based Large Language Model

Zhiwei Liu, Boyang Liu, Paul Thompson, Kailai Yang, Sophia Ananiadou

The internet has brought both benefits and harms to society. A prime example of the latter is misinformation, including conspiracy theories, which flood the web. Recent advances in natural language processing, particularly the emergence of large language models (LLMs), have improved the prospects of accurate misinformation detection. However, most LLM-based approaches to conspiracy theory detection focus only on binary classification and fail to account for the important relationship between misinformation and affective features (i.e., sentiment and emotions). Driven by a comprehensive analysis of conspiracy text that reveals its distinctive affective features, we propose ConspEmoLLM, the first open-source LLM that integrates affective information and is able to perform diverse tasks relating to conspiracy theories. These tasks include not only conspiracy theory detection, but also classification of theory type and detection of related discussion (e.g., opinions towards theories). ConspEmoLLM is fine-tuned based on an emotion-oriented LLM using our novel ConDID dataset, which includes five tasks to support LLM instruction tuning and evaluation. We demonstrate that when applied to these tasks, ConspEmoLLM largely outperforms several open-source general domain LLMs and ChatGPT, as well as an LLM that has been fine-tuned using ConDID, but which does not use affective features. This project will be released on https://github.com/lzw108/ConspEmoLLM/.

5/20/2024

cs.CL

💬

Rumour Evaluation with Very Large Language Models

Dahlia Shehata, Robin Cohen, Charles Clarke

Conversational prompt-engineering-based large language models (LLMs) have enabled targeted control over the output creation, enhancing versatility, adaptability and adhoc retrieval. From another perspective, digital misinformation has reached alarming levels. The anonymity, availability and reach of social media offer fertile ground for rumours to propagate. This work proposes to leverage the advancement of prompting-dependent LLMs to combat misinformation by extending the research efforts of the RumourEval task on its Twitter dataset. To the end, we employ two prompting-based LLM variants (GPT-3.5-turbo and GPT-4) to extend the two RumourEval subtasks: (1) veracity prediction, and (2) stance classification. For veracity prediction, three classifications schemes are experimented per GPT variant. Each scheme is tested in zero-, one- and few-shot settings. Our best results outperform the precedent ones by a substantial margin. For stance classification, prompting-based-approaches show comparable performance to prior results, with no improvement over finetuning methods. Rumour stance subtask is also extended beyond the original setting to allow multiclass classification. All of the generated predictions for both subtasks are equipped with confidence scores determining their trustworthiness degree according to the LLM, and post-hoc justifications for explainability and interpretability purposes. Our primary aim is AI for social good.

4/29/2024

cs.CL cs.SI

🎲

Unveiling Online Conspiracy Theorists: a Text-Based Approach and Characterization

Alessandra Recordare, Guglielmo Cola, Tiziano Fagni, Maurizio Tesconi

In today's digital landscape, the proliferation of conspiracy theories within the disinformation ecosystem of online platforms represents a growing concern. This paper delves into the complexities of this phenomenon. We conducted a comprehensive analysis of two distinct X (formerly known as Twitter) datasets: one comprising users with conspiracy theorizing patterns and another made of users lacking such tendencies and thus serving as a control group. The distinguishing factors between these two groups are explored across three dimensions: emotions, idioms, and linguistic features. Our findings reveal marked differences in the lexicon and language adopted by conspiracy theorists with respect to other users. We developed a machine learning classifier capable of identifying users who propagate conspiracy theories based on a rich set of 871 features. The results demonstrate high accuracy, with an average F1 score of 0.88. Moreover, this paper unveils the most discriminating characteristics that define conspiracy theory propagators.

5/22/2024

cs.SI cs.CY

Zero-shot prompt-based classification: topic labeling in times of foundation models in German Tweets

Simon Munker, Kai Kugler, Achim Rettinger

Filtering and annotating textual data are routine tasks in many areas, like social media or news analytics. Automating these tasks allows to scale the analyses wrt. speed and breadth of content covered and decreases the manual effort required. Due to technical advancements in Natural Language Processing, specifically the success of large foundation models, a new tool for automating such annotation processes by using a text-to-text interface given written guidelines without providing training samples has become available. In this work, we assess these advancements in-the-wild by empirically testing them in an annotation task on German Twitter data about social and political European crises. We compare the prompt-based results with our human annotation and preceding classification approaches, including Naive Bayes and a BERT-based fine-tuning/domain adaptation pipeline. Our results show that the prompt-based approach - despite being limited by local computation resources during the model selection - is comparable with the fine-tuned BERT but without any annotated training data. Our findings emphasize the ongoing paradigm shift in the NLP landscape, i.e., the unification of downstream tasks and elimination of the need for pre-labeled training data.

6/27/2024

cs.CL cs.AI