Multilingual Models for Check-Worthy Social Media Posts Detection

Read original: arXiv:2408.06737 - Published 8/14/2024 by Sebastian Kula, Michal Gregor

🔎

Overview

This paper presents an extensive study of transformer-based NLP models for detecting social media posts containing verifiable factual claims and harmful claims.
The study covers various activities, including dataset collection, pre-processing, architecture selection, model training, testing, and implementation.
The focus is on developing multilingual models that can process posts in both English and low-resource languages like Arabic, Bulgarian, Dutch, Polish, Czech, and Slovak.
The proposed models are compared to state-of-the-art approaches, demonstrating their robustness and efficiency in simultaneous detection of harmful and factual posts.

Plain English Explanation

The researchers in this study wanted to develop artificial intelligence (AI) models that can automatically detect two types of claims in social media posts: those that contain verifiable factual information and those that are potentially harmful or misleading. They collected and preprocessed datasets of social media posts, then trained and tested various transformer-based machine learning models to perform this detection task.

A key focus was on developing models that could work well not just in English, but also in several other languages that have fewer available resources for training AI systems, such as Arabic, Bulgarian, Dutch, Polish, Czech, and Slovak. The researchers found that their models were able to effectively detect both factual and harmful claims across multiple languages, outperforming existing state-of-the-art approaches. This is important because it means these AI systems could be used to help identify and counter the spread of misinformation and harmful content on social media platforms around the world.

Technical Explanation

The researchers collected and preprocessed datasets of social media posts in multiple languages, including English, Arabic, Bulgarian, Dutch, Polish, Czech, and Slovak. They then experimented with various transformer-based neural network architectures, such as BERT and its multilingual variants, to build models that could simultaneously detect posts containing verifiable factual claims and posts with harmful or misleading content.

Through extensive training and testing, the researchers found that their multilingual models were able to outperform existing state-of-the-art approaches on this multi-label classification task. The novelty of their work lies in the development of efficient models that can handle both English and low-resource language social media posts, providing a robust and scalable solution for detecting misinformation and harmful content online.

Critical Analysis

The paper provides a thorough and well-designed study, with a strong focus on developing practical, multilingual solutions for an important real-world problem. However, the authors acknowledge some limitations, such as the need for further research to improve the models' performance on certain language-specific tasks and to explore the generalization of the approach to other domains beyond social media.

Additionally, while the study demonstrates the technical capabilities of the proposed models, it does not delve deeply into the broader societal implications of deploying such systems, such as potential biases, ethics considerations, or the risk of over-reliance on automated content moderation. These are important aspects that warrant further discussion and analysis.

Conclusion

This research presents a significant advancement in the development of multilingual, transformer-based NLP models for the simultaneous detection of verifiable factual claims and harmful content in social media posts. The robust performance of the proposed models across multiple languages suggests they could be valuable tools for combating the spread of misinformation and harmful narratives online. However, the broader implications of such AI systems merit careful consideration and continued research to ensure they are developed and deployed responsibly.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🔎

Multilingual Models for Check-Worthy Social Media Posts Detection

Sebastian Kula, Michal Gregor

This work presents an extensive study of transformer-based NLP models for detection of social media posts that contain verifiable factual claims and harmful claims. The study covers various activities, including dataset collection, dataset pre-processing, architecture selection, setup of settings, model training (fine-tuning), model testing, and implementation. The study includes a comprehensive analysis of different models, with a special focus on multilingual models where the same model is capable of processing social media posts in both English and in low-resource languages such as Arabic, Bulgarian, Dutch, Polish, Czech, Slovak. The results obtained from the study were validated against state-of-the-art models, and the comparison demonstrated the robustness of the proposed models. The novelty of this work lies in the development of multi-label multilingual classification models that can simultaneously detect harmful posts and posts that contain verifiable factual claims in an efficient way.

8/14/2024

MultiSocial: Multilingual Benchmark of Machine-Generated Text Detection of Social-Media Texts

Dominik Macko, Jakub Kopal, Robert Moro, Ivan Srba

Recent LLMs are able to generate high-quality multilingual texts, indistinguishable for humans from authentic human-written ones. Research in machine-generated text detection is however mostly focused on the English language and longer texts, such as news articles, scientific papers or student essays. Social-media texts are usually much shorter and often feature informal language, grammatical errors, or distinct linguistic items (e.g., emoticons, hashtags). There is a gap in studying the ability of existing methods in detection of such texts, reflected also in the lack of existing multilingual benchmark datasets. To fill this gap we propose the first multilingual (22 languages) and multi-platform (5 social media platforms) dataset for benchmarking machine-generated text detection in the social-media domain, called MultiSocial. It contains 472,097 texts, of which about 58k are human-written and approximately the same amount is generated by each of 7 multilingual LLMs. We use this benchmark to compare existing detection methods in zero-shot as well as fine-tuned form. Our results indicate that the fine-tuned detectors have no problem to be trained on social-media texts and that the platform selection for training matters.

6/19/2024

Multimodal Large Language Models to Support Real-World Fact-Checking

Jiahui Geng, Yova Kementchedjhieva, Preslav Nakov, Iryna Gurevych

Multimodal large language models (MLLMs) carry the potential to support humans in processing vast amounts of information. While MLLMs are already being used as a fact-checking tool, their abilities and limitations in this regard are understudied. Here is aim to bridge this gap. In particular, we propose a framework for systematically assessing the capacity of current multimodal models to facilitate real-world fact-checking. Our methodology is evidence-free, leveraging only these models' intrinsic knowledge and reasoning capabilities. By designing prompts that extract models' predictions, explanations, and confidence levels, we delve into research questions concerning model accuracy, robustness, and reasons for failure. We empirically find that (1) GPT-4V exhibits superior performance in identifying malicious and misleading multimodal claims, with the ability to explain the unreasonable aspects and underlying motives, and (2) existing open-source models exhibit strong biases and are highly sensitive to the prompt. Our study offers insights into combating false multimodal information and building secure, trustworthy multimodal models. To the best of our knowledge, we are the first to evaluate MLLMs for real-world fact-checking.

4/29/2024

Crafting Tomorrow's Headlines: Neural News Generation and Detection in English, Turkish, Hungarian, and Persian

Cem Uyuk, Danica Rov'o, Shaghayegh Kolli, Rabia Varol, Georg Groh, Daryna Dementieva

In the era dominated by information overload and its facilitation with Large Language Models (LLMs), the prevalence of misinformation poses a significant threat to public discourse and societal well-being. A critical concern at present involves the identification of machine-generated news. In this work, we take a significant step by introducing a benchmark dataset designed for neural news detection in four languages: English, Turkish, Hungarian, and Persian. The dataset incorporates outputs from multiple multilingual generators (in both, zero-shot and fine-tuned setups) such as BloomZ, LLaMa-2, Mistral, Mixtral, and GPT-4. Next, we experiment with a variety of classifiers, ranging from those based on linguistic features to advanced Transformer-based models and LLMs prompting. We present the detection results aiming to delve into the interpretablity and robustness of machine-generated texts detectors across all target languages.

8/21/2024