Machine-Generated Text Localization

Read original: arXiv:2402.11744 - Published 6/12/2024 by Zhongping Zhang, Wenda Qin, Bryan A. Plummer

Overview

This paper explores the challenge of locating machine-generated text within larger documents.
The researchers propose a novel approach to identify and isolate machine-generated content, which could have important implications for detecting misinformation and authenticating online content.
The work builds on recent advancements in text authenticity detection and machine-generated text datasets.

Plain English Explanation

The paper addresses the problem of distinguishing machine-generated text from human-written text within a larger document. This is an important challenge, as the increasing sophistication of language models makes it harder to detect machine-generated content, which can be used to spread misinformation online.

The researchers develop a new approach to tackle this problem. Rather than trying to classify an entire document as machine-generated or not, they focus on identifying the specific regions or sections of a document that were likely produced by an AI system. This could allow for more accurate detection of misinformation and help maintain trust in online content.

The work builds on recent advancements in related fields, such as techniques for detecting machine-generated text and the creation of new datasets to train and evaluate these models.

Technical Explanation

The core of the researchers' approach is a machine learning model that can locate regions of machine-generated text within a larger document. The model takes a document as input and outputs bounding boxes around the areas it identifies as likely to be AI-generated.

To train and evaluate this model, the researchers used a combination of existing datasets, including PETKAZ, which contains examples of both human-written and machine-generated text. They explored different neural network architectures and training techniques to optimize the model's performance.

The key innovation is the model's ability to pinpoint the specific locations of machine-generated content, rather than just classifying an entire document. This could allow for more nuanced and accurate detection of misinformation, as well as the potential to authenticate the provenance of online content.

Critical Analysis

The researchers acknowledge some limitations of their approach, such as the need for further refinement to improve the model's precision and recall. Additionally, the performance of the system may be dependent on the quality and diversity of the training data used.

One potential concern is the risk of over-reliance on such systems, which could lead to a false sense of security or the unintended suppression of legitimate content. It will be important to continue studying the societal implications of this technology as it develops.

Further research is also needed to understand how these models might be circumvented or fooled, and to explore more advanced techniques for detecting machine-generated content. Maintaining a critical eye on the capabilities and limitations of these tools will be crucial as they are applied in real-world settings.

Conclusion

This paper presents a novel approach to the problem of locating machine-generated text within larger documents. By focusing on identifying specific regions of AI-generated content, rather than classifying entire documents, the researchers have developed a tool that could have important applications in the fight against online misinformation and the authentication of digital content.

While the current system has room for improvement, this work represents a significant step forward in the broader effort to understand and address the challenges posed by the increasing sophistication of language models. As this technology continues to evolve, it will be crucial to study its societal implications and maintain a balanced, critical perspective on its capabilities and limitations.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Machine-Generated Text Localization

Zhongping Zhang, Wenda Qin, Bryan A. Plummer

Machine-Generated Text (MGT) detection aims to identify a piece of text as machine or human written. Prior work has primarily formulated MGT detection as a binary classification task over an entire document, with limited work exploring cases where only part of a document is machine generated. This paper provides the first in-depth study of MGT that localizes the portions of a document that were machine generated. Thus, if a bad actor were to change a key portion of a news article to spread misinformation, whole document MGT detection may fail since the vast majority is human written, but our approach can succeed due to its granular approach. A key challenge in our MGT localization task is that short spans of text, e.g., a single sentence, provides little information indicating if it is machine generated due to its short length. To address this, we leverage contextual information, where we predict whether multiple sentences are machine or human written at once. This enables our approach to identify changes in style or content to boost performance. A gain of 4-13% mean Average Precision (mAP) over prior work demonstrates the effectiveness of approach on five diverse datasets: GoodNews, VisualNews, WikiText, Essay, and WP. We release our implementation at https://github.com/Zhongping-Zhang/MGT_Localization.

6/12/2024

M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection

Yuxia Wang, Jonibek Mansurov, Petar Ivanov, Jinyan Su, Artem Shelmanov, Akim Tsvigun, Osama Mohanned Afzal, Tarek Mahmoud, Giovanni Puccetti, Thomas Arnold, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, Preslav Nakov

The advent of Large Language Models (LLMs) has brought an unprecedented surge in machine-generated text (MGT) across diverse channels. This raises legitimate concerns about its potential misuse and societal implications. The need to identify and differentiate such content from genuine human-generated text is critical in combating disinformation, preserving the integrity of education and scientific fields, and maintaining trust in communication. In this work, we address this problem by introducing a new benchmark based on a multilingual, multi-domain, and multi-generator corpus of MGTs -- M4GT-Bench. The benchmark is compiled of three tasks: (1) mono-lingual and multi-lingual binary MGT detection; (2) multi-way detection where one need to identify, which particular model generated the text; and (3) mixed human-machine text detection, where a word boundary delimiting MGT from human-written content should be determined. On the developed benchmark, we have tested several MGT detection baselines and also conducted an evaluation of human performance. We see that obtaining good performance in MGT detection usually requires an access to the training data from the same domain and generators. The benchmark is available at https://github.com/mbzuai-nlp/M4GT-Bench.

6/28/2024

Deciphering Textual Authenticity: A Generalized Strategy through the Lens of Large Language Semantics for Detecting Human vs. Machine-Generated Text

Mazal Bethany, Brandon Wherry, Emet Bethany, Nishant Vishwamitra, Anthony Rios, Peyman Najafirad

With the recent proliferation of Large Language Models (LLMs), there has been an increasing demand for tools to detect machine-generated text. The effective detection of machine-generated text face two pertinent problems: First, they are severely limited in generalizing against real-world scenarios, where machine-generated text is produced by a variety of generators, including but not limited to GPT-4 and Dolly, and spans diverse domains, ranging from academic manuscripts to social media posts. Second, existing detection methodologies treat texts produced by LLMs through a restrictive binary classification lens, neglecting the nuanced diversity of artifacts generated by different LLMs. In this work, we undertake a systematic study on the detection of machine-generated text in real-world scenarios. We first study the effectiveness of state-of-the-art approaches and find that they are severely limited against text produced by diverse generators and domains in the real world. Furthermore, t-SNE visualizations of the embeddings from a pretrained LLM's encoder show that they cannot reliably distinguish between human and machine-generated text. Based on our findings, we introduce a novel system, T5LLMCipher, for detecting machine-generated text using a pretrained T5 encoder combined with LLM embedding sub-clustering to address the text produced by diverse generators and domains in the real world. We evaluate our approach across 9 machine-generated text systems and 9 domains and find that our approach provides state-of-the-art generalization ability, with an average increase in F1 score on machine-generated text of 19.6% on unseen generators and domains compared to the top performing existing approaches and correctly attributes the generator of text with an accuracy of 93.6%.

4/4/2024

Sharif-MGTD at SemEval-2024 Task 8: A Transformer-Based Approach to Detect Machine Generated Text

Seyedeh Fatemeh Ebrahimi, Karim Akhavan Azari, Amirmasoud Iravani, Arian Qazvini, Pouya Sadeghi, Zeinab Sadat Taghavi, Hossein Sameti

Detecting Machine-Generated Text (MGT) has emerged as a significant area of study within Natural Language Processing. While language models generate text, they often leave discernible traces, which can be scrutinized using either traditional feature-based methods or more advanced neural language models. In this research, we explore the effectiveness of fine-tuning a RoBERTa-base transformer, a powerful neural architecture, to address MGT detection as a binary classification task. Focusing specifically on Subtask A (Monolingual-English) within the SemEval-2024 competition framework, our proposed system achieves an accuracy of 78.9% on the test dataset, positioning us at 57th among participants. Our study addresses this challenge while considering the limited hardware resources, resulting in a system that excels at identifying human-written texts but encounters challenges in accurately discerning MGTs.

7/17/2024