Differentiating between human-written and AI-generated texts using linguistic features automatically extracted from an online computational tool

Read original: arXiv:2407.03646 - Published 7/12/2024 by Georgios P. Georgiou

🔎

Overview

Researchers aimed to compare linguistic features between human-written and AI-generated text.
They used ChatGPT to generate essays and analyzed them alongside human-authored essays.
The analysis revealed significant differences across multiple linguistic aspects like consonants, word stress, nouns, verbs, and use of difficult words.
This highlights the need for improved training methods to make AI-generated text more human-like.

Plain English Explanation

Researchers wanted to understand how the language used in texts written by humans differs from text generated by artificial intelligence (AI) chatbots like ChatGPT. They took human-written essays as a reference and asked ChatGPT to write essays of similar length. Then, they used a computational tool called Open Brain AI to analyze the linguistic features of both sets of essays.

Despite the AI-generated essays seeming to mimic human speech, the analysis showed there were significant differences in areas like the use of consonants, word stress, types of words (nouns, verbs, pronouns, etc.), and the use of complex vocabulary.

These findings suggest that current AI systems still struggle to fully capture the nuances and complexities of human language. More advanced training methods are needed to help AI produce text that is truly indistinguishable from text written by people.

Technical Explanation

The researchers collected a set of human-authored essays and prompted ChatGPT to generate essays of equivalent length. They then used the Open Brain AI online tool to analyze the linguistic features of both the human-written and AI-generated texts.

The analysis looked at various linguistic components, including:

Phonological: Measures related to consonants and word stress
Morphological: Representation of nouns, verbs, pronouns, etc.
Syntactic: Structures like direct objects, prepositional modifiers, etc.
Lexical: Use of difficult or uncommon words

Despite the AI-generated essays appearing to mimic human speech, the results revealed significant differences across multiple linguistic features. The AI texts showed distinct patterns in areas like consonant usage, word stress, noun and verb usage, pronoun distribution, and the prevalence of complex vocabulary.

These findings highlight the continued challenges in developing AI systems that can truly emulate human language at a deep linguistic level. The researchers emphasize the need for enhanced training methodologies to improve the capacity of AI for producing more human-like text.

Critical Analysis

The study provides valuable insights into the linguistic differences between human-written and AI-generated text, an important area of research as AI systems become more advanced in language generation. However, the researchers acknowledge some limitations:

The study focused on a single AI system (ChatGPT) and a specific type of text (essays). Expanding the analysis to a wider range of AI models and text genres could yield additional insights.
The use of Open Brain AI as the analysis tool, while convenient, may have its own limitations or biases that could influence the findings.
The study does not delve into the potential reasons or underlying mechanisms behind the observed linguistic differences, which could be an area for further investigation.

Additionally, one could question whether the goal of making AI-generated text indistinguishable from human-written text is necessarily desirable. There may be ethical considerations around the potential for deception or the need for transparency about the use of AI in content creation.

Conclusion

This study provides a systematic comparison of linguistic features between human-written and AI-generated text, highlighting the continued gap in AI's ability to fully emulate human language. The findings underscore the importance of developing more advanced training methods to improve the capacity of AI systems to produce text that is truly human-like.

As AI technology continues to advance, understanding these linguistic nuances will be crucial for ensuring the responsible and transparent use of AI in content creation, communication, and other applications. The insights from this research can inform the ongoing efforts to enhance the linguistic capabilities of AI and promote more accurate and ethical language generation.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🔎

Differentiating between human-written and AI-generated texts using linguistic features automatically extracted from an online computational tool

Georgios P. Georgiou

While extensive research has focused on ChatGPT in recent years, very few studies have systematically quantified and compared linguistic features between human-written and Artificial Intelligence (AI)-generated language. This study aims to investigate how various linguistic components are represented in both types of texts, assessing the ability of AI to emulate human writing. Using human-authored essays as a benchmark, we prompted ChatGPT to generate essays of equivalent length. These texts were analyzed using Open Brain AI, an online computational tool, to extract measures of phonological, morphological, syntactic, and lexical constituents. Despite AI-generated texts appearing to mimic human speech, the results revealed significant differences across multiple linguistic features such as consonants, word stress, nouns, verbs, pronouns, direct objects, prepositional modifiers, and use of difficult words among others. These findings underscore the importance of integrating automated tools for efficient language assessment, reducing time and effort in data analysis. Moreover, they emphasize the necessity for enhanced training methodologies to improve the capacity of AI for producing more human-like text.

7/12/2024

🤖

Decoding AI and Human Authorship: Nuances Revealed Through NLP and Statistical Analysis

Mayowa Akinwande, Oluwaseyi Adeliyi, Toyyibat Yussuph

This research explores the nuanced differences in texts produced by AI and those written by humans, aiming to elucidate how language is expressed differently by AI and humans. Through comprehensive statistical data analysis, the study investigates various linguistic traits, patterns of creativity, and potential biases inherent in human-written and AI- generated texts. The significance of this research lies in its contribution to understanding AI's creative capabilities and its impact on literature, communication, and societal frameworks. By examining a meticulously curated dataset comprising 500K essays spanning diverse topics and genres, generated by LLMs, or written by humans, the study uncovers the deeper layers of linguistic expression and provides insights into the cognitive processes underlying both AI and human-driven textual compositions. The analysis revealed that human-authored essays tend to have a higher total word count on average than AI-generated essays but have a shorter average word length compared to AI- generated essays, and while both groups exhibit high levels of fluency, the vocabulary diversity of Human authored content is higher than AI generated content. However, AI- generated essays show a slightly higher level of novelty, suggesting the potential for generating more original content through AI systems. The paper addresses challenges in assessing the language generation capabilities of AI models and emphasizes the importance of datasets that reflect the complexities of human-AI collaborative writing. Through systematic preprocessing and rigorous statistical analysis, this study offers valuable insights into the evolving landscape of AI-generated content and informs future developments in natural language processing (NLP).

8/6/2024

❗

Distinguishing Chatbot from Human

Gauri Anil Godghase, Rishit Agrawal, Tanush Obili, Mark Stamp

There have been many recent advances in the fields of generative Artificial Intelligence (AI) and Large Language Models (LLM), with the Generative Pre-trained Transformer (GPT) model being a leading chatbot. LLM-based chatbots have become so powerful that it may seem difficult to differentiate between human-written and machine-generated text. To analyze this problem, we have developed a new dataset consisting of more than 750,000 human-written paragraphs, with a corresponding chatbot-generated paragraph for each. Based on this dataset, we apply Machine Learning (ML) techniques to determine the origin of text (human or chatbot). Specifically, we consider two methodologies for tackling this issue: feature analysis and embeddings. Our feature analysis approach involves extracting a collection of features from the text for classification. We also explore the use of contextual embeddings and transformer-based architectures to train classification models. Our proposed solutions offer high classification accuracy and serve as useful tools for textual analysis, resulting in a better understanding of chatbot-generated text in this era of advanced AI technology.

8/12/2024

🌀

Contrasting Linguistic Patterns in Human and LLM-Generated Text

Alberto Mu~noz-Ortiz, Carlos G'omez-Rodr'iguez, David Vilares

We conduct a quantitative analysis contrasting human-written English news text with comparable large language model (LLM) output from six different LLMs that cover three different families and four sizes in total. Our analysis spans several measurable linguistic dimensions, including morphological, syntactic, psychometric, and sociolinguistic aspects. The results reveal various measurable differences between human and AI-generated texts. Human texts exhibit more scattered sentence length distributions, more variety of vocabulary, a distinct use of dependency and constituent types, shorter constituents, and more optimized dependency distances. Humans tend to exhibit stronger negative emotions (such as fear and disgust) and less joy compared to text generated by LLMs, with the toxicity of these models increasing as their size grows. LLM outputs use more numbers, symbols and auxiliaries (suggesting objective language) than human texts, as well as more pronouns. The sexist bias prevalent in human text is also expressed by LLMs, and even magnified in all of them but one. Differences between LLMs and humans are larger than between LLMs.

9/4/2024