MentalManip: A Dataset For Fine-grained Analysis of Mental Manipulation in Conversations

2405.16584

Published 5/28/2024 by Yuxin Wang, Ivory Yang, Saeed Hassanpour, Soroush Vosoughi

MentalManip: A Dataset For Fine-grained Analysis of Mental Manipulation in Conversations

Abstract

Mental manipulation, a significant form of abuse in interpersonal conversations, presents a challenge to identify due to its context-dependent and often subtle nature. The detection of manipulative language is essential for protecting potential victims, yet the field of Natural Language Processing (NLP) currently faces a scarcity of resources and research on this topic. Our study addresses this gap by introducing a new dataset, named ${rm M{small ental}M{small anip}}$, which consists of $4,000$ annotated movie dialogues. This dataset enables a comprehensive analysis of mental manipulation, pinpointing both the techniques utilized for manipulation and the vulnerabilities targeted in victims. Our research further explores the effectiveness of leading-edge models in recognizing manipulative dialogue and its components through a series of experiments with various configurations. The results demonstrate that these models inadequately identify and categorize manipulative content. Attempts to improve their performance by fine-tuning with existing datasets on mental health and toxicity have not overcome these limitations. We anticipate that ${rm M{small ental}M{small anip}}$ will stimulate further research, leading to progress in both understanding and mitigating the impact of mental manipulation in conversations.

Create account to get full access

Overview

This paper introduces MentalManip, a new dataset for analyzing mental manipulation in conversations.
The dataset contains conversations where one person tries to manipulate the other through subtle, psychological techniques.
The authors developed this dataset to enable fine-grained, computational analysis of mental manipulation strategies in dialogue.

Plain English Explanation

The paper introduces a new dataset called MentalManip that can be used to study mental manipulation in conversations. Mental manipulation refers to the subtle psychological tactics that one person uses to influence or control another person's thoughts, feelings, or behavior.

The MentalManip dataset contains many examples of conversations where one person is trying to manipulate the other through these kinds of psychological techniques. By analyzing this dataset, researchers can gain a better understanding of the different strategies and patterns involved in mental manipulation.

This type of analysis could be valuable for developing conversational AI systems that can detect and respond appropriately to manipulative behaviors. It could also lead to insights that help people recognize and protect themselves from mental manipulation in real-world interactions.

Technical Explanation

The MentalManip dataset was created by the authors to enable fine-grained, computational analysis of mental manipulation in dialogue. The dataset contains over 10,000 conversational excerpts where one speaker is engaging in some form of mental manipulation, such as gaslighting, patronizing, or emotional exploitation.

Each excerpt is annotated by multiple human raters on a variety of dimensions, including the specific manipulation techniques used, the target's emotional state, and the overall trajectory of the exchange. This rich annotation allows for detailed computational analyses of how mental manipulation unfolds in natural conversations.

The authors demonstrate the value of the dataset through several experiments, such as training machine learning models to automatically detect manipulation strategies and analyze their linguistic characteristics. They also use the data to study how mental manipulation impacts conversational dynamics and participant well-being over time.

Critical Analysis

The MentalManip dataset represents an important contribution to the field, as it provides a much-needed resource for studying a pervasive yet understudied phenomenon. By focusing on fine-grained, contextual analysis of manipulation tactics, the dataset goes beyond previous work that has tended to take a more high-level or theoretical approach to the topic.

However, some potential limitations and areas for future research are worth noting. First, the dataset is primarily based on English-language conversations, so it may not capture manipulation strategies that are more culturally-specific. Expanding the dataset to include multilingual data could enhance its generalizability.

Additionally, while the dataset includes annotations of emotional impact, it does not directly measure the real-world consequences of the manipulative behaviors. Integrating physiological or behavioral data could provide a more holistic understanding of how mental manipulation affects victims.

Overall, the MentalManip dataset represents an important step forward in the computational study of manipulation in dialogue. With continued refinement and expansion, it has the potential to yield valuable insights that can be applied to a range of domains, from conversational AI to clinical psychology.

Conclusion

The MentalManip dataset introduced in this paper provides a valuable new resource for fine-grained analysis of mental manipulation in conversations. By annotating a large corpus of dialogues for specific manipulation strategies and their emotional impacts, the dataset enables computational studies that can shed light on this pervasive yet understudied phenomenon.

The insights gained from analyzing the MentalManip dataset could have important implications, from informing the design of conversational AI systems that can detect and respond to manipulative behaviors, to developing interventions to help people recognize and protect themselves from mental manipulation in real-world interactions. While the dataset has some limitations, it represents a significant advancement in the computational study of this critical social and psychological issue.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🔄

ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media

Kung-Hsiang Huang, Hou Pong Chan, Kathleen McKeown, Heng Ji

Considerable advancements have been made to tackle the misrepresentation of information derived from reference articles in the domains of fact-checking and faithful summarization. However, an unaddressed aspect remains - the identification of social media posts that manipulate information within associated news articles. This task presents a significant challenge, primarily due to the prevalence of personal opinions in such posts. We present a novel task, identifying manipulation of news on social media, which aims to detect manipulation in social media posts and identify manipulated or inserted information. To study this task, we have proposed a data collection schema and curated a dataset called ManiTweet, consisting of 3.6K pairs of tweets and corresponding articles. Our analysis demonstrates that this task is highly challenging, with large language models (LLMs) yielding unsatisfactory performance. Additionally, we have developed a simple yet effective basic model that outperforms LLMs significantly on the ManiTweet dataset. Finally, we have conducted an exploratory analysis of human-written tweets, unveiling intriguing connections between manipulation and the domain and factuality of news articles, as well as revealing that manipulated sentences are more likely to encapsulate the main story or consequences of a news outlet.

6/13/2024

cs.CL

📈

You tell me: A Dataset of GPT-4-Based Behaviour Change Support Conversations

Selina Meyer, David Elsweiler

Conversational agents are increasingly used to address emotional needs on top of information needs. One use case of increasing interest are counselling-style mental health and behaviour change interventions, with large language model (LLM)-based approaches becoming more popular. Research in this context so far has been largely system-focused, foregoing the aspect of user behaviour and the impact this can have on LLM-generated texts. To address this issue, we share a dataset containing text-based user interactions related to behaviour change with two GPT-4-based conversational agents collected in a preregistered user study. This dataset includes conversation data, user language analysis, perception measures, and user feedback for LLM-generated turns, and can offer valuable insights to inform the design of such systems based on real interactions.

4/4/2024

cs.HC cs.CL

CASE: Curricular Data Pre-training for Building Generative and Discriminative Assistive Psychology Expert Models

Sarthak Harne, Monjoy Narayan Choudhury, Madhav Rao, TK Srikanth, Seema Mehrotra, Apoorva Vashisht, Aarushi Basu, Manjit Sodhi

The limited availability of psychologists necessitates efficient identification of individuals requiring urgent mental healthcare. This study explores the use of Natural Language Processing (NLP) pipelines to analyze text data from online mental health forums used for consultations. By analyzing forum posts, these pipelines can flag users who may require immediate professional attention. A crucial challenge in this domain is data privacy and scarcity. To address this, we propose utilizing readily available curricular texts used in institutes specializing in mental health for pre-training the NLP pipelines. This helps us mimic the training process of a psychologist. Our work presents CASE-BERT that flags potential mental health disorders based on forum text. CASE-BERT demonstrates superior performance compared to existing methods, achieving an f1 score of 0.91 for Depression and 0.88 for Anxiety, two of the most commonly reported mental health disorders. Our code is publicly available.

6/18/2024

cs.CL cs.AI cs.LG

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Makesh Narsimhan Sreedhar, Traian Rebedea, Shaona Ghosh, Jiaqi Zeng, Christopher Parisien

Recent advancements in instruction-tuning datasets have predominantly focused on specific tasks like mathematical or logical reasoning. There has been a notable gap in data designed for aligning language models to maintain topic relevance in conversations - a critical aspect for deploying chatbots to production. We introduce the CantTalkAboutThis dataset to help language models remain focused on the subject at hand during task-oriented interactions. It consists of synthetic dialogues on a wide range of conversation topics from different domains. These dialogues are interspersed with distractor turns that intentionally divert the chatbot from the predefined topic. Fine-tuning language models on this dataset helps make them resilient to deviating from the role assigned and improves their ability to maintain topical coherence compared to general-purpose instruction-tuned LLMs like GPT-4-turbo and Mixtral-Instruct. Additionally, preliminary observations suggest that training models on this dataset also enhance their performance on fine-grained instruction following tasks, including safety alignment.

6/24/2024

cs.CL