Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning

2406.02251

Published 6/5/2024 by Lukas Christ, Shahin Amiriparian, Manuel Milling, Ilhan Aslan, Bjorn W. Schuller

Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning

Abstract

Telling stories is an integral part of human communication which can evoke emotions and influence the affective states of the audience. Automatically modeling emotional trajectories in stories has thus attracted considerable scholarly interest. However, as most existing works have been limited to unsupervised dictionary-based approaches, there is no benchmark for this task. We address this gap by introducing continuous valence and arousal labels for an existing dataset of children's stories originally annotated with discrete emotion categories. We collect additional annotations for this data and map the categorical labels to the continuous valence and arousal space. For predicting the thus obtained emotionality signals, we fine-tune a DeBERTa model and improve upon this baseline via a weakly supervised learning approach. The best configuration achieves a Concordance Correlation Coefficient (CCC) of $.8221$ for valence and $.7125$ for arousal on the test set, demonstrating the efficacy of our proposed approach. A detailed analysis shows the extent to which the results vary depending on factors such as the author, the individual story, or the section within the story. In addition, we uncover the weaknesses of our approach by investigating examples that prove to be difficult to predict.

Create account to get full access

Overview

This paper explores the use of transformers and weakly-supervised learning to model emotional trajectories in written stories.
The researchers developed a novel approach that combines language models, emotion recognition, and narrative analysis to better understand how emotions evolve over the course of a story.
The proposed method was evaluated on a dataset of short stories and demonstrated improved performance compared to existing techniques.

Plain English Explanation

The researchers in this study were interested in understanding how emotions are expressed and change over the course of a written story. To do this, they used a type of artificial intelligence (AI) model called a transformer, which is good at analyzing and understanding language. They combined this with a "weakly-supervised" approach, which means the model was trained on data that wasn't perfectly labeled, but was still able to learn meaningful patterns.

The key idea was to track how the emotional tone of a story shifts and evolves from the beginning to the end. For example, a story might start off feeling sad, then become more hopeful, and finally end on a triumphant note. The researchers wanted to be able to capture these emotional arcs or "trajectories" automatically, without needing detailed human annotations of every sentence.

To test their approach, the researchers used a dataset of short stories and had their model analyze the emotional content. The proposed method demonstrated improved performance compared to other existing techniques for this task.

By being able to model emotional trajectories in this way, the researchers hope to gain deeper insights into how stories are structured and how authors use language to convey feelings and experiences to readers. This could have applications in areas like automatic emotion recognition in conversations, identifying the most impactful parts of a text, and generally improving our understanding of how language and emotion are intertwined.

Technical Explanation

The researchers' approach builds on recent advancements in transformer-based language models and weakly-supervised learning. They first fine-tuned a pre-trained transformer model on a large corpus of text to obtain contextual word embeddings. These embeddings were then used as input to an emotion recognition module, which predicted the emotional valence (positivity/negativity) and arousal (intensity) for each word in a story.

By aggregating the emotion predictions at the sentence or paragraph level, the researchers were able to construct an "emotional trajectory" that captures how the story's emotional tone changes over time. They experimented with different pooling strategies and model architectures to optimize this process.

Crucially, the researchers did not rely on stories having detailed, human-annotated emotional labels. Instead, they used a weakly-supervised approach that learned from noisy, automatically-generated labels. This allowed them to scale the analysis to a much larger corpus of stories compared to previous studies that required extensive manual annotations.

The researchers evaluated their method on a dataset of short stories and found that it outperformed several baseline approaches, including those that used traditional sentiment analysis techniques. This suggests the value of the transformers and weakly-supervised learning in capturing the nuanced emotional dynamics present in written narratives.

Critical Analysis

One key limitation of the research is that it focused solely on the textual content of stories, without considering other modalities like images or audio that may also convey emotional information. Future work could explore integrating multimodal data to build a more comprehensive understanding of emotional trajectories.

Additionally, the dataset used in the study was relatively small, consisting of only a few hundred short stories. Larger and more diverse datasets may be needed to fully validate the generalizability of the proposed approach. The researchers also acknowledged that their weakly-supervised labels may contain some biases or errors, which could impact the model's performance.

Another area for potential improvement is the interpretability of the emotional trajectories generated by the model. While the researchers demonstrated quantitative improvements, it would be helpful to gain more qualitative insights into how the model's outputs align with human intuitions about the emotional arcs of stories.

Despite these limitations, the work represents an important step forward in leveraging transformers and weakly-supervised learning to better understand the emotional dynamics of written narratives. The researchers have laid the groundwork for future studies that could further refine and expand this line of inquiry.

Conclusion

This paper presents a novel approach for modeling emotional trajectories in written stories using transformers and weakly-supervised learning. The researchers were able to demonstrate improved performance over existing techniques, suggesting the value of their method in capturing the nuanced emotional dynamics present in narrative texts.

While the study has some limitations, it represents an important advancement in our understanding of how language and emotion intersect. The ability to automatically track the emotional arcs of stories could have applications in areas like narrative analysis, affective computing, and the development of more empathetic AI systems. As the field continues to evolve, we can expect to see further innovations in this direction.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Improved Text Emotion Prediction Using Combined Valence and Arousal Ordinal Classification

Michael Mitsios, Georgios Vamvoukakis, Georgia Maniati, Nikolaos Ellinas, Georgios Dimitriou, Konstantinos Markopoulos, Panos Kakoulidis, Alexandra Vioni, Myrsini Christidou, Junkwang Oh, Gunu Jho, Inchul Hwang, Georgios Vardaxoglou, Aimilios Chalamandaris, Pirros Tsiakoulis, Spyros Raptis

Emotion detection in textual data has received growing interest in recent years, as it is pivotal for developing empathetic human-computer interaction systems. This paper introduces a method for categorizing emotions from text, which acknowledges and differentiates between the diversified similarities and distinctions of various emotions. Initially, we establish a baseline by training a transformer-based model for standard emotion classification, achieving state-of-the-art performance. We argue that not all misclassifications are of the same importance, as there are perceptual similarities among emotional classes. We thus redefine the emotion labeling problem by shifting it from a traditional classification model to an ordinal classification one, where discrete emotions are arranged in a sequential order according to their valence levels. Finally, we propose a method that performs ordinal classification in the two-dimensional emotion space, considering both valence and arousal scales. The results show that our approach not only preserves high accuracy in emotion prediction but also significantly reduces the magnitude of errors in cases of misclassification.

4/3/2024

cs.LG

Transformer based neural networks for emotion recognition in conversations

Claudiu Creanga, Liviu P. Dinu

This paper outlines the approach of the ISDS-NLP team in the SemEval 2024 Task 10: Emotion Discovery and Reasoning its Flip in Conversation (EDiReF). For Subtask 1 we obtained a weighted F1 score of 0.43 and placed 12 in the leaderboard. We investigate two distinct approaches: Masked Language Modeling (MLM) and Causal Language Modeling (CLM). For MLM, we employ pre-trained BERT-like models in a multilingual setting, fine-tuning them with a classifier to predict emotions. Experiments with varying input lengths, classifier architectures, and fine-tuning strategies demonstrate the effectiveness of this approach. Additionally, we utilize Mistral 7B Instruct V0.2, a state-of-the-art model, applying zero-shot and few-shot prompting techniques. Our findings indicate that while Mistral shows promise, MLMs currently outperform them in sentence-level emotion classification.

5/21/2024

cs.CL

Cluster-to-Predict Affect Contours from Speech

Gokhan Kuc{s}c{c}u, Engin Erzin

Continuous emotion recognition (CER) aims to track the dynamic changes in a person's emotional state over time. This paper proposes a novel approach to translating CER into a prediction problem of dynamic affect-contour clusters from speech, where the affect-contour is defined as the contour of annotated affect attributes in a temporal window. Our approach defines a cluster-to-predict (C2P) framework that learns affect-contour clusters, which are predicted from speech with higher precision. To achieve this, C2P runs an unsupervised iterative optimization process to learn affect-contour clusters by minimizing both clustering loss and speech-driven affect-contour prediction loss. Our objective findings demonstrate the value of speech-driven clustering for both arousal and valence attributes. Experiments conducted on the RECOLA dataset yielded promising classification results, with F1 scores of 0.84 for arousal and 0.75 for valence in our four-class speech-driven affect-contour prediction model.

6/6/2024

eess.AS cs.HC

You are an expert annotator: Automatic Best-Worst-Scaling Annotations for Emotion Intensity Modeling

Christopher Bagdon, Prathamesh Karmalker, Harsha Gurulingappa, Roman Klinger

Labeling corpora constitutes a bottleneck to create models for new tasks or domains. Large language models mitigate the issue with automatic corpus labeling methods, particularly for categorical annotations. Some NLP tasks such as emotion intensity prediction, however, require text regression, but there is no work on automating annotations for continuous label assignments. Regression is considered more challenging than classification: The fact that humans perform worse when tasked to choose values from a rating scale lead to comparative annotation methods, including best-worst scaling. This raises the question if large language model-based annotation methods show similar patterns, namely that they perform worse on rating scale annotation tasks than on comparative annotation tasks. To study this, we automate emotion intensity predictions and compare direct rating scale predictions, pairwise comparisons and best-worst scaling. We find that the latter shows the highest reliability. A transformer regressor fine-tuned on these data performs nearly on par with a model trained on the original manual annotations.

4/23/2024

cs.CL