nicolay-r at SemEval-2024 Task 3: Using Flan-T5 for Reasoning Emotion Cause in Conversations with Chain-of-Thought on Emotion States

2404.03361

Published 4/5/2024 by Nicolay Rusnachenko, Huizhi Liang

❗

Abstract

Emotion expression is one of the essential traits of conversations. It may be self-related or caused by another speaker. The variety of reasons may serve as a source of the further emotion causes: conversation history, speaker's emotional state, etc. Inspired by the most recent advances in Chain-of-Thought, in this work, we exploit the existing three-hop reasoning approach (THOR) to perform large language model instruction-tuning for answering: emotion states (THOR-state), and emotion caused by one speaker to the other (THOR-cause). We equip THOR-cause with the reasoning revision (rr) for devising a reasoning path in fine-tuning. In particular, we rely on the annotated speaker emotion states to revise reasoning path. Our final submission, based on Flan-T5-base (250M) and the rule-based span correction technique, preliminary tuned with THOR-state and fine-tuned with THOR-cause-rr on competition training data, results in 3rd and 4th places (F1-proportional) and 5th place (F1-strict) among 15 participating teams. Our THOR implementation fork is publicly available: https://github.com/nicolay-r/THOR-ECAC

Create account to get full access

Overview

The paper explores the use of large language models for emotion expression analysis in conversations.
It proposes a three-hop reasoning approach (THOR) to perform instruction-tuning for answering questions about emotion states and the emotion caused by one speaker to another.
The THOR-cause model is equipped with reasoning revision (rr) to refine the reasoning path during fine-tuning, using annotated speaker emotion states.
The final model, based on Flan-T5-base and a rule-based span correction technique, achieved top-3 and top-5 results in a competition.

Plain English Explanation

Conversations often involve the expression of emotions, which can be related to the speakers themselves or influenced by what the other person says. The researchers in this study looked at the different reasons behind these emotional reactions, such as the conversation history or the speakers' emotional states.

Inspired by recent advances in chain-of-thought reasoning, the researchers used a three-hop approach called THOR to train large language models to answer two types of questions: identifying the emotion states of the speakers (THOR-state) and determining the emotion caused by one speaker to the other (THOR-cause).

For the THOR-cause model, the researchers added a "reasoning revision" (rr) component, which uses the annotated emotion states of the speakers to refine the reasoning path during the fine-tuning process. This helps the model better understand the causal relationships between speakers' statements and the emotions they evoke.

The final model, built on the Flan-T5-base language model and a rule-based span correction technique, was tested in a competition and achieved impressive results, ranking 3rd, 4th, and 5th among 15 participating teams.

Technical Explanation

The researchers leveraged the THOR (Three-Hop Reasoning) approach to perform instruction-tuning of large language models for two tasks: identifying the emotion states of speakers (THOR-state) and determining the emotion caused by one speaker to the other (THOR-cause).

For the THOR-cause task, the researchers added a "reasoning revision" (rr) component, which uses the annotated emotion states of the speakers to refine the reasoning path during the fine-tuning process. This helps the model better understand the causal relationships between speakers' statements and the emotions they evoke.

The final model was built on the Flan-T5-base (250M) language model and used a rule-based span correction technique. It was first tuned with the THOR-state task and then fine-tuned with the THOR-cause-rr task, using the competition training data.

The results of this approach were impressive, with the final model achieving 3rd, 4th, and 5th places among 15 participating teams in the competition, based on F1-proportional and F1-strict metrics.

Critical Analysis

The paper presents a novel approach to emotion expression analysis in conversations by leveraging the THOR reasoning framework and incorporating a reasoning revision component for the THOR-cause task. This combination of techniques appears to have been effective in improving the model's performance on the competition tasks.

However, the paper does not provide much detail on the specific architectures or implementation details of the THOR-state and THOR-cause-rr models. Additionally, the dataset used for training and evaluation is not described in depth, which makes it difficult to assess the generalizability of the approach.

Furthermore, the paper does not discuss any potential limitations or caveats of the proposed method. For example, it would be interesting to understand how the THOR-cause-rr approach handles cases where the emotion caused by one speaker is not directly evident from the conversation history or the annotated emotion states.

Nonetheless, the researchers have made their THOR implementation fork publicly available, which allows other researchers to build upon their work and explore potential improvements or extensions.

Conclusion

This research demonstrates the potential of large language models, combined with advanced reasoning techniques, to tackle the challenge of emotion expression analysis in conversations. The THOR-state and THOR-cause-rr models, built on the Flan-T5-base architecture, achieved impressive results in a competition, showcasing the effectiveness of this approach.

While the paper leaves room for further exploration of the method's limitations and potential improvements, the researchers have made a valuable contribution to the field of emotion recognition and understanding in natural language processing. Their publicly available implementation provides a solid foundation for future work in this area.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

ECR-Chain: Advancing Generative Language Models to Better Emotion-Cause Reasoners through Reasoning Chains

Zhaopei Huang, Jinming Zhao, Qin Jin

Understanding the process of emotion generation is crucial for analyzing the causes behind emotions. Causal Emotion Entailment (CEE), an emotion-understanding task, aims to identify the causal utterances in a conversation that stimulate the emotions expressed in a target utterance. However, current works in CEE mainly focus on modeling semantic and emotional interactions in conversations, neglecting the exploration of the emotion-generation process. This hinders the models from deeply understanding emotions, restricting their ability to produce explainable predictions. In this work, inspired by the emotion generation process of stimulus-appraisal-emotion in the cognitive appraisal theory, we introduce a step-by-step reasoning method, Emotion-Cause Reasoning Chain (ECR-Chain), to infer the stimulus from the target emotional expressions in conversations. Specifically, we first introduce the ECR-Chain to ChatGPT via few-shot prompting, which significantly improves its performance on the CEE task. We further propose an automated construction process to utilize ChatGPT in building an ECR-Chain set, which can enhance the reasoning abilities of smaller models through supervised training and assist the Vicuna-7B model in achieving state-of-the-art CEE performance. Moreover, our methods can enable these generative language models to effectively perform emotion-cause reasoning in an explainable manner. Our code, data and more details are at https://github.com/hzp3517/ECR-Chain.

5/22/2024

cs.CL

🛸

SemEval-2024 Task 3: Multimodal Emotion Cause Analysis in Conversations

Fanfan Wang, Heqing Ma, Jianfei Yu, Rui Xia, Erik Cambria

The ability to understand emotions is an essential component of human-like artificial intelligence, as emotions greatly influence human cognition, decision making, and social interactions. In addition to emotion recognition in conversations, the task of identifying the potential causes behind an individual's emotional state in conversations, is of great importance in many application scenarios. We organize SemEval-2024 Task 3, named Multimodal Emotion Cause Analysis in Conversations, which aims at extracting all pairs of emotions and their corresponding causes from conversations. Under different modality settings, it consists of two subtasks: Textual Emotion-Cause Pair Extraction in Conversations (TECPE) and Multimodal Emotion-Cause Pair Extraction in Conversations (MECPE). The shared task has attracted 143 registrations and 216 successful submissions. In this paper, we introduce the task, dataset and evaluation settings, summarize the systems of the top teams, and discuss the findings of the participants.

6/12/2024

cs.CL cs.AI cs.MM

Samsung Research China-Beijing at SemEval-2024 Task 3: A multi-stage framework for Emotion-Cause Pair Extraction in Conversations

Shen Zhang, Haojie Zhang, Jing Zhang, Xudong Zhang, Yimeng Zhuang, Jinting Wu

In human-computer interaction, it is crucial for agents to respond to human by understanding their emotions. Unraveling the causes of emotions is more challenging. A new task named Multimodal Emotion-Cause Pair Extraction in Conversations is responsible for recognizing emotion and identifying causal expressions. In this study, we propose a multi-stage framework to generate emotion and extract the emotion causal pairs given the target emotion. In the first stage, Llama-2-based InstructERC is utilized to extract the emotion category of each utterance in a conversation. After emotion recognition, a two-stream attention model is employed to extract the emotion causal pairs given the target emotion for subtask 2 while MuTEC is employed to extract causal span for subtask 1. Our approach achieved first place for both of the two subtasks in the competition.

4/29/2024

cs.CL cs.SD eess.AS

PetKaz at SemEval-2024 Task 3: Advancing Emotion Classification with an LLM for Emotion-Cause Pair Extraction in Conversations

Roman Kazakov, Kseniia Petukhova, Ekaterina Kochmar

In this paper, we present our submission to the SemEval-2023 Task~3 The Competition of Multimodal Emotion Cause Analysis in Conversations, focusing on extracting emotion-cause pairs from dialogs. Specifically, our approach relies on combining fine-tuned GPT-3.5 for emotion classification and a BiLSTM-based neural network to detect causes. We score 2nd in the ranking for Subtask 1, demonstrating the effectiveness of our approach through one of the highest weighted-average proportional F1 scores recorded at 0.264.

4/9/2024

cs.CL cs.AI