Both Matter: Enhancing the Emotional Intelligence of Large Language Models without Compromising the General Intelligence

2402.10073

Published 6/13/2024 by Weixiang Zhao, Zhuojun Li, Shilong Wang, Yang Wang, Yulin Hu, Yanyan Zhao, Chen Wei, Bing Qin


Abstract

Emotional Intelligence (EI), consisting of emotion perception, emotion cognition and emotion expression, plays a critical role in improving the user interaction experience of current large language model (LLM) based conversational general AI assistants. Previous works mainly focus on raising their emotion perception ability via naive fine-tuning on EI-related classification or regression tasks. However, this leads to incomplete enhancement of EI and catastrophic forgetting of general intelligence (GI). To this end, we first introduce EiBench, a large-scale collection of EI-related tasks in a text-to-text format with task instructions that covers all three aspects of EI, which lays a solid foundation for the comprehensive EI enhancement of LLMs. Then a novel Modular Emotional Intelligence enhancement method (MoEI), consisting of Modular Parameter Expansion and intra-inter modulation, is proposed to comprehensively enhance the EI of LLMs without compromising their GI. Extensive experiments on two representative LLM-based assistants, Flan-T5 and LLaMA-2-Chat, demonstrate the effectiveness of MoEI in improving EI while maintaining GI.


Overview

The paper explores how Emotional Intelligence (EI) can be used to improve the user interaction experience for large language model (LLM) based conversational AI assistants.
Previous research has focused on improving the emotion perception ability of these assistants, but this leads to incomplete EI enhancement and a loss of general intelligence (GI).
The paper introduces EiBench, a dataset covering all three aspects of EI (emotion perception, cognition, and expression), and proposes a novel Modular Emotional Intelligence enhancement method (MoEI) to comprehensively improve EI without compromising GI.

Plain English Explanation

Imagine you're chatting with a virtual assistant, and it responds with the perfect emotional tone - it's empathetic when you're feeling down, enthusiastic when you share good news, and calm when you need reassurance. This is the power of Emotional Intelligence (EI), which involves understanding and expressing emotions in a natural way.

Current large language model (LLM) based AI assistants often struggle with EI, as their training has focused more on general knowledge and language skills rather than emotional awareness. The researchers behind this paper wanted to find a way to improve the EI of these assistants without sacrificing their broader capabilities.

They first created a new dataset called EiBench, which covers a wide range of EI-related tasks, from recognizing emotions in text to expressing empathy. This lays a solid foundation for comprehensive EI enhancement.

Next, they developed a technique called Modular Emotional Intelligence enhancement (MoEI), which allows the AI to improve its EI skills without forgetting its general intelligence. This involves expanding the model's parameters in a modular way and using special techniques to balance the AI's emotional and general capabilities.

By testing MoEI on two popular LLM-based assistants, the researchers demonstrated that it can significantly boost the EI of these systems while maintaining their overall knowledge and language understanding. This is an important step towards creating AI assistants that can engage with users in a more natural, empathetic, and personalized way.

Technical Explanation

The paper presents a novel approach to comprehensively enhancing the Emotional Intelligence (EI) of large language model (LLM) based conversational AI assistants without compromising their General Intelligence (GI).

Previous works have focused on improving the emotion perception ability of these assistants through fine-tuning on EI-related classification or regression tasks. However, this leads to an incomplete enhancement of EI and can result in catastrophic forgetting of the model's GI.

To address this, the authors first introduce EiBench, a large-scale collection of EI-related tasks in a text-to-text format with clear instructions. EiBench covers all three aspects of EI: emotion perception, emotion cognition, and emotion expression, providing a comprehensive benchmark for EI enhancement.
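To make "text-to-text format with instructions" concrete, here is a minimal sketch of how a classification or regression example could be recast as an instruction-formatted input and a textual target. The prompt template, field names, and helper functions are hypothetical illustrations; the actual EiBench templates are not given in this summary.

```python
from dataclasses import dataclass


@dataclass
class TextToTextExample:
    """One instruction-formatted example: the model reads `source` and must generate `target`."""
    source: str
    target: str


def classification_to_text(instruction: str, text: str, label: str, options: list[str]) -> TextToTextExample:
    """Cast a labeled classification example as a text-to-text pair (hypothetical template)."""
    if label not in options:
        raise ValueError(f"label {label!r} is not one of {options}")
    source = (
        f"{instruction}\n"
        f"Options: {', '.join(options)}\n"
        f"Input: {text}\n"
        f"Output:"
    )
    return TextToTextExample(source=source, target=label)


def regression_to_text(instruction: str, text: str, score: float, digits: int = 2) -> TextToTextExample:
    """Cast a regression example (e.g. emotion intensity) as text by rendering the score as a string."""
    source = f"{instruction}\nInput: {text}\nOutput:"
    return TextToTextExample(source=source, target=f"{score:.{digits}f}")


# Made-up example data, for illustration only.
example = classification_to_text(
    instruction="Identify the emotion expressed in the input.",
    text="I finally got the job offer!",
    label="joy",
    options=["anger", "fear", "joy", "sadness"],
)
print(example.source)
print(example.target)
```

Once every task shares this shape, perception, cognition, and expression tasks can all be trained with the same sequence-to-sequence objective, which is presumably why a unified format is useful here.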

The researchers then propose a novel Modular Emotional Intelligence enhancement (MoEI) method, which consists of two key components:

Modular Parameter Expansion: The model's parameters are expanded in a modular fashion to accommodate the additional EI-related capabilities without interfering with the existing GI.
Intra-Inter Modulation: A specialized training procedure that balances the model's EI and GI through intra-module and inter-module interactions, preventing catastrophic forgetting.
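The two components above can be sketched roughly as follows. This is an illustrative sketch only, under stated assumptions: the summary does not specify the module design, so the code uses low-rank add-on modules next to a frozen base projection and a softmax gate as a stand-in for the modulation step. All class and parameter names are hypothetical and are not taken from the paper.

```python
import math
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class LowRankModule:
    """One added module: a rank-r update up @ (down @ x). Assumed design, not the paper's."""

    def __init__(self, dim, rank, rng):
        self.down = [[rng.gauss(0.0, 0.02) for _ in range(dim)] for _ in range(rank)]
        # Zero-initialised up-projection: the module contributes nothing before training.
        self.up = [[0.0] * rank for _ in range(dim)]

    def __call__(self, x):
        return matvec(self.up, matvec(self.down, x))


class ExpandedLayer:
    """A frozen base projection plus gated add-on modules."""

    def __init__(self, base_weight, num_modules, rank, seed=0):
        rng = random.Random(seed)
        dim = len(base_weight)
        self.base_weight = base_weight  # frozen: would not be updated during EI training
        self.modules = [LowRankModule(dim, rank, rng) for _ in range(num_modules)]
        # Gate with one score per module plus one extra "base only" slot, so the
        # layer can lean on the untouched base weights for non-emotional inputs.
        self.router = [[rng.gauss(0.0, 0.02) for _ in range(dim)] for _ in range(num_modules + 1)]

    def __call__(self, x):
        out = matvec(self.base_weight, x)
        gates = softmax(matvec(self.router, x))
        # zip stops after the modules; the last gate only absorbs probability mass.
        for gate, module in zip(gates, self.modules):
            out = [o + gate * d for o, d in zip(out, module(x))]
        return out


identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
layer = ExpandedLayer(identity, num_modules=2, rank=2)
print(layer([1.0, -2.0, 0.5]))  # matches the base output until the add-on modules are trained
```

Because the base weights are never modified and each module starts as a zero update, the expanded layer initially reproduces the original model exactly. In this sketch, any change in behavior comes only from the added parameters and the gate.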

Extensive experiments on two representative LLM-based assistants, Flan-T5 and LLaMA-2-Chat, demonstrate the effectiveness of MoEI in improving EI while maintaining GI performance.
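One simple way to check the "improve EI while maintaining GI" claim is to compare benchmark scores before and after EI training and flag any general-intelligence benchmark that drops beyond a tolerance. The sketch below assumes scores on a 0-100 scale; the benchmark names, scores, and the one-point tolerance are placeholders for illustration and are not results or settings from the paper.

```python
def score_changes(before, after):
    """Per-benchmark score change (after - before) for benchmarks present in both runs."""
    return {name: after[name] - before[name] for name in before if name in after}


def preserved(changes, tolerance=1.0):
    """True when no benchmark dropped by more than `tolerance` points."""
    return all(delta >= -tolerance for delta in changes.values())


# Placeholder scores for illustration only; they are NOT results from the paper.
gi_before = {"gi_task_a": 50.0, "gi_task_b": 40.0}
gi_naive = {"gi_task_a": 42.0, "gi_task_b": 33.0}    # hypothetical run with forgetting
gi_modular = {"gi_task_a": 49.5, "gi_task_b": 40.5}  # hypothetical run without forgetting

print(preserved(score_changes(gi_before, gi_naive)))    # False
print(preserved(score_changes(gi_before, gi_modular)))  # True
```

The same `score_changes` helper can be applied to EI benchmarks to confirm that the emotional capabilities actually improved.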

Critical Analysis

The paper presents a compelling approach to enhancing the Emotional Intelligence (EI) of large language model (LLM) based conversational AI assistants, but there are a few potential areas for further exploration:

Generalization to Other LLMs: The experiments were conducted on two specific LLM-based assistants, Flan-T5 and LLaMA-2-Chat. It would be valuable to see how well the MoEI approach generalizes to a wider range of LLM architectures and sizes, including industry-leading models such as OpenAI's GPT-3 or InstructGPT.
Real-World User Evaluations: While the experiments demonstrate improvements in EI-related tasks, it would be important to assess the impact on actual user interactions and satisfaction. The related paper "Can Large Language Models be Good Emotional Companions?" explores this aspect.
Ethical Implications: As AI assistants become more emotionally intelligent, there are potential ethical concerns around the nature of the user-AI relationship and the risk of emotional manipulation. The paper does not delve into these important considerations.

Overall, the research represents a valuable step forward in enhancing the emotional capabilities of conversational AI systems. Further work is needed to ensure these advancements are implemented responsibly and with a focus on benefiting users.

Conclusion

This paper introduces a novel approach to comprehensively improving the Emotional Intelligence (EI) of large language model (LLM) based conversational AI assistants without compromising their General Intelligence (GI). By developing the EiBench dataset and the Modular Emotional Intelligence enhancement (MoEI) method, the researchers have made significant progress in enabling these AI systems to engage with users in a more natural, empathetic, and personalized way.

As AI assistants become increasingly prevalent in our daily lives, enhancing their emotional awareness and sensitivity is crucial for fostering more positive and meaningful interactions. This research lays the groundwork for a new generation of AI companions that can understand and respond to human emotions, enriching the user experience and paving the way for more intuitive and harmonious human-AI collaboration.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers


EmoBench: Evaluating the Emotional Intelligence of Large Language Models

Sahand Sabour, Siyang Liu, Zheyuan Zhang, June M. Liu, Jinfeng Zhou, Alvionna S. Sunaryo, Juanzi Li, Tatia M. C. Lee, Rada Mihalcea, Minlie Huang

Recent advances in Large Language Models (LLMs) have highlighted the need for robust, comprehensive, and challenging benchmarks. Yet, research on evaluating their Emotional Intelligence (EI) is considerably limited. Existing benchmarks have two major shortcomings: first, they mainly focus on emotion recognition, neglecting essential EI capabilities such as emotion regulation and thought facilitation through emotion understanding; second, they are primarily constructed from existing datasets, which include frequent patterns, explicit information, and annotation errors, leading to unreliable evaluation. We propose EmoBench, a benchmark that draws upon established psychological theories and proposes a comprehensive definition for machine EI, including Emotional Understanding and Emotional Application. EmoBench includes a set of 400 hand-crafted questions in English and Chinese, which are meticulously designed to require thorough reasoning and understanding. Our findings reveal a considerable gap between the EI of existing LLMs and the average human, highlighting a promising direction for future research. Our code and data are publicly available at https://github.com/Sahandfer/EmoBench.

6/10/2024

cs.CL cs.AI

TEII: Think, Explain, Interact and Iterate with Large Language Models to Solve Cross-lingual Emotion Detection

Long Cheng, Qihao Shao, Christine Zhao, Sheng Bi, Gina-Anne Levow

Cross-lingual emotion detection allows us to analyze global trends, public opinion, and social phenomena at scale. We participated in the Explainability of Cross-lingual Emotion Detection (EXALT) shared task, achieving an F1-score of 0.6046 on the evaluation set for the emotion detection sub-task. Our system outperformed the baseline by more than 0.16 F1-score absolute, and ranked second amongst competing systems. We conducted experiments using fine-tuning, zero-shot learning, and few-shot learning for Large Language Model (LLM)-based models as well as embedding-based BiLSTM and KNN for non-LLM-based techniques. Additionally, we introduced two novel methods: the Multi-Iteration Agentic Workflow and the Multi-Binary-Classifier Agentic Workflow. We found that LLM-based approaches provided good performance on multilingual emotion detection. Furthermore, ensembles combining all our experimented models yielded higher F1-scores than any single approach alone.

5/28/2024

cs.CL cs.AI


Modeling Emotions and Ethics with Large Language Models

Edward Y. Chang

This paper explores the integration of human-like emotions and ethical considerations into Large Language Models (LLMs). We first model eight fundamental human emotions, presented as opposing pairs, and employ collaborative LLMs to reinterpret and express these emotions across a spectrum of intensity. Our focus extends to embedding a latent ethical dimension within LLMs, guided by a novel self-supervised learning algorithm with human feedback (SSHF). This approach enables LLMs to perform self-evaluations and adjustments concerning ethical guidelines, enhancing their capability to generate content that is not only emotionally resonant but also ethically aligned. The methodologies and case studies presented herein illustrate the potential of LLMs to transcend mere text and image generation, venturing into the realms of empathetic interaction and principled decision-making, thereby setting a new precedent in the development of emotionally aware and ethically conscious AI systems.

4/23/2024

cs.CL cs.AI


Integrating Emotional and Linguistic Models for Ethical Compliance in Large Language Models

Edward Y. Chang

This research develops advanced methodologies for Large Language Models (LLMs) to better manage linguistic behaviors related to emotions and ethics. We introduce DIKE, an adversarial framework that enhances the LLMs' ability to internalize and reflect global human values, adapting to varied cultural contexts to promote transparency and trust among users. The methodology involves detailed modeling of emotions, classification of linguistic behaviors, and implementation of ethical guardrails. Our innovative approaches include mapping emotions and behaviors using self-supervised learning techniques, refining these guardrails through adversarial reviews, and systematically adjusting outputs to ensure ethical alignment. This framework establishes a robust foundation for AI systems to operate with ethical integrity and cultural sensitivity, paving the way for more responsible and context-aware AI interactions.

5/15/2024

cs.CL cs.AI