Mutual Theory of Mind in Human-AI Collaboration: An Empirical Study with LLM-driven AI Agents in a Real-time Shared Workspace Task

Read original: arXiv:2409.08811 - Published 9/16/2024 by Shao Zhang, Xihuai Wang, Wenhao Zhang, Yongshan Chen, Landi Gao, Dakuo Wang, Weinan Zhang, Xinbing Wang, Ying Wen

Mutual Theory of Mind in Human-AI Collaboration: An Empirical Study with LLM-driven AI Agents in a Real-time Shared Workspace Task

Overview

Examines the role of mutual theory of mind (ToM) in human-AI collaboration using LLM-driven AI agents in a real-time shared workspace task
Investigates how humans and AI agents develop shared understanding and coordinate their actions
Provides empirical insights into the challenges and opportunities for fostering mutual ToM in human-AI teams

Plain English Explanation

The study explores how humans and AI agents can work together effectively by developing a mutual understanding of each other's thoughts, beliefs, and intentions - a concept known as "mutual theory of mind." In a real-time shared workspace task, the researchers observed how humans and AI agents driven by large language models interacted and coordinated their actions.

The key idea is that for human-AI teams to collaborate seamlessly, both the human and the AI need to build a mental model of each other's perspectives, goals, and reasoning. This mutual theory of mind allows them to anticipate each other's behavior, communicate more effectively, and coordinate their efforts towards a shared objective.

The study provides empirical insights into the challenges and opportunities involved in fostering this mutual understanding between humans and AI agents. By understanding these dynamics, the researchers aim to inform the design of human-AI collaborative systems that can leverage the strengths of both to achieve better outcomes.

Technical Explanation

The paper presents an empirical study that examines the role of mutual theory of mind (ToM) in human-AI collaboration using large language model (LLM)-driven AI agents in a real-time shared workspace task. The researchers designed an experiment where human participants and AI agents worked together to complete a collaborative task, and they observed the dynamics of how the teams developed shared understanding and coordinated their actions.

The study's key elements include:

Experiment Design: Participants were paired with an AI agent and asked to complete a collaborative task in a real-time shared workspace. The task involved arranging shapes on a shared digital canvas, with the human and AI agent each controlling one cursor.
AI Agent Architecture: The AI agents were developed using large language models (LLMs) trained on a variety of tasks, including language understanding, task planning, and action execution. The agents were designed to engage in natural language communication with the human partners and reason about their mental states.
Insights and Findings: The researchers analyzed the interactions between the human participants and AI agents, focusing on how they developed a mutual understanding of each other's perspectives, goals, and reasoning. They identified key challenges and opportunities in fostering this mutual theory of mind, which has important implications for the design of effective human-AI collaborative systems.

Critical Analysis

The study provides valuable insights into the challenges and opportunities of achieving mutual theory of mind in human-AI collaboration. However, the paper acknowledges several limitations and areas for further research:

Ecological Validity: The study was conducted in a controlled experimental setting, which may not fully capture the complexity and dynamism of real-world human-AI collaboration scenarios. Further research is needed to validate the findings in more naturalistic settings.
AI Agent Capabilities: The AI agents used in the study were driven by large language models, which have certain capabilities and limitations. As AI technology continues to advance, it will be important to assess how more capable AI agents might influence the dynamics of mutual theory of mind development.
Individual Differences: The study did not examine how individual factors, such as personality traits or cognitive abilities, might affect the human participants' ability to develop a mutual theory of mind with the AI agents. Exploring these individual differences could provide valuable insights.
Long-term Interactions: The study focused on a single, relatively short collaborative task. Investigating how mutual theory of mind evolves over longer-term human-AI interactions would be an important area for future research.

Conclusion

This study provides valuable empirical insights into the role of mutual theory of mind in human-AI collaboration. By examining how humans and LLM-driven AI agents develop shared understanding and coordinate their actions in a real-time shared workspace task, the researchers have highlighted the key challenges and opportunities involved in fostering effective human-AI teams.

The findings underscore the importance of designing AI systems that can build mental models of their human partners and engage in natural language communication to achieve a mutual understanding. As AI technology continues to advance, these insights can inform the development of collaborative systems that leverage the strengths of both humans and machines to tackle complex problems more effectively.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Mutual Theory of Mind in Human-AI Collaboration: An Empirical Study with LLM-driven AI Agents in a Real-time Shared Workspace Task

Shao Zhang, Xihuai Wang, Wenhao Zhang, Yongshan Chen, Landi Gao, Dakuo Wang, Weinan Zhang, Xinbing Wang, Ying Wen

Theory of Mind (ToM) significantly impacts human collaboration and communication as a crucial capability to understand others. When AI agents with ToM capability collaborate with humans, Mutual Theory of Mind (MToM) arises in such human-AI teams (HATs). The MToM process, which involves interactive communication and ToM-based strategy adjustment, affects the team's performance and collaboration process. To explore the MToM process, we conducted a mixed-design experiment using a large language model-driven AI agent with ToM and communication modules in a real-time shared-workspace task. We find that the agent's ToM capability does not significantly impact team performance but enhances human understanding of the agent and the feeling of being understood. Most participants in our study believe verbal communication increases human burden, and the results show that bidirectional communication leads to lower HAT performance. We discuss the results' implications for designing AI agents that collaborate with humans in real-time shared workspace tasks.

9/16/2024

❗

Mutual Theory of Mind for Human-AI Communication

Qiaosi Wang (Georgia Institute of Technology), Ashok K. Goel (Georgia Institute of Technology)

New developments are enabling AI systems to perceive, recognize, and respond with social cues based on inferences made from humans' explicit or implicit behavioral and verbal cues. These AI systems, equipped with an equivalent of human's Theory of Mind (ToM) capability, are currently serving as matchmakers on dating platforms, assisting student learning as teaching assistants, and enhancing productivity as work partners. They mark a new era in human-AI interaction (HAI) that diverges from traditional human-computer interaction (HCI), where computers are commonly seen as tools instead of social actors. Designing and understanding the human perceptions and experiences in this emerging HAI era becomes an urgent and critical issue for AI systems to fulfill human needs and mitigate risks across social contexts. In this paper, we posit the Mutual Theory of Mind (MToM) framework, inspired by our capability of ToM in human-human communications, to guide this new generation of HAI research by highlighting the iterative and mutual shaping nature of human-AI communication. We discuss the motivation of the MToM framework and its three key components that iteratively shape the human-AI communication in three stages. We then describe two empirical studies inspired by the MToM framework to demonstrate the power of MToM in guiding the design and understanding of human-AI communication. Finally, we discuss future research opportunities in human-AI interaction through the lens of MToM.

5/28/2024

🛸

Expedient Assistance and Consequential Misunderstanding: Envisioning an Operationalized Mutual Theory of Mind

Justin D. Weisz, Michael Muller, Arielle Goldberg, Dario Andres Silva Moran

Design fictions allow us to prototype the future. They enable us to interrogate emerging or non-existent technologies and examine their implications. We present three design fictions that probe the potential consequences of operationalizing a mutual theory of mind (MToM) between human users and one (or more) AI agents. We use these fictions to explore many aspects of MToM, including how models of the other party are shaped through interaction, how discrepancies between these models lead to breakdowns, and how models of a human's knowledge and skills enable AI agents to act in their stead. We examine these aspects through two lenses: a utopian lens in which MToM enhances human-human interactions and leads to synergistic human-AI collaborations, and a dystopian lens in which a faulty or misaligned MToM leads to problematic outcomes. Our work provides an aspirational vision for human-centered MToM research while simultaneously warning of the consequences when implemented incorrectly.

6/19/2024

Theory of Mind for Multi-Agent Collaboration via Large Language Models

Huao Li, Yu Quan Chong, Simon Stepputtis, Joseph Campbell, Dana Hughes, Michael Lewis, Katia Sycara

While Large Language Models (LLMs) have demonstrated impressive accomplishments in both reasoning and planning, their abilities in multi-agent collaborations remains largely unexplored. This study evaluates LLM-based agents in a multi-agent cooperative text game with Theory of Mind (ToM) inference tasks, comparing their performance with Multi-Agent Reinforcement Learning (MARL) and planning-based baselines. We observed evidence of emergent collaborative behaviors and high-order Theory of Mind capabilities among LLM-based agents. Our results reveal limitations in LLM-based agents' planning optimization due to systematic failures in managing long-horizon contexts and hallucination about the task state. We explore the use of explicit belief state representations to mitigate these issues, finding that it enhances task performance and the accuracy of ToM inferences for LLM-based agents.

6/28/2024