Approximating Human Models During Argumentation-based Dialogues

2405.18650

Published 5/30/2024 by Yinxu Tang, Stylianos Loukas Vasileiou, William Yeoh

🌀

Abstract

Explainable AI Planning (XAIP) aims to develop AI agents that can effectively explain their decisions and actions to human users, fostering trust and facilitating human-AI collaboration. A key challenge in XAIP is model reconciliation, which seeks to align the mental models of AI agents and humans. While existing approaches often assume a known and deterministic human model, this simplification may not capture the complexities and uncertainties of real-world interactions. In this paper, we propose a novel framework that enables AI agents to learn and update a probabilistic human model through argumentation-based dialogues. Our approach incorporates trust-based and certainty-based update mechanisms, allowing the agent to refine its understanding of the human's mental state based on the human's expressed trust in the agent's arguments and certainty in their own arguments. We employ a probability weighting function inspired by prospect theory to capture the relationship between trust and perceived probability, and use a Bayesian approach to update the agent's probability distribution over possible human models. We conduct a human-subject study to empirically evaluate the effectiveness of our approach in an argumentation scenario, demonstrating its ability to capture the dynamics of human belief formation and adaptation.

Create account to get full access

Overview

The paper proposes a novel framework for Explainable AI Planning (XAIP) that enables AI agents to learn and update a probabilistic model of human mental states through argumentation-based dialogues.
The key challenge addressed is model reconciliation, which aims to align the mental models of AI agents and humans.
The approach incorporates trust-based and certainty-based update mechanisms, allowing the agent to refine its understanding of the human's mental state based on the human's expressed trust and certainty.
The researchers conduct a human-subject study to evaluate the effectiveness of their approach in an argumentation scenario.

Plain English Explanation

The paper is about making AI systems that can explain their decisions and actions to human users in a way that builds trust and enables effective collaboration. A key challenge in this field, known as Explainable AI Planning (XAIP), is model reconciliation - aligning the mental models of the AI system and the human user.

Existing approaches often assume the human's mental model is known and deterministic, but this may not capture the complexities and uncertainties of real-world interactions. The researchers propose a new framework that allows the AI system to learn and update a probabilistic model of the human's mental state through conversations.

The system uses the human's expressed trust in the AI's arguments and certainty in their own arguments to refine its understanding of the human's beliefs and reasoning. This is inspired by prospect theory, which describes how people's perceptions of probability are influenced by factors like trust.

The researchers conducted a study where humans interacted with the AI system in an argumentation scenario. This allowed them to evaluate how well the system could capture the dynamics of human belief formation and adaptation, which is an important step towards more human-centric and trustworthy AI systems.

Technical Explanation

The paper presents a novel framework for Explainable AI Planning (XAIP) that enables AI agents to learn and update a probabilistic model of human mental states through argumentation-based dialogues. The key challenge addressed is model reconciliation, which seeks to align the mental models of AI agents and humans.

The approach incorporates trust-based and certainty-based update mechanisms, allowing the agent to refine its understanding of the human's mental state based on the human's expressed trust in the agent's arguments and certainty in their own arguments. The researchers employ a probability weighting function inspired by prospect theory to capture the relationship between trust and perceived probability, and use a Bayesian approach to update the agent's probability distribution over possible human models.

The researchers conducted a human-subject study to empirically evaluate the effectiveness of their approach in an argumentation scenario. The study allowed them to demonstrate the system's ability to capture the dynamics of human belief formation and adaptation, which is an important step towards more human-centric and trustworthy AI systems.

Critical Analysis

The paper presents a promising approach to Explainable AI Planning (XAIP) that addresses the challenge of model reconciliation by allowing the AI agent to learn and update a probabilistic model of the human's mental state.

However, the researchers acknowledge that their study was limited to a specific argumentation scenario, and further research is needed to understand how the approach would perform in more complex, real-world interactions. Additionally, the human-centric and trustworthy AI systems proposed in the paper may still face challenges in unraveling the dilemma of AI errors and ensuring appropriate levels of trust and transparency in high-stakes scenarios.

Future research could explore ways to expand the model to handle a wider range of human behaviors and interaction dynamics, as well as investigate the scalability and robustness of the approach in more diverse and challenging real-world settings.

Conclusion

The paper proposes a novel framework for Explainable AI Planning (XAIP) that enables AI agents to learn and update a probabilistic model of human mental states through argumentation-based dialogues. This addresses the key challenge of model reconciliation and represents an important step towards more human-centric and trustworthy AI systems.

The empirical evaluation of the approach in an argumentation scenario demonstrates its ability to capture the dynamics of human belief formation and adaptation. While further research is needed to expand the model and test its performance in more complex, real-world settings, this work contributes valuable insights to the ongoing efforts to improve the onboarding of health professionals to AI systems and unravel the dilemma of AI errors in a way that fosters trust and collaboration between humans and AI agents.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Explainable Human-AI Interaction: A Planning Perspective

Sarath Sreedharan, Anagha Kulkarni, Subbarao Kambhampati

From its inception, AI has had a rather ambivalent relationship with humans -- swinging between their augmentation and replacement. Now, as AI technologies enter our everyday lives at an ever increasing pace, there is a greater need for AI systems to work synergistically with humans. One critical requirement for such synergistic human-AI interaction is that the AI systems be explainable to the humans in the loop. To do this effectively, AI agents need to go beyond planning with their own models of the world, and take into account the mental model of the human in the loop. Drawing from several years of research in our lab, we will discuss how the AI agent can use these mental models to either conform to human expectations, or change those expectations through explanatory communication. While the main focus of the book is on cooperative scenarios, we will point out how the same mental models can be used for obfuscation and deception. Although the book is primarily driven by our own research in these areas, in every chapter, we will provide ample connections to relevant research from other groups.

5/28/2024

cs.AI

🤯

Towards Dialogues for Joint Human-AI Reasoning and Value Alignment

Elfia Bezou-Vrakatseli, Oana Cocarascu, Sanjay Modgil

We argue that enabling human-AI dialogue, purposed to support joint reasoning (i.e., 'inquiry'), is important for ensuring that AI decision making is aligned with human values and preferences. In particular, we point to logic-based models of argumentation and dialogue, and suggest that the traditional focus on persuasion dialogues be replaced by a focus on inquiry dialogues, and the distinct challenges that joint inquiry raises. Given recent dramatic advances in the performance of large language models (LLMs), and the anticipated increase in their use for decision making, we provide a roadmap for research into inquiry dialogues for supporting joint human-LLM reasoning tasks that are ethically salient, and that thereby require that decisions are value aligned.

5/29/2024

cs.AI

On the Utility of Accounting for Human Beliefs about AI Behavior in Human-AI Collaboration

Guanghui Yu, Robert Kasumba, Chien-Ju Ho, William Yeoh

To enable effective human-AI collaboration, merely optimizing AI performance while ignoring humans is not sufficient. Recent research has demonstrated that designing AI agents to account for human behavior leads to improved performance in human-AI collaboration. However, a limitation of most existing approaches is their assumption that human behavior is static, irrespective of AI behavior. In reality, humans may adjust their action plans based on their observations of AI behavior. In this paper, we address this limitation by enabling a collaborative AI agent to consider the beliefs of its human partner, i.e., what the human partner thinks the AI agent is doing, and design its action plan to facilitate easier collaboration with its human partner. Specifically, we developed a model of human beliefs that accounts for how humans reason about the behavior of their AI partners. Based on this belief model, we then developed an AI agent that considers both human behavior and human beliefs in devising its strategy for working with humans. Through extensive real-world human-subject experiments, we demonstrated that our belief model more accurately predicts humans' beliefs about AI behavior. Moreover, we showed that our design of AI agents that accounts for human beliefs enhances performance in human-AI collaboration.

6/11/2024

cs.AI cs.HC cs.LG

Improving Health Professionals' Onboarding with AI and XAI for Trustworthy Human-AI Collaborative Decision Making

Min Hun Lee, Silvana Xin Yi Choo, Shamala D/O Thilarajah

With advanced AI/ML, there has been growing research on explainable AI (XAI) and studies on how humans interact with AI and XAI for effective human-AI collaborative decision-making. However, we still have a lack of understanding of how AI systems and XAI should be first presented to users without technical backgrounds. In this paper, we present the findings of semi-structured interviews with health professionals (n=12) and students (n=4) majoring in medicine and health to study how to improve onboarding with AI and XAI. For the interviews, we built upon human-AI interaction guidelines to create onboarding materials of an AI system for stroke rehabilitation assessment and AI explanations and introduce them to the participants. Our findings reveal that beyond presenting traditional performance metrics on AI, participants desired benchmark information, the practical benefits of AI, and interaction trials to better contextualize AI performance, and refine the objectives and performance of AI. Based on these findings, we highlight directions for improving onboarding with AI and XAI and human-AI collaborative decision-making.

5/28/2024

cs.HC cs.AI cs.LG