Take It, Leave It, or Fix It: Measuring Productivity and Trust in Human-AI Collaboration

2402.18498

Published 4/3/2024 by Crystal Qian, James Wexler

Take It, Leave It, or Fix It: Measuring Productivity and Trust in Human-AI Collaboration

Abstract

Although recent developments in generative AI have greatly enhanced the capabilities of conversational agents such as Google's Gemini (formerly Bard) or OpenAI's ChatGPT, it's unclear whether the usage of these agents aids users across various contexts. To better understand how access to conversational AI affects productivity and trust, we conducted a mixed-methods, task-based user study, observing 76 software engineers (N=76) as they completed a programming exam with and without access to Bard. Effects on performance, efficiency, satisfaction, and trust vary depending on user expertise, question type (open-ended solve vs. definitive search questions), and measurement type (demonstrated vs. self-reported). Our findings include evidence of automation complacency, increased reliance on the AI over the course of the task, and increased performance for novices on solve-type questions when using the AI. We discuss common behaviors, design recommendations, and impact considerations to improve collaborations with conversational AI.

Create account to get full access

Overview

The paper examines how humans collaborate with AI systems, focusing on measuring productivity and trust in these interactions.
Researchers conducted a study where participants were asked to perform various writing tasks with the help of an AI assistant.
The study evaluated factors like how often participants chose to use the AI's suggestions, how much time they spent on the tasks, and their perceived trust in the AI.

Plain English Explanation

The researchers wanted to understand how well humans and AI systems can work together. They had people complete writing tasks with the assistance of an AI agent. The participants could choose to use the AI's suggestions, ignore them, or modify them. The researchers measured things like how much time the participants spent on the tasks and how much they trusted the AI. This allowed them to see how productive the human-AI collaboration was and how much the people trusted the AI's capabilities.

The key idea is that for AI systems to be truly useful, they need to be able to work seamlessly with humans. By studying factors like productivity and trust, the researchers can learn how to design AI assistants that complement human strengths and inspire confidence. This is important as AI becomes more integrated into our daily lives and professional activities.

Technical Explanation

The paper describes an experiment where participants were asked to complete various writing tasks with the help of an AI assistant. The researchers measured several metrics to assess the productivity and trust in the human-AI collaboration:

Task Completion Time: How long it took participants to finish each writing task.
AI Utilization Rate: How often participants chose to use the AI's suggested text, either completely or with modifications.
Perceived Trust: Participants' self-reported level of trust in the AI's suggestions.

The results showed that participants were more productive when they used the AI's suggestions, completing tasks faster than when working alone. However, the AI utilization rate varied - participants sometimes rejected the AI's suggestions, opting to write their own text instead. Perceived trust in the AI was also an important factor, with more trusting participants using the AI's suggestions more frequently.

The researchers conclude that successful human-AI collaboration requires finding the right balance between AI assistance and human oversight. AI systems need to be designed to complement human strengths while maintaining an appropriate level of transparency and trustworthiness.

Critical Analysis

The paper provides valuable insights into the challenges and opportunities of human-AI collaboration. By focusing on measurable outcomes like productivity and trust, the researchers offer a nuanced view of how these systems can be optimized.

One limitation acknowledged in the paper is the relatively small and homogeneous participant pool, which may limit the generalizability of the findings. Further research with more diverse samples could help validate the results.

Additionally, the study only examined a specific writing task scenario. Exploring human-AI collaboration in other domains, such as decision-making or problem-solving, could yield additional insights.

It would also be interesting to investigate how factors like the AI's transparency, explainability, and ability to learn from human feedback might influence trust and collaboration over time.

Overall, this paper contributes important groundwork for understanding the human-AI dynamic and provides a solid foundation for future research in this rapidly evolving field.

Conclusion

This study offers a thoughtful examination of human-AI collaboration, highlighting the need to balance AI assistance with human oversight and trust. By measuring productivity and trust, the researchers demonstrate that successful integration of AI systems requires careful design to complement human strengths and inspire confidence.

As AI becomes more ubiquitous, these insights can help guide the development of AI assistants that seamlessly and transparently collaborate with humans, unlocking new levels of productivity and innovation. Continued research in this area will be crucial for ensuring AI systems are designed to work in harmony with people, rather than replacing or undermining human capabilities.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🤔

Collaborative human-AI trust (CHAI-T): A process framework for active management of trust in human-AI collaboration

Melanie J. McGrath (CSIRO), Andreas Duenser (CSIRO), Justine Lacey (CSIRO), Cecile Paris (CSIRO)

Collaborative human-AI (HAI) teaming combines the unique skills and capabilities of humans and machines in sustained teaming interactions leveraging the strengths of each. In tasks involving regular exposure to novelty and uncertainty, collaboration between adaptive, creative humans and powerful, precise artificial intelligence (AI) promises new solutions and efficiencies. User trust is essential to creating and maintaining these collaborative relationships. Established models of trust in traditional forms of AI typically recognize the contribution of three primary categories of trust antecedents: characteristics of the human user, characteristics of the technology, and environmental factors. The emergence of HAI teams, however, requires an understanding of human trust that accounts for the specificity of task contexts and goals, integrates processes of interaction, and captures how trust evolves in a teaming environment over time. Drawing on both the psychological and computer science literature, the process framework of trust in collaborative HAI teams (CHAI-T) presented in this paper adopts the tripartite structure of antecedents established by earlier models, while incorporating team processes and performance phases to capture the dynamism inherent to trust in teaming contexts. These features enable active management of trust in collaborative AI systems, with practical implications for the design and deployment of collaborative HAI teams.

4/3/2024

cs.HC

🤖

Augmenting the Author: Exploring the Potential of AI Collaboration in Academic Writing

Joseph Tu, Hilda Hadan, Derrick M. Wang, Sabrina A Sgandurra, Reza Hadi Mogavi, Lennart E. Nacke

This workshop paper presents a critical examination of the integration of Generative AI (Gen AI) into the academic writing process, focusing on the use of AI as a collaborative tool. It contrasts the performance and interaction of two AI models, Gemini and ChatGPT, through a collaborative inquiry approach where researchers engage in facilitated sessions to design prompts that elicit specific AI responses for crafting research outlines. This case study highlights the importance of prompt design, output analysis, and recognizing the AI's limitations to ensure responsible and effective AI integration in scholarly work. Preliminary findings suggest that prompt variation significantly affects output quality and reveals distinct capabilities and constraints of each model. The paper contributes to the field of Human-Computer Interaction by exploring effective prompt strategies and providing a comparative analysis of Gen AI models, ultimately aiming to enhance AI-assisted academic writing and prompt a deeper dialogue within the HCI community.

4/26/2024

cs.HC cs.AI

🤖

Human-Generative AI Collaborative Problem Solving Who Leads and How Students Perceive the Interactions

Gaoxia Zhu, Vidya Sudarshan, Jason Fok Kow, Yew Soon Ong

This research investigates distinct human-generative AI collaboration types and students' interaction experiences when collaborating with generative AI (i.e., ChatGPT) for problem-solving tasks and how these factors relate to students' sense of agency and perceived collaborative problem solving. By analyzing the surveys and reflections of 79 undergraduate students, we identified three human-generative AI collaboration types: even contribution, human leads, and AI leads. Notably, our study shows that 77.21% of students perceived they led or had even contributed to collaborative problem-solving when collaborating with ChatGPT. On the other hand, 15.19% of the human participants indicated that the collaborations were led by ChatGPT, indicating a potential tendency for students to rely on ChatGPT. Furthermore, 67.09% of students perceived their interaction experiences with ChatGPT to be positive or mixed. We also found a positive correlation between positive interaction experience and a sense of positive agency. The results of this study contribute to our understanding of the collaboration between students and generative AI and highlight the need to study further why some students let ChatGPT lead collaborative problem-solving and how to enhance their interaction experience through curriculum and technology design.

5/24/2024

cs.HC cs.AI

On the Utility of Accounting for Human Beliefs about AI Behavior in Human-AI Collaboration

Guanghui Yu, Robert Kasumba, Chien-Ju Ho, William Yeoh

To enable effective human-AI collaboration, merely optimizing AI performance while ignoring humans is not sufficient. Recent research has demonstrated that designing AI agents to account for human behavior leads to improved performance in human-AI collaboration. However, a limitation of most existing approaches is their assumption that human behavior is static, irrespective of AI behavior. In reality, humans may adjust their action plans based on their observations of AI behavior. In this paper, we address this limitation by enabling a collaborative AI agent to consider the beliefs of its human partner, i.e., what the human partner thinks the AI agent is doing, and design its action plan to facilitate easier collaboration with its human partner. Specifically, we developed a model of human beliefs that accounts for how humans reason about the behavior of their AI partners. Based on this belief model, we then developed an AI agent that considers both human behavior and human beliefs in devising its strategy for working with humans. Through extensive real-world human-subject experiments, we demonstrated that our belief model more accurately predicts humans' beliefs about AI behavior. Moreover, we showed that our design of AI agents that accounts for human beliefs enhances performance in human-AI collaboration.

6/11/2024

cs.AI cs.HC cs.LG