Cost-Sensitive Learning to Defer to Multiple Experts with Workload Constraints

Read original: arXiv:2403.06906 - Published 8/21/2024 by Jean V. Alves, Diogo Leit~ao, S'ergio Jesus, Marco O. P. Sampaio, Javier Li'ebana, Pedro Saleiro, M'ario A. T. Figueiredo, Pedro Bizarro

Cost-Sensitive Learning to Defer to Multiple Experts with Workload Constraints

Overview

This paper presents a cost-sensitive learning framework for a system that defers to multiple experts, subject to workload constraints.
The goal is to learn a deferral policy that minimizes the overall cost, which includes the cost of processing instances and the cost of deferral to experts.
The framework aims to address scenarios where there are multiple experts with varying costs and capacities.

Plain English Explanation

The paper describes a system that can defer decision-making to multiple experts when needed, rather than trying to make all decisions itself. This is useful in situations where there are different types of experts available, each with their own strengths, weaknesses, and costs.

The key idea is to learn a deferral policy - a set of rules that determine when to defer to an expert and which one to choose. This policy is optimized to minimize the overall cost, which includes both the cost of processing instances and the cost of deferring to experts.

The framework aims to unify the process of deferring to multiple experts, taking into account factors like the experts' workloads and the relative costs of deferring to each one. This allows the system to adaptively complement the experts and defer content moderation decisions to humans and AI in a principled way.

Technical Explanation

The paper proposes a cost-sensitive learning framework to learn a deferral policy for a system that can defer to multiple experts, subject to workload constraints.

The key components of the framework are:

Feature Extraction: The system extracts relevant features from each instance that need to be classified.
Deferral Policy: The deferral policy is a function that maps the extracted features to a decision - either classify the instance directly or defer to one of the available experts.
Expert Classifiers: The system has access to multiple expert classifiers, each with their own cost and capacity constraints.
Cost Function: The overall cost function includes the cost of processing instances directly and the cost of deferring to experts, weighted by the deferral policy.

The framework aims to learn the deferral policy that minimizes the overall cost, subject to the experts' workload constraints. This is formulated as a constrained optimization problem, which the authors solve using a Lagrangian relaxation approach.

The paper also presents experimental results on real-world datasets, demonstrating the effectiveness of the proposed framework compared to various baselines.

Critical Analysis

The paper provides a well-designed framework for addressing the challenge of cost-sensitive learning with multiple experts and workload constraints. Some key strengths of the approach include:

Flexibility: The framework can accommodate a wide range of expert classifiers, costs, and capacity constraints, making it applicable to diverse real-world scenarios.
Optimization: The Lagrangian relaxation approach used to solve the optimization problem is a principled and effective technique.
Experimental Validation: The experiments on real-world datasets help demonstrate the practical utility of the proposed framework.

However, the paper also acknowledges some limitations and areas for further research:

Scalability: As the number of experts grows, the optimization problem may become computationally challenging, and the authors suggest exploring approximation techniques.
Expert Selection: The current framework assumes the set of available experts is fixed, but in practice, the system may need to dynamically select which experts to utilize.
Uncertainty Modeling: The framework does not explicitly model uncertainty in the experts' predictions or the costs, which could be an important consideration in some applications.

Overall, the paper presents a valuable contribution to the field of cost-sensitive learning and deferral systems, with potential applications in domains like medical diagnosis, content moderation, and decision support systems.

Conclusion

This paper introduces a cost-sensitive learning framework for deferring decisions to multiple experts while considering their workload constraints. The key innovation is the ability to learn a deferral policy that minimizes the overall cost, including both the cost of processing instances directly and the cost of deferring to experts.

The framework addresses an important practical challenge, as many real-world applications require integrating the capabilities of diverse expert systems in a principled and cost-effective manner. The authors' experimental results demonstrate the effectiveness of their approach, and the paper outlines several promising directions for future research to further improve the scalability and flexibility of the system.

Overall, this work represents a valuable contribution to the growing field of hybrid human-AI decision-making systems, with the potential to enable more efficient and effective utilization of expert knowledge in a wide range of application domains.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Cost-Sensitive Learning to Defer to Multiple Experts with Workload Constraints

Jean V. Alves, Diogo Leit~ao, S'ergio Jesus, Marco O. P. Sampaio, Javier Li'ebana, Pedro Saleiro, M'ario A. T. Figueiredo, Pedro Bizarro

Learning to defer (L2D) aims to improve human-AI collaboration systems by learning how to defer decisions to humans when they are more likely to be correct than an ML classifier. Existing research in L2D overlooks key real-world aspects that impede its practical adoption, namely: i) neglecting cost-sensitive scenarios, where type I and type II errors have different costs; ii) requiring concurrent human predictions for every instance of the training dataset; and iii) not dealing with human work-capacity constraints. To address these issues, we propose the textit{deferral under cost and capacity constraints framework} (DeCCaF). DeCCaF is a novel L2D approach, employing supervised learning to model the probability of human error under less restrictive data requirements (only one expert prediction per instance) and using constraint programming to globally minimize the error cost, subject to workload limitations. We test DeCCaF in a series of cost-sensitive fraud detection scenarios with different teams of 9 synthetic fraud analysts, with individual work-capacity constraints. The results demonstrate that our approach performs significantly better than the baselines in a wide array of scenarios, achieving an average $8.4%$ reduction in the misclassification cost. The code used for the experiments is available at https://github.com/feedzai/deccaf

8/21/2024

Learning to Defer to a Population: A Meta-Learning Approach

Dharmesh Tailor, Aditya Patra, Rajeev Verma, Putra Manggala, Eric Nalisnick

The learning to defer (L2D) framework allows autonomous systems to be safe and robust by allocating difficult decisions to a human expert. All existing work on L2D assumes that each expert is well-identified, and if any expert were to change, the system should be re-trained. In this work, we alleviate this constraint, formulating an L2D system that can cope with never-before-seen experts at test-time. We accomplish this by using meta-learning, considering both optimization- and model-based variants. Given a small context set to characterize the currently available expert, our framework can quickly adapt its deferral policy. For the model-based approach, we employ an attention mechanism that is able to look for points in the context set that are similar to a given test point, leading to an even more precise assessment of the expert's abilities. In the experiments, we validate our methods on image recognition, traffic sign detection, and skin lesion diagnosis benchmarks.

5/14/2024

A Unifying Post-Processing Framework for Multi-Objective Learn-to-Defer Problems

Mohammad-Amin Charusaie, Samira Samadi

Learn-to-Defer is a paradigm that enables learning algorithms to work not in isolation but as a team with human experts. In this paradigm, we permit the system to defer a subset of its tasks to the expert. Although there are currently systems that follow this paradigm and are designed to optimize the accuracy of the final human-AI team, the general methodology for developing such systems under a set of constraints (e.g., algorithmic fairness, expert intervention budget, defer of anomaly, etc.) remains largely unexplored. In this paper, using a $d$-dimensional generalization to the fundamental lemma of Neyman and Pearson (d-GNP), we obtain the Bayes optimal solution for learn-to-defer systems under various constraints. Furthermore, we design a generalizable algorithm to estimate that solution and apply this algorithm to the COMPAS and ACSIncome datasets. Our algorithm shows improvements in terms of constraint violation over a set of baselines.

7/18/2024

Learning to Complement and to Defer to Multiple Users

Zheng Zhang, Wenjie Ai, Kevin Wells, David Rosewarne, Thanh-Toan Do, Gustavo Carneiro

With the development of Human-AI Collaboration in Classification (HAI-CC), integrating users and AI predictions becomes challenging due to the complex decision-making process. This process has three options: 1) AI autonomously classifies, 2) learning to complement, where AI collaborates with users, and 3) learning to defer, where AI defers to users. Despite their interconnected nature, these options have been studied in isolation rather than as components of a unified system. In this paper, we address this weakness with the novel HAI-CC methodology, called Learning to Complement and to Defer to Multiple Users (LECODU). LECODU not only combines learning to complement and learning to defer strategies, but it also incorporates an estimation of the optimal number of users to engage in the decision process. The training of LECODU maximises classification accuracy and minimises collaboration costs associated with user involvement. Comprehensive evaluations across real-world and synthesized datasets demonstrate LECODU's superior performance compared to state-of-the-art HAI-CC methods. Remarkably, even when relying on unreliable users with high rates of label noise, LECODU exhibits significant improvement over both human decision-makers alone and AI alone.

7/10/2024