Cross-model Fairness: Empirical Study of Fairness and Ethics Under Model Multiplicity

Read original: arXiv:2203.07139 - Published 7/11/2024 by Kacper Sokol, Meelis Kull, Jeffrey Chan, Flora Salim

📈

Overview

Data-driven predictive models can have unexpected real-world consequences, even when the models themselves are technically sound.
One key consideration is the fairness of these systems, both for individuals and groups, as algorithms can discriminate based on protected characteristics.
Previous research has focused on identifying and mitigating unfairness in a single, fixed model, but this paper proposes a new definition of "cross-model fairness."

Plain English Explanation

When data-driven predictive models are used to make decisions that affect people's lives, the choices made in creating these models can have unintended consequences. Even if the models themselves are well-designed from an engineering standpoint, they may operate within a social context where their outputs can lead to unfair treatment of individuals or groups.

The concept of fairness is particularly relevant here. Algorithms can discriminate against people based on characteristics like race, gender, or age, even if those properties are not explicitly included in the data used to train the models. Previous research has focused on identifying and addressing unfairness within a single, fixed model, often by adjusting classification thresholds.

However, this paper takes a different approach. It explores a scenario where there are multiple predictive models that are considered equally accurate, but they may classify the same individual differently. In this case, a person could argue for the model that gives them the most favorable outcome, which could have negative consequences for others. This is the idea of "cross-model fairness."

The researchers use a simple two-dimensional example and then examine real-world datasets and models to demonstrate that this type of unfairness can indeed occur in practice. They also explore the analytical properties of cross-model fairness and how it might be difficult to address through technical means alone, as doing so could degrade the overall predictive performance of the models.

Technical Explanation

The paper begins by introducing the concept of cross-model fairness, which differs from previous approaches to fairness in machine learning. While prior research has focused on identifying and mitigating unfairness within a single, fixed predictive model, this paper explores a scenario where there are multiple models that are considered equally accurate, but they may classify the same individual differently.

The researchers first demonstrate this idea using a simple two-dimensional example with linear classification. They then present a comprehensive empirical study based on real-world predictive models and datasets that are popular in the algorithmic fairness community.

The key finding is that this type of unfairness, where individuals can be harmed by the

ad hoc

choice of a predictor from a group of equally well-performing models, can readily be found in real-life scenarios. The paper also investigates the analytical properties of cross-model fairness and its broader implications.

Critical Analysis

The paper raises an important and novel issue in the field of algorithmic fairness. By considering the possibility of multiple, equally accurate predictive models, it highlights a previously overlooked source of unfairness that could have significant real-world consequences.

One limitation of the research is that it focuses primarily on linear classification tasks. While this provides a clear and accessible example, it would be valuable to explore the phenomenon of cross-model fairness in more complex, non-linear predictive models as well.

Additionally, the paper acknowledges that addressing cross-model unfairness through technical means alone may be difficult, as mitigating this issue could degrade the overall predictive performance of the models. This suggests that a more holistic, interdisciplinary approach involving policymakers, ethicists, and domain experts may be necessary to tackle this challenge effectively.

Further research could also explore the interaction between cross-model fairness and other fairness criteria, such as group-level and individual-level fairness. Understanding how these different fairness considerations might conflict or complement each other could provide valuable insights for developing more comprehensive fairness frameworks.

Conclusion

This paper introduces the novel concept of cross-model fairness, which highlights a previously overlooked source of unfairness in data-driven predictive models. By considering the possibility of multiple, equally accurate models that may classify the same individual differently, the researchers demonstrate that individuals can be harmed by the

ad hoc

choice of a predictor.

The findings suggest that this type of unfairness can occur in real-world scenarios and may be challenging to mitigate through technical means alone, as doing so could degrade the overall predictive performance of the models. This underscores the need for a more holistic, interdisciplinary approach to addressing fairness in machine learning systems, involving policymakers, ethicists, and domain experts to ensure these powerful tools are used in a fair and responsible manner.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

📈

Cross-model Fairness: Empirical Study of Fairness and Ethics Under Model Multiplicity

Kacper Sokol, Meelis Kull, Jeffrey Chan, Flora Salim

While data-driven predictive models are a strictly technological construct, they may operate within a social context in which benign engineering choices entail implicit, indirect and unexpected real-life consequences. Fairness of such systems -- pertaining both to individuals and groups -- is one relevant consideration in this space; algorithms can discriminate people across various protected characteristics regardless of whether these properties are included in the data or discernible through proxy variables. To date, this notion has predominantly been studied for a fixed model, often under different classification thresholds, striving to identify and eradicate undesirable, discriminative and possibly unlawful aspects of its operation. Here, we backtrack on this fixed model assumption to propose and explore a novel definition of cross-model fairness where individuals can be harmed when one predictor is chosen ad hoc from a group of equally well performing models, i.e., in view of utility-based model multiplicity. Since a person may be classified differently across models that are otherwise considered equivalent, this individual could argue for a predictor granting them the most favourable outcome, employing which may have adverse effects on other people. We introduce this scenario with a two-dimensional example and linear classification; then, we present a comprehensive empirical study based on real-life predictive models and data sets that are popular with the algorithmic fairness community; finally, we investigate analytical properties of cross-model fairness and its ramifications in a broader context. Our findings suggest that such unfairness can be readily found in real life and it may be difficult to mitigate by technical means alone as doing so is likely to degrade predictive performance.

7/11/2024

📈

One Model Many Scores: Using Multiverse Analysis to Prevent Fairness Hacking and Evaluate the Influence of Model Design Decisions

Jan Simson, Florian Pfisterer, Christoph Kern

A vast number of systems across the world use algorithmic decision making (ADM) to (partially) automate decisions that have previously been made by humans. The downstream effects of ADM systems critically depend on the decisions made during a systems' design, implementation, and evaluation, as biases in data can be mitigated or reinforced along the modeling pipeline. Many of these decisions are made implicitly, without knowing exactly how they will influence the final system. To study this issue, we draw on insights from the field of psychology and introduce the method of multiverse analysis for algorithmic fairness. In our proposed method, we turn implicit decisions during design and evaluation into explicit ones and demonstrate their fairness implications. By combining decisions, we create a grid of all possible universes of decision combinations. For each of these universes, we compute metrics of fairness and performance. Using the resulting dataset, one can investigate the variability and robustness of fairness scores and see how and which decisions impact fairness. We demonstrate how multiverse analyses can be used to better understand fairness implications of design and evaluation decisions using an exemplary case study of predicting public health care coverage for vulnerable populations. Our results highlight how decisions regarding the evaluation of a system can lead to vastly different fairness metrics for the same model. This is problematic, as a nefarious actor could optimise or hack a fairness metric to portray a discriminating model as fair merely by changing how it is evaluated. We illustrate how a multiverse analysis can help to address this issue.

6/21/2024

🌐

When mitigating bias is unfair: multiplicity and arbitrariness in algorithmic group fairness

Natasa Krco, Thibault Laugel, Vincent Grari, Jean-Michel Loubes, Marcin Detyniecki

Most research on fair machine learning has prioritized optimizing criteria such as Demographic Parity and Equalized Odds. Despite these efforts, there remains a limited understanding of how different bias mitigation strategies affect individual predictions and whether they introduce arbitrariness into the debiasing process. This paper addresses these gaps by exploring whether models that achieve comparable fairness and accuracy metrics impact the same individuals and mitigate bias in a consistent manner. We introduce the FRAME (FaiRness Arbitrariness and Multiplicity Evaluation) framework, which evaluates bias mitigation through five dimensions: Impact Size (how many people were affected), Change Direction (positive versus negative changes), Decision Rates (impact on models' acceptance rates), Affected Subpopulations (who was affected), and Neglected Subpopulations (where unfairness persists). This framework is intended to help practitioners understand the impacts of debiasing processes and make better-informed decisions regarding model selection. Applying FRAME to various bias mitigation approaches across key datasets allows us to exhibit significant differences in the behaviors of debiasing methods. These findings highlight the limitations of current fairness criteria and the inherent arbitrariness in the debiasing process.

5/24/2024

Fairness-Accuracy Trade-Offs: A Causal Perspective

Drago Plecko, Elias Bareinboim

Systems based on machine learning may exhibit discriminatory behavior based on sensitive characteristics such as gender, sex, religion, or race. In light of this, various notions of fairness and methods to quantify discrimination were proposed, leading to the development of numerous approaches for constructing fair predictors. At the same time, imposing fairness constraints may decrease the utility of the decision-maker, highlighting a tension between fairness and utility. This tension is also recognized in legal frameworks, for instance in the disparate impact doctrine of Title VII of the Civil Rights Act of 1964 -- in which specific attention is given to considerations of business necessity -- possibly allowing the usage of proxy variables associated with the sensitive attribute in case a high-enough utility cannot be achieved without them. In this work, we analyze the tension between fairness and accuracy from a causal lens for the first time. We introduce the notion of a path-specific excess loss (PSEL) that captures how much the predictor's loss increases when a causal fairness constraint is enforced. We then show that the total excess loss (TEL), defined as the difference between the loss of predictor fair along all causal pathways vs. an unconstrained predictor, can be decomposed into a sum of more local PSELs. At the same time, enforcing a causal constraint often reduces the disparity between demographic groups. Thus, we introduce a quantity that summarizes the fairness-utility trade-off, called the causal fairness/utility ratio, defined as the ratio of the reduction in discrimination vs. the excess loss from constraining a causal pathway. This quantity is suitable for comparing the fairness-utility trade-off across causal pathways. Finally, as our approach requires causally-constrained fair predictors, we introduce a new neural approach for causally-constrained fair learning.

5/27/2024