Metrizing Fairness

2205.15049

Published 6/12/2024 by Yves Rychener, Bahar Taskesen, Daniel Kuhn

📈

Abstract

We study supervised learning problems that have significant effects on individuals from two demographic groups, and we seek predictors that are fair with respect to a group fairness criterion such as statistical parity (SP). A predictor is SP-fair if the distributions of predictions within the two groups are close in Kolmogorov distance, and fairness is achieved by penalizing the dissimilarity of these two distributions in the objective function of the learning problem. In this paper, we identify conditions under which hard SP constraints are guaranteed to improve predictive accuracy. We also showcase conceptual and computational benefits of measuring unfairness with integral probability metrics (IPMs) other than the Kolmogorov distance. Conceptually, we show that the generator of any IPM can be interpreted as a family of utility functions and that unfairness with respect to this IPM arises if individuals in the two demographic groups have diverging expected utilities. We also prove that the unfairness-regularized prediction loss admits unbiased gradient estimators, which are constructed from random mini-batches of training samples, if unfairness is measured by the squared $mathcal L^2$-distance or by a squared maximum mean discrepancy. In this case, the fair learning problem is susceptible to efficient stochastic gradient descent (SGD) algorithms. Numerical experiments on synthetic and real data show that these SGD algorithms outperform state-of-the-art methods for fair learning in that they achieve superior accuracy-unfairness trade-offs -- sometimes orders of magnitude faster.

Create account to get full access

Overview

The paper explores supervised learning problems for predicting properties of individuals who belong to one of two demographic groups.
The goal is to find predictors that are fair according to the principle of statistical parity, meaning the distributions of the predictions within the two groups should be similar.
The paper showcases the benefits of using integral probability metrics (IPMs) other than the Kolmogorov distance to measure unfairness.

Plain English Explanation

The paper looks at a common problem in machine learning where you want to predict some property of individuals, and those individuals belong to one of two different demographic groups. For example, you might want to predict someone's income level, and the individuals could be divided into two groups based on their gender or race.

The key challenge is to make these predictions in a fair way, where the predictions for the two groups are similar. This is known as achieving statistical parity - the distributions of the predictions should be close between the two groups.

Traditionally, researchers have used the Kolmogorov distance to measure how different the prediction distributions are between the groups, and then tried to minimize this distance in the machine learning model. This paper introduces the use of other types of distance metrics, called integral probability metrics (IPMs), to measure unfairness.

The paper shows that these alternative IPM metrics have some conceptual and computational benefits over the Kolmogorov distance. Conceptually, they can be interpreted as measuring the divergence in "expected utility" between the two groups. Computationally, certain IPMs allow for unbiased gradient estimates, which means the machine learning model can be trained more efficiently using stochastic gradient descent.

The paper also identifies conditions where achieving statistical parity can actually improve the overall prediction accuracy, beyond just making the model fair. This is an important finding, as it suggests there can be benefits to fairness beyond just meeting ethical or regulatory requirements.

Technical Explanation

The core of the paper is exploring the use of integral probability metrics (IPMs) to measure and optimize for fairness in supervised learning problems with two demographic groups.

Traditionally, the Kolmogorov distance has been used to quantify the unfairness between the prediction distributions of the two groups, and this unfairness metric is then incorporated into the machine learning model's objective function. The paper shows that other IPM metrics, such as the squared L2 distance and squared maximum mean discrepancy, have conceptual and computational advantages over the Kolmogorov distance.

Conceptually, the authors prove that any IPM can be interpreted as measuring the divergence in expected utility between the two demographic groups. This provides an intuitive interpretation of unfairness. Computationally, they show that for certain IPMs, the unfairness-regularized prediction loss admits unbiased gradient estimators, making the optimization problem amenable to efficient stochastic gradient descent (SGD) algorithms.

The paper presents extensive numerical experiments on real-world data, demonstrating that these SGD-based fair learning algorithms can achieve superior accuracy-unfairness trade-offs compared to state-of-the-art fair learning methods - sometimes orders of magnitude faster.

Finally, the authors identify conditions under which statistical parity (i.e., similar prediction distributions between groups) can actually improve prediction accuracy. This is an important finding, as it suggests fairness can have benefits beyond just ethical or regulatory requirements.

Critical Analysis

The paper makes a compelling case for the use of IPMs as a more flexible and powerful way to measure and optimize for fairness in supervised learning problems. The conceptual and computational advantages they identify are well-justified and supported by the theoretical analysis and empirical results.

However, it's worth noting that the paper focuses solely on the binary classification setting. While the authors mention the potential extension to multiclass problems, further research would be needed to validate the effectiveness of IPMs in those more complex scenarios.

Additionally, the paper does not address the issue of individual fairness, which is another important fairness criterion beyond just group-level statistical parity. Techniques like reweighting or hyperparameter tuning may be necessary to achieve individual fairness alongside group-level fairness.

Finally, the paper's findings on the potential accuracy benefits of statistical parity are intriguing, but the authors acknowledge that the conditions for this to hold may be fairly narrow. [Further research is needed to fully understand the intrinsic fairness-accuracy tradeoffs and the circumstances under which fairness can lead to improved predictive performance.

Conclusion

This paper makes a significant contribution to the field of fair machine learning by introducing the use of IPMs as a flexible and powerful way to measure and optimize for fairness in supervised learning problems. The conceptual and computational advantages of IPMs over traditional Kolmogorov-based approaches are well-demonstrated, and the authors' findings on the potential accuracy benefits of statistical parity are an important step towards understanding the interplay between fairness and predictive performance.

While the paper is focused on the binary classification setting, the insights and techniques it presents have the potential to be extended and applied to a wide range of fair machine learning applications, further advancing the field's ability to develop models that are both accurate and equitable.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

💬

Fairness and Unfairness in Binary and Multiclass Classification: Quantifying, Calculating, and Bounding

Sivan Sabato, Eran Treister, Elad Yom-Tov

We propose a new interpretable measure of unfairness, that allows providing a quantitative analysis of classifier fairness, beyond a dichotomous fair/unfair distinction. We show how this measure can be calculated when the classifier's conditional confusion matrices are known. We further propose methods for auditing classifiers for their fairness when the confusion matrices cannot be obtained or even estimated. Our approach lower-bounds the unfairness of a classifier based only on aggregate statistics, which may be provided by the owner of the classifier or collected from freely available data. We use the equalized odds criterion, which we generalize to the multiclass case. We report experiments on data sets representing diverse applications, which demonstrate the effectiveness and the wide range of possible uses of the proposed methodology. An implementation of the procedures proposed in this paper and as the code for running the experiments are provided in https://github.com/sivansabato/unfairness.

4/9/2024

cs.LG cs.CY stat.ML

Does Machine Bring in Extra Bias in Learning? Approximating Fairness in Models Promptly

Yijun Bian, Yujie Luo

Providing various machine learning (ML) applications in the real world, concerns about discrimination hidden in ML models are growing, particularly in high-stakes domains. Existing techniques for assessing the discrimination level of ML models include commonly used group and individual fairness measures. However, these two types of fairness measures are usually hard to be compatible with each other, and even two different group fairness measures might be incompatible as well. To address this issue, we investigate to evaluate the discrimination level of classifiers from a manifold perspective and propose a harmonic fairness measure via manifolds (HFM) based on distances between sets. Yet the direct calculation of distances might be too expensive to afford, reducing its practical applicability. Therefore, we devise an approximation algorithm named Approximation of distance between sets (ApproxDist) to facilitate accurate estimation of distances, and we further demonstrate its algorithmic effectiveness under certain reasonable assumptions. Empirical results indicate that the proposed fairness measure HFM is valid and that the proposed ApproxDist is effective and efficient.

5/16/2024

cs.LG cs.CY

The Unfairness of $varepsilon$-Fairness

Tolulope Fadina, Thorsten Schmidt

Fairness in decision-making processes is often quantified using probabilistic metrics. However, these metrics may not fully capture the real-world consequences of unfairness. In this article, we adopt a utility-based approach to more accurately measure the real-world impacts of decision-making process. In particular, we show that if the concept of $varepsilon$-fairness is employed, it can possibly lead to outcomes that are maximally unfair in the real-world context. Additionally, we address the common issue of unavailable data on false negatives by proposing a reduced setting that still captures essential fairness considerations. We illustrate our findings with two real-world examples: college admissions and credit risk assessment. Our analysis reveals that while traditional probability-based evaluations might suggest fairness, a utility-based approach uncovers the necessary actions to truly achieve equality. For instance, in the college admission case, we find that enhancing completion rates is crucial for ensuring fairness. Summarizing, this paper highlights the importance of considering the real-world context when evaluating fairness.

6/19/2024

cs.LG stat.ML

🔍

Individual Fairness Through Reweighting and Tuning

Abdoul Jalil Djiberou Mahamadou, Lea Goetz, Russ Altman

Inherent bias within society can be amplified and perpetuated by artificial intelligence (AI) systems. To address this issue, a wide range of solutions have been proposed to identify and mitigate bias and enforce fairness for individuals and groups. Recently, Graph Laplacian Regularizer (GLR), a regularization technique from the semi-supervised learning literature has been used as a substitute for the common Lipschitz condition to enhance individual fairness. Notable prior work has shown that enforcing individual fairness through a GLR can improve the transfer learning accuracy of AI models under covariate shifts. However, the prior work defines a GLR on the source and target data combined, implicitly assuming that the target data are available at train time, which might not hold in practice. In this work, we investigated whether defining a GLR independently on the train and target data could maintain similar accuracy. Furthermore, we introduced the Normalized Fairness Gain score (NFG) to measure individual fairness by measuring the amount of gained fairness when a GLR is used versus not. We evaluated the new and original methods under NFG, the Prediction Consistency (PC), and traditional classification metrics on the German Credit Approval dataset. The results showed that the two models achieved similar statistical mean performances over five-fold cross-validation. Furthermore, the proposed metric showed that PC scores can be misleading as the scores can be high and statistically similar to fairness-enhanced models while NFG scores are small. This work therefore provides new insights into when a GLR effectively enhances individual fairness and the pitfalls of PC.

5/9/2024

cs.LG cs.AI