Equalised Odds is not Equal Individual Odds: Post-processing for Group and Individual Fairness

2304.09779

YC

0

Reddit

0

Published 4/22/2024 by Edward A. Small, Kacper Sokol, Daniel Manning, Flora D. Salim, Jeffrey Chan

🤖

Abstract

Group fairness is achieved by equalising prediction distributions between protected sub-populations; individual fairness requires treating similar individuals alike. These two objectives, however, are incompatible when a scoring model is calibrated through discontinuous probability functions, where individuals can be randomly assigned an outcome determined by a fixed probability. This procedure may provide two similar individuals from the same protected group with classification odds that are disparately different -- a clear violation of individual fairness. Assigning unique odds to each protected sub-population may also prevent members of one sub-population from ever receiving equal chances of a positive outcome to another, which we argue is another type of unfairness called individual odds. We reconcile all this by constructing continuous probability functions between group thresholds that are constrained by their Lipschitz constant. Our solution preserves the model's predictive power, individual fairness and robustness while ensuring group fairness.

Create account to get full access

or

If you already have an account, we'll log you in

Overview

  • Achieving group fairness and individual fairness in machine learning models can be incompatible when the models use discontinuous probability functions.
  • Discontinuous probability functions can randomly assign different outcomes to similar individuals within the same protected group, violating individual fairness.
  • Assigning unique odds to each protected subpopulation can also prevent members of one subpopulation from ever receiving equal chances of a positive outcome compared to another, which is another type of unfairness called "individual odds."
  • The paper proposes a solution that uses continuous probability functions constrained by their Lipschitz constant to preserve the model's predictive power, individual fairness, and robustness while ensuring group fairness.

Plain English Explanation

Machine learning models are often designed to be "fair" - that is, to treat different groups of people equally. There are two main types of fairness that are often considered: group fairness and individual fairness.

Group fairness means that the model's predictions are equally distributed across different protected subgroups (e.g., race, gender). Individual fairness means that the model treats similar individuals alike, regardless of which subgroup they belong to.

However, the paper explains that these two fairness objectives can be incompatible when the model uses certain types of probability functions. Specifically, if the model uses "discontinuous" probability functions, it can randomly assign different outcomes to similar individuals within the same protected group. This violates individual fairness.

The paper also notes that assigning unique odds to each protected subpopulation can prevent members of one subpopulation from ever having an equal chance of a positive outcome compared to another subpopulation. This is another type of unfairness called "individual odds."

To address these issues, the paper proposes a solution that uses "continuous" probability functions that are constrained by their Lipschitz constant. This approach preserves the model's predictive power, ensures individual fairness, and maintains robustness while also achieving group fairness.

Technical Explanation

The paper examines the tension between group fairness and individual fairness in machine learning models. Group fairness is achieved by equalizing prediction distributions between protected subgroups, while individual fairness requires treating similar individuals alike.

The authors show that these objectives can be incompatible when the scoring model is calibrated through discontinuous probability functions. In this case, individuals can be randomly assigned an outcome determined by a fixed probability, leading to disparate classification odds for similar individuals within the same protected group - a violation of individual fairness.

Additionally, assigning unique odds to each protected subpopulation can prevent members of one subpopulation from ever receiving equal chances of a positive outcome compared to another, which the authors refer to as "individual odds" unfairness.

To address these issues, the paper proposes constructing continuous probability functions between group thresholds that are constrained by their Lipschitz constant. This solution preserves the model's predictive power, individual fairness, and robustness while ensuring group fairness.

Critical Analysis

The paper identifies an important challenge in achieving both group fairness and individual fairness in machine learning models, especially when using discontinuous probability functions. The proposed solution of using continuous probability functions constrained by the Lipschitz constant seems promising, as it aims to balance the competing objectives of fairness.

However, the paper does not provide a comprehensive evaluation of the approach or compare it to other fairness-aware machine learning techniques. It would be useful to see how the proposed method performs relative to other state-of-the-art fairness interventions, both in terms of fairness metrics and model performance.

Additionally, the paper does not discuss the computational complexity or training time of the proposed solution, which could be important practical considerations. It would be valuable to understand the trade-offs involved in implementing this approach.

Finally, the paper could benefit from a more in-depth discussion of the potential limitations and areas for future research. For example, the approach may be sensitive to the choice of the Lipschitz constant, and it's unclear how it would scale to high-dimensional or highly complex data distributions.

Conclusion

This paper tackles the challenging problem of reconciling group fairness and individual fairness in machine learning models. By constructing continuous probability functions constrained by the Lipschitz constant, the proposed solution aims to preserve the model's predictive power, individual fairness, and robustness while ensuring group fairness.

The key insight is that discontinuous probability functions can lead to violations of individual fairness, even when group fairness is achieved. The paper's approach addresses this issue, offering a potential path forward for developing fair and effective machine learning models.

While the paper does not provide a comprehensive evaluation or discuss all potential limitations, it highlights an important problem and presents a promising direction for further research and development in the field of algorithmic fairness.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🎲

Intrinsic Fairness-Accuracy Tradeoffs under Equalized Odds

Meiyu Zhong, Ravi Tandon

YC

0

Reddit

0

With the growing adoption of machine learning (ML) systems in areas like law enforcement, criminal justice, finance, hiring, and admissions, it is increasingly critical to guarantee the fairness of decisions assisted by ML. In this paper, we study the tradeoff between fairness and accuracy under the statistical notion of equalized odds. We present a new upper bound on the accuracy (that holds for any classifier), as a function of the fairness budget. In addition, our bounds also exhibit dependence on the underlying statistics of the data, labels and the sensitive group attributes. We validate our theoretical upper bounds through empirical analysis on three real-world datasets: COMPAS, Adult, and Law School. Specifically, we compare our upper bound to the tradeoffs that are achieved by various existing fair classifiers in the literature. Our results show that achieving high accuracy subject to a low-bias could be fundamentally limited based on the statistical disparity across the groups.

Read more

5/17/2024

Optimal Group Fair Classifiers from Linear Post-Processing

Optimal Group Fair Classifiers from Linear Post-Processing

Ruicheng Xian, Han Zhao

YC

0

Reddit

0

We propose a post-processing algorithm for fair classification that mitigates model bias under a unified family of group fairness criteria covering statistical parity, equal opportunity, and equalized odds, applicable to multi-class problems and both attribute-aware and attribute-blind settings. It achieves fairness by re-calibrating the output score of the given base model with a fairness cost -- a linear combination of the (predicted) group memberships. Our algorithm is based on a representation result showing that the optimal fair classifier can be expressed as a linear post-processing of the loss function and the group predictor, derived via using these as sufficient statistics to reformulate the fair classification problem as a linear program. The parameters of the post-processor are estimated by solving the empirical LP. Experiments on benchmark datasets show the efficiency and effectiveness of our algorithm at reducing disparity compared to existing algorithms, including in-processing, especially on larger problems.

Read more

5/8/2024

📈

Metrizing Fairness

Yves Rychener, Bahar Taskesen, Daniel Kuhn

YC

0

Reddit

0

We study supervised learning problems that have significant effects on individuals from two demographic groups, and we seek predictors that are fair with respect to a group fairness criterion such as statistical parity (SP). A predictor is SP-fair if the distributions of predictions within the two groups are close in Kolmogorov distance, and fairness is achieved by penalizing the dissimilarity of these two distributions in the objective function of the learning problem. In this paper, we identify conditions under which hard SP constraints are guaranteed to improve predictive accuracy. We also showcase conceptual and computational benefits of measuring unfairness with integral probability metrics (IPMs) other than the Kolmogorov distance. Conceptually, we show that the generator of any IPM can be interpreted as a family of utility functions and that unfairness with respect to this IPM arises if individuals in the two demographic groups have diverging expected utilities. We also prove that the unfairness-regularized prediction loss admits unbiased gradient estimators, which are constructed from random mini-batches of training samples, if unfairness is measured by the squared $mathcal L^2$-distance or by a squared maximum mean discrepancy. In this case, the fair learning problem is susceptible to efficient stochastic gradient descent (SGD) algorithms. Numerical experiments on synthetic and real data show that these SGD algorithms outperform state-of-the-art methods for fair learning in that they achieve superior accuracy-unfairness trade-offs -- sometimes orders of magnitude faster.

Read more

6/12/2024

🤔

Individual Fairness under Varied Notions of Group Fairness in Bipartite Matching - One Framework to Approximate Them All

Atasi Panda, Anand Louis, Prajakta Nimbhorkar

YC

0

Reddit

0

We study the probabilistic assignment of items to platforms that satisfies both group and individual fairness constraints. Each item belongs to specific groups and has a preference ordering over platforms. Each platform enforces group fairness by limiting the number of items per group that can be assigned to it. There could be multiple optimal solutions that satisfy the group fairness constraints, but this alone ignores item preferences. Our approach explores a `best of both worlds fairness' solution to get a randomized matching, which is ex-ante individually fair and ex-post group-fair. Thus, we seek a `probabilistic individually fair' distribution over `group-fair' matchings where each item has a `high' probability of matching to one of its top choices. This distribution is also ex-ante group-fair. Users can customize fairness constraints to suit their requirements. Our first result is a polynomial-time algorithm that computes a distribution over `group-fair' matchings such that the individual fairness constraints are approximately satisfied and the expected size of a matching is close to OPT. We empirically test this on real-world datasets. We present two additional polynomial-time bi-criteria approximation algorithms that users can choose from to balance group fairness and individual fairness trade-offs. For disjoint groups, we provide an exact polynomial-time algorithm adaptable to additional lower `group fairness' bounds. Extending our model, we encompass `maxmin group fairness,' amplifying underrepresented groups, and `mindom group fairness,' reducing the representation of dominant groups.'

Read more

5/13/2024