Enhancing Group Fairness in Online Settings Using Oblique Decision Forests

Read original: arXiv:2310.11401 - Published 4/30/2024 by Somnath Basu Roy Chowdhury, Nicholas Monath, Ahmad Beirami, Rahul Kidambi, Avinava Dubey, Amr Ahmed, Snigdha Chaturvedi

🤿

Overview

This paper addresses the challenge of achieving group fairness in online machine learning settings, where data arrives one instance at a time.
The authors propose a novel ensemble model called Aranyani that can efficiently compute fairness gradients using aggregate statistics, eliminating the need for additional storage and forward/backward passes.
Aranyani is an ensemble of oblique decision trees that allows for parameter isolation and faster fairness computations compared to traditional in-processing fairness techniques.
The authors evaluate Aranyani on several public datasets and show it achieves a better accuracy-fairness trade-off than baseline approaches.

Plain English Explanation

When machine learning models are used to make decisions that impact people, it's important to ensure those decisions are fair, both for individual users and for different demographic groups. Fairness, especially group fairness, is an important consideration in the context of machine learning systems.

One way to make models more fair is to incorporate fairness objectives into the training process, such as demographic parity. However, when data arrives one instance at a time in an online setting, optimizing these fairness objectives becomes challenging. Estimating group fairness requires additional storage and computation that can slow down the training process.

To address this, the researchers developed a new model called Aranyani, which is an ensemble of decision trees. The tree structure of Aranyani allows the model to efficiently compute fairness gradients using aggregate statistics from previous decisions, without needing to store extra data or perform extra computation.

The authors tested Aranyani on several different datasets, including vision and language tasks, and found that it was able to achieve a better balance between accuracy and fairness compared to other approaches. Aranyani was able to make fair decisions in these online settings more effectively than baseline methods.

Technical Explanation

The key innovation in this paper is the Aranyani model, which is an ensemble of oblique decision trees designed for online settings where data arrives one instance at a time. Traditional in-processing fairness techniques, which optimize a combination of a fairness objective and a task-specific objective, face challenges in the online setting.

Specifically, group fairness objectives like demographic parity require estimating expectations of predictions across different demographic groups. In the online setting, this estimation requires additional storage and significantly more computation (e.g., forward/backward passes) at every time step compared to the task-specific objective alone.

Aranyani addresses this by leveraging the hierarchical tree structure to enable parameter isolation and efficient fairness gradient computation. The authors present an efficient training framework for Aranyani and provide theoretical analysis of its properties.

In their experiments, the researchers evaluate Aranyani on 5 publicly available benchmarks, including vision and language datasets. The results show that Aranyani achieves a better accuracy-fairness trade-off compared to baseline approaches like adversarial debiasing and optimized in-processing techniques.

Critical Analysis

The paper provides a novel and promising approach to addressing group fairness in online machine learning settings. The authors demonstrate the effectiveness of Aranyani through extensive experiments on diverse datasets.

One potential limitation is that the paper focuses on group fairness, while individual fairness is also an important consideration in many real-world applications. The authors acknowledge this, but do not provide a comprehensive solution for both group and individual fairness simultaneously.

Additionally, the paper does not explore the interpretability of the Aranyani model, which could be an important factor in certain applications where model decisions need to be explainable. Further research could investigate ways to enhance the interpretability of the Aranyani ensemble.

It would also be valuable to see how Aranyani performs in more complex, real-world scenarios with higher-dimensional data and a greater diversity of features and protected attributes. The current experiments, while comprehensive, may not fully capture the challenges of deploying such a system in practice.

Overall, the Aranyani approach represents a significant contribution to the field of fair machine learning, and the authors have provided a solid foundation for future research in this area.

Conclusion

This paper presents Aranyani, a novel ensemble model for achieving group fairness in online machine learning settings. By leveraging the hierarchical structure of decision trees, Aranyani can efficiently compute fairness gradients using aggregate statistics, overcoming the challenges of traditional in-processing fairness techniques in the online setting.

The empirical results demonstrate that Aranyani is able to achieve a better balance between accuracy and fairness compared to baseline approaches, across a variety of public datasets. This work advances the state of the art in fair machine learning and provides a promising direction for further research in this important area.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🤿

Enhancing Group Fairness in Online Settings Using Oblique Decision Forests

Somnath Basu Roy Chowdhury, Nicholas Monath, Ahmad Beirami, Rahul Kidambi, Avinava Dubey, Amr Ahmed, Snigdha Chaturvedi

Fairness, especially group fairness, is an important consideration in the context of machine learning systems. The most commonly adopted group fairness-enhancing techniques are in-processing methods that rely on a mixture of a fairness objective (e.g., demographic parity) and a task-specific objective (e.g., cross-entropy) during the training process. However, when data arrives in an online fashion -- one instance at a time -- optimizing such fairness objectives poses several challenges. In particular, group fairness objectives are defined using expectations of predictions across different demographic groups. In the online setting, where the algorithm has access to a single instance at a time, estimating the group fairness objective requires additional storage and significantly more computation (e.g., forward/backward passes) than the task-specific objective at every time step. In this paper, we propose Aranyani, an ensemble of oblique decision trees, to make fair decisions in online settings. The hierarchical tree structure of Aranyani enables parameter isolation and allows us to efficiently compute the fairness gradients using aggregate statistics of previous decisions, eliminating the need for additional storage and forward/backward passes. We also present an efficient framework to train Aranyani and theoretically analyze several of its properties. We conduct empirical evaluations on 5 publicly available benchmarks (including vision and language datasets) to show that Aranyani achieves a better accuracy-fairness trade-off compared to baseline approaches.

4/30/2024

Counterfactual Fairness through Transforming Data Orthogonal to Bias

Shuyi Chen, Shixiang Zhu

Machine learning models have shown exceptional prowess in solving complex issues across various domains. However, these models can sometimes exhibit biased decision-making, resulting in unequal treatment of different groups. Despite substantial research on counterfactual fairness, methods to reduce the impact of multivariate and continuous sensitive variables on decision-making outcomes are still underdeveloped. We propose a novel data pre-processing algorithm, Orthogonal to Bias (OB), which is designed to eliminate the influence of a group of continuous sensitive variables, thus promoting counterfactual fairness in machine learning applications. Our approach, based on the assumption of a jointly normal distribution within a structural causal model (SCM), demonstrates that counterfactual fairness can be achieved by ensuring the data is orthogonal to the observed sensitive variables. The OB algorithm is model-agnostic, making it applicable to a wide range of machine learning models and tasks. Additionally, it includes a sparse variant to improve numerical stability through regularization. Empirical evaluations on both simulated and real-world datasets, encompassing settings with both discrete and continuous sensitive variables, show that our methodology effectively promotes fairer outcomes without compromising accuracy.

7/2/2024

📊

A Canonical Data Transformation for Achieving Inter- and Within-group Fairness

Zachary McBride Lazri, Ivan Brugere, Xin Tian, Dana Dachman-Soled, Antigoni Polychroniadou, Danial Dervovic, Min Wu

Increases in the deployment of machine learning algorithms for applications that deal with sensitive data have brought attention to the issue of fairness in machine learning. Many works have been devoted to applications that require different demographic groups to be treated fairly. However, algorithms that aim to satisfy inter-group fairness (also called group fairness) may inadvertently treat individuals within the same demographic group unfairly. To address this issue, we introduce a formal definition of within-group fairness that maintains fairness among individuals from within the same group. We propose a pre-processing framework to meet both inter- and within-group fairness criteria with little compromise in accuracy. The framework maps the feature vectors of members from different groups to an inter-group-fair canonical domain before feeding them into a scoring function. The mapping is constructed to preserve the relative relationship between the scores obtained from the unprocessed feature vectors of individuals from the same demographic group, guaranteeing within-group fairness. We apply this framework to the COMPAS risk assessment and Law School datasets and compare its performance in achieving inter-group and within-group fairness to two regularization-based methods.

7/9/2024

Evaluating AI Group Fairness: a Fuzzy Logic Perspective

Emmanouil Krasanakis, Symeon Papadopoulos

Artificial intelligence systems often address fairness concerns by evaluating and mitigating measures of group discrimination, for example that indicate biases against certain genders or races. However, what constitutes group fairness depends on who is asked and the social context, whereas definitions are often relaxed to accept small deviations from the statistical constraints they set out to impose. Here we decouple definitions of group fairness both from the context and from relaxation-related uncertainty by expressing them in the axiomatic system of Basic fuzzy Logic (BL) with loosely understood predicates, like encountering group members. We then evaluate the definitions in subclasses of BL, such as Product or Lukasiewicz logics. Evaluation produces continuous instead of binary truth values by choosing the logic subclass and truth values for predicates that reflect uncertain context-specific beliefs, such as stakeholder opinions gathered through questionnaires. Internally, it follows logic-specific rules to compute the truth values of definitions. We show that commonly held propositions standardize the resulting mathematical formulas and we transcribe logic and truth value choices to layperson terms, so that anyone can answer them. We also use our framework to study several literature definitions of algorithmic fairness, for which we rationalize previous expedient practices that are non-probabilistic and show how to re-interpret their formulas and parameters in new contexts.

6/28/2024