Fair Federated Learning under Domain Skew with Local Consistency and Domain Diversity

2405.16585

Published 5/28/2024 by Yuhang Chen, Wenke Huang, Mang Ye

Abstract

Federated learning (FL) has emerged as a new paradigm for privacy-preserving collaborative training. Under domain skew, current FL approaches are biased and face two fairness problems. 1) Parameter Update Conflict: data disparity among clients leads to varying parameter importance and inconsistent update directions. These two disparities cause important parameters to potentially be overwhelmed by unimportant ones of dominant updates. This consequently results in significant performance decreases for lower-performing clients. 2) Model Aggregation Bias: existing FL approaches introduce unfair weight allocation and neglect domain diversity. This leads to a biased model convergence objective and distinct performance among domains. We discover a pronounced directional update consistency in Federated Learning and propose a novel framework to tackle the above issues. First, leveraging the discovered characteristic, we selectively discard unimportant parameter updates to prevent updates from clients with lower performance from being overwhelmed by unimportant parameters, resulting in fairer generalization performance. Second, we propose a fair aggregation objective to prevent global model bias towards some domains, ensuring that the global model continuously aligns with an unbiased model. The proposed method is generic and can be combined with other existing FL methods to enhance fairness. Comprehensive experiments on Digits and Office-Caltech demonstrate the high fairness and performance of our method.

Overview

• This research paper proposes a new approach to fair federated learning, a machine learning technique that allows multiple organizations to collaboratively train a shared model without sharing their private data.
• The key ideas are to exploit local consistency, the directional consistency of parameter updates that the authors observe in federated learning, in order to discard unimportant parameter updates, and to account for domain diversity when the server aggregates the organizations' models.
• This helps address the challenge of "domain skew," where the data distributions differ across organizations, which can lead to unfairness in the final model.

Plain English Explanation

The paper looks at a problem in federated learning, which is a way for different organizations to work together to train a machine learning model without sharing their private data. The challenge is that the data at each organization can be quite different, a problem known as "domain skew."

The researchers propose two main ideas to address this:

Local Consistency: The authors observe that parameter updates in federated learning tend to point in consistent directions. They use this observation to discard unimportant parameter updates, so that the important updates from lower-performing organizations are not drowned out by the dominant ones.
Domain Diversity: When the organizations' models are combined, a fair aggregation objective takes the diversity of domains into account. This keeps the shared model from drifting toward a few dominant domains, rather than just optimizing for the average.

By exploiting local consistency and accounting for domain diversity, the researchers aim to train a fairer and more robust federated learning model, even when the data varies significantly across organizations.

Technical Explanation

The paper introduces a new federated learning approach, Fair Federated Learning under Domain Skew with Local Consistency and Domain Diversity, referred to in this summary as FFLD.

The key technical components are:

Local Consistency: The authors report a pronounced directional consistency in the parameter updates produced during federated training. Leveraging this, the method selectively discards unimportant parameter updates, so that important parameters from lower-performing clients are not overwhelmed by unimportant parameters of dominant updates.
Domain Diversity: Instead of the weight allocation used by existing approaches, which neglects domain diversity, the server uses a fair aggregation objective that prevents the global model from becoming biased towards some domains and keeps it aligned with an unbiased model.
Federated Optimization: The model is trained using a federated optimization algorithm, where each organization updates the model locally and the central server aggregates the updates. According to the abstract, the method is generic and can be combined with other existing federated learning methods.
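
The server-side step can be illustrated with a minimal sketch. The abstract describes discarding unimportant parameter updates before aggregation, but this summary does not give the paper's importance criterion or aggregation weights. The code below therefore makes two loudly stated assumptions: importance is approximated by update magnitude, and the aggregation weights are supplied by the caller. The names `mask_unimportant`, `aggregate`, and `keep_ratio` are hypothetical and are not taken from the paper.

```python
import numpy as np

def mask_unimportant(update, keep_ratio=0.5):
    """Zero out the smallest-magnitude entries of one client's update.

    ASSUMPTION: magnitude stands in for "importance"; the paper's actual
    criterion is not described in this summary.
    """
    flat = np.abs(update).ravel()
    k = max(1, int(round(keep_ratio * flat.size)))   # number of entries to keep
    threshold = np.partition(flat, flat.size - k)[flat.size - k]
    # Ties at the threshold are all kept, so slightly more than k may survive.
    return np.where(np.abs(update) >= threshold, update, 0.0)

def aggregate(global_params, client_params, weights, keep_ratio=0.5):
    """Apply a weighted sum of masked client updates to the global parameters.

    ASSUMPTION: `weights` is provided by the caller (for example, uniform
    weights); this is not the paper's fair aggregation objective.
    """
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()                # normalize to sum to 1
    masked = [mask_unimportant(p - global_params, keep_ratio) for p in client_params]
    return global_params + sum(w * u for w, u in zip(weights, masked))

# Toy example with two clients and four parameters (made-up numbers).
global_params = np.zeros(4)
client_params = [
    np.array([1.0, 0.1, -0.2, 2.0]),
    np.array([0.05, -3.0, 0.5, 0.0]),
]
new_global = aggregate(global_params, client_params, weights=[1.0, 1.0])
print(new_global)
```

With `keep_ratio=0.5`, each client keeps only its two largest-magnitude updates, so the small entries of one client no longer dilute the large entries of the other.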

The researchers evaluate FFLD on the Digits and Office-Caltech benchmarks and report that it outperforms existing federated learning methods in terms of both model performance and fairness across domains.
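
To make "fairness across organizations" concrete, the sketch below computes a few summary statistics over per-domain accuracies. It assumes that fairness is judged by how evenly accuracy is spread across domains (a lower standard deviation and a higher worst-case accuracy being fairer); this summary does not state which metric the paper uses. The function name `fairness_report`, the domain names, and the accuracy values are all hypothetical toy inputs, not results from the paper.

```python
import numpy as np

def fairness_report(acc_by_domain):
    """Summarize per-domain accuracies of a single global model.

    ASSUMPTION: spread across domains is used as the fairness proxy.
    """
    accs = np.array(list(acc_by_domain.values()), dtype=float)
    return {
        "mean": float(accs.mean()),   # overall performance
        "std": float(accs.std()),     # population std; lower means more even
        "worst": float(accs.min()),   # accuracy on the worst-served domain
    }

# Made-up accuracies for three hypothetical domains.
report = fairness_report({"domain_a": 0.90, "domain_b": 0.70, "domain_c": 0.80})
print(report)
```

Two models with the same mean accuracy can differ sharply in `std` and `worst`, which is why a fairness-oriented evaluation reports more than the average.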

Critical Analysis

The paper addresses an important challenge in federated learning – how to train a fair and robust model when the data distributions vary significantly across organizations. The proposed FFLD approach is a promising solution, with the local consistency and domain diversity objectives helping to overcome the issue of domain skew.

However, the paper does not fully address the potential for "sybil attacks," where a single organization could create multiple virtual entities to skew the federated optimization process. Additionally, the authors do not explore the impact of organization-level incentives and potential strategic behavior, which could be important considerations in real-world federated learning deployments.
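
The sybil concern can be illustrated with a small, generic example. The sketch assumes a server that simply averages the submitted client models with equal weight; this is an illustrative baseline, not the aggregation rule of FFLD, and the one-parameter "models" are made-up numbers.

```python
import numpy as np

def uniform_average(models):
    """Average client models with equal weight per submitted model."""
    return np.mean(np.stack(models), axis=0)

# Three honest organizations, each submitting a one-parameter model.
honest = [np.array([0.0]), np.array([1.0]), np.array([2.0])]
avg_honest = uniform_average(honest)

# The third organization registers two extra virtual clients (sybils)
# that submit copies of its own model.
with_sybils = honest + [np.array([2.0]), np.array([2.0])]
avg_sybil = uniform_average(with_sybils)

print(avg_honest, avg_sybil)
```

Because every submitted model counts equally, the duplicated entries pull the average from 1.0 to 1.4, toward the organization that created them. Any fairness-oriented weighting that operates per client would need a safeguard against this kind of duplication.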

Further research could also investigate the scalability of FFLD to large-scale federated learning settings with hundreds or thousands of organizations, as well as its robustness to noisy or adversarial updates from individual organizations.

Conclusion

This paper presents a novel approach to fair federated learning that addresses the challenge of domain skew by enforcing local consistency and domain diversity. The proposed FFLD method outperforms existing federated learning techniques, suggesting it could be a valuable tool for organizations looking to collaboratively train machine learning models while preserving data privacy and fairness.

However, the research also highlights the need for continued work on making federated learning systems more secure, scalable, and robust to strategic behavior from participants. As federated learning becomes more widely adopted, addressing these challenges will be crucial to ensuring the technology delivers on its promise of collaborative, privacy-preserving AI.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Fair Concurrent Training of Multiple Models in Federated Learning

Marie Siew, Haoran Zhang, Jong-Ik Park, Yuezhou Liu, Yichen Ruan, Lili Su, Stratis Ioannidis, Edmund Yeh, Carlee Joe-Wong

Federated learning (FL) enables collaborative learning across multiple clients. In most FL work, all clients train a single learning task. However, the recent proliferation of FL applications may increasingly require multiple FL tasks to be trained simultaneously, sharing clients' computing and communication resources, which we call Multiple-Model Federated Learning (MMFL). Current MMFL algorithms use naive average-based client-task allocation schemes that can lead to unfair performance when FL tasks have heterogeneous difficulty levels, e.g., tasks with larger models may need more rounds and data to train. Just as naively allocating resources to generic computing jobs with heterogeneous resource needs can lead to unfair outcomes, naive allocation of clients to FL tasks can lead to unfairness, with some tasks having excessively long training times, or lower converged accuracies. Furthermore, in the FL setting, since clients are typically not paid for their training effort, we face a further challenge that some clients may not even be willing to train some tasks, e.g., due to high computational costs, which may exacerbate unfairness in training outcomes across tasks. We address both challenges by firstly designing FedFairMMFL, a difficulty-aware algorithm that dynamically allocates clients to tasks in each training round. We provide guarantees on fairness and FedFairMMFL's convergence rate. We then propose a novel auction design that incentivizes clients to train multiple tasks, so as to fairly distribute clients' training efforts across the tasks. We show how our fairness-based learning and incentive mechanisms impact training convergence and finally evaluate our algorithm with multiple sets of learning tasks on real-world datasets.

4/23/2024

cs.LG cs.AI

Fairness Without Demographics in Human-Centered Federated Learning

Shaily Roy, Harshit Sharma, Asif Salekin

Federated learning (FL) enables collaborative model training while preserving data privacy, making it suitable for decentralized human-centered AI applications. However, a significant research gap remains in ensuring fairness in these systems. Current fairness strategies in FL require knowledge of bias-creating/sensitive attributes, clashing with FL's privacy principles. Moreover, in human-centered datasets, sensitive attributes may remain latent. To tackle these challenges, we present a novel bias mitigation approach inspired by Fairness without Demographics in machine learning. The presented approach achieves fairness without needing knowledge of sensitive attributes by minimizing the top eigenvalue of the Hessian matrix during training, ensuring equitable loss landscapes across FL participants. Notably, we introduce a novel FL aggregation scheme that promotes participating models based on error rates and loss landscape curvature attributes, fostering fairness across the FL system. This work represents the first approach to attaining Fairness without Demographics in human-centered FL. Through comprehensive evaluation, our approach demonstrates effectiveness in balancing fairness and efficacy across various real-world applications, FL setups, and scenarios involving single and multiple bias-inducing factors, representing a significant advancement in human-centered FL.

5/17/2024

cs.LG cs.AI cs.DC

Cross-Silo Federated Learning Across Divergent Domains with Iterative Parameter Alignment

Matt Gorbett, Hossein Shirazi, Indrakshi Ray

Learning from the collective knowledge of data dispersed across private sources can provide neural networks with enhanced generalization capabilities. Federated learning, a method for collaboratively training a machine learning model across remote clients, achieves this by combining client models via the orchestration of a central server. However, current approaches face two critical limitations: i) they struggle to converge when client domains are sufficiently different, and ii) current aggregation techniques produce an identical global model for each client. In this work, we address these issues by reformulating the typical federated learning setup: rather than learning a single global model, we learn N models each optimized for a common objective. To achieve this, we apply a weighted distance minimization to model parameters shared in a peer-to-peer topology. The resulting framework, Iterative Parameter Alignment, applies naturally to the cross-silo setting, and has the following properties: (i) a unique solution for each participant, with the option to globally converge each model in the federation, and (ii) an optional early-stopping mechanism to elicit fairness among peers in collaborative learning settings. These characteristics jointly provide a flexible new framework for iteratively learning from peer models trained on disparate datasets. We find that the technique achieves competitive results on a variety of data partitions compared to state-of-the-art approaches. Further, we show that the method is robust to divergent domains (i.e. disjoint classes across peers) where existing approaches struggle.

5/20/2024

cs.LG cs.CV cs.DC

Fairness-aware Federated Minimax Optimization with Convergence Guarantee

Gerry Windiarto Mohamad Dunda, Shenghui Song

Federated learning (FL) has garnered considerable attention due to its privacy-preserving feature. Nonetheless, the lack of freedom in managing user data can lead to group fairness issues, where models are biased towards sensitive factors such as race or gender. To tackle this issue, this paper proposes a novel algorithm, fair federated averaging with augmented Lagrangian method (FFALM), designed explicitly to address group fairness issues in FL. Specifically, we impose a fairness constraint on the training objective and solve the minimax reformulation of the constrained optimization problem. Then, we derive the theoretical upper bound for the convergence rate of FFALM. The effectiveness of FFALM in improving fairness is shown empirically on CelebA and UTKFace datasets in the presence of severe statistical heterogeneity.

5/30/2024

cs.LG cs.CY