A Multivocal Literature Review on Privacy and Fairness in Federated Learning

Read original: arXiv:2408.08666 - Published 8/19/2024 by Beatrice Balbierer, Lukas Heinlein, Domenique Zipperling, Niklas Kuhl

🌐

Overview

Federated learning is a way to develop AI models without requiring data sharing, which can help protect privacy.
However, research has shown that sensitive information can still be extracted during the training process, so additional privacy-preserving measures like differential privacy are necessary.
Fairness is also critical for real-world federated learning applications, ensuring fair distribution of performance and non-discriminatory behavior, especially in high-risk domains like healthcare.
There appears to be a tension between privacy and fairness, and this relationship needs further examination.

Plain English Explanation

Federated learning offers a promising approach to building AI models without the need to share data, which can help protect people's privacy. However, even with federated learning, researchers have found that sensitive information can still be extracted during the training process. This means additional privacy-preserving techniques, such as differential privacy, are essential.

Beyond privacy, fairness is crucial for real-world federated learning applications. Fairness ensures that the performance of the AI model is distributed fairly and that it doesn't discriminate, which is particularly important in high-risk areas like healthcare, where we need to avoid repeating past discriminatory errors.

Interestingly, recent research has suggested that there may be a tension between privacy and fairness in federated learning. This relationship needs to be better understood to develop integrated frameworks that can effectively balance these competing considerations.

Technical Explanation

The paper conducts a multivocal literature review to examine current methods for integrating privacy and fairness in federated learning. The analysis reveals that the relationship between privacy and fairness has been largely overlooked, posing a critical risk for real-world federated learning applications.

The authors highlight the need to explore the interconnections between privacy, fairness, and model performance. They advocate for the creation of integrated federated learning frameworks that can effectively address these concerns simultaneously.

Critical Analysis

The paper rightly identifies the importance of addressing both privacy and fairness in federated learning, as these factors are crucial for the successful deployment of these systems in real-world, high-stakes applications. The literature review provides a valuable overview of the current state of research in this area.

However, the paper does not delve deeply into the specific technical details or trade-offs involved in balancing privacy and fairness. It also does not provide concrete suggestions or guidelines for how to develop the integrated federated learning frameworks that the authors call for.

Further research is needed to better understand the underlying mechanisms and design principles that can enable the simultaneous optimization of privacy, fairness, and performance in federated learning systems. Empirical studies evaluating the practical implementation of such frameworks would also be valuable.

Conclusion

This paper highlights the critical need to consider both privacy and fairness in the development of federated learning applications. It argues that the relationship between these two factors has been overlooked, posing significant risks for real-world deployment.

The authors advocate for the creation of integrated federated learning frameworks that can effectively balance privacy, fairness, and performance considerations. Addressing this challenge will be essential for unlocking the full potential of federated learning and ensuring its responsible and equitable application, especially in high-stakes domains like healthcare.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🌐

A Multivocal Literature Review on Privacy and Fairness in Federated Learning

Beatrice Balbierer, Lukas Heinlein, Domenique Zipperling, Niklas Kuhl

Federated Learning presents a way to revolutionize AI applications by eliminating the necessity for data sharing. Yet, research has shown that information can still be extracted during training, making additional privacy-preserving measures such as differential privacy imperative. To implement real-world federated learning applications, fairness, ranging from a fair distribution of performance to non-discriminative behaviour, must be considered. Particularly in high-risk applications (e.g. healthcare), avoiding the repetition of past discriminatory errors is paramount. As recent research has demonstrated an inherent tension between privacy and fairness, we conduct a multivocal literature review to examine the current methods to integrate privacy and fairness in federated learning. Our analyses illustrate that the relationship between privacy and fairness has been neglected, posing a critical risk for real-world applications. We highlight the need to explore the relationship between privacy, fairness, and performance, advocating for the creation of integrated federated learning frameworks.

8/19/2024

Linkage on Security, Privacy and Fairness in Federated Learning: New Balances and New Perspectives

Linlin Wang, Tianqing Zhu, Wanlei Zhou, Philip S. Yu

Federated learning is fast becoming a popular paradigm for applications involving mobile devices, banking systems, healthcare, and IoT systems. Hence, over the past five years, researchers have undertaken extensive studies on the privacy leaks, security threats, and fairness associated with these emerging models. For the most part, these three critical concepts have been studied in isolation; however, recent research has revealed that there may be an intricate interplay between them. For instance, some researchers have discovered that pursuing fairness may compromise privacy, or that efforts to enhance security can impact fairness. These emerging insights shed light on the fundamental connections between privacy, security, and fairness within federated learning, and, by delving deeper into these interconnections, we may be able to significantly augment research and development across the field. Consequently, the aim of this survey is to offer comprehensive descriptions of the privacy, security, and fairness issues in federated learning. Moreover, we analyze the complex relationships between these three dimensions of cyber safety and pinpoint the fundamental elements that influence each of them. We contend that there exists a trade-off between privacy and fairness and between security and gradient sharing. On this basis, fairness can function as a bridge between privacy and security to build models that are either more secure or more private. Building upon our observations, we identify the trade-offs between privacy and fairness and between security and fairness within the context of federated learning. The survey then concludes with promising directions for future research in this vanguard field.

6/18/2024

Federated Fairness Analytics: Quantifying Fairness in Federated Learning

Oscar Dilley, Juan Marcelo Parra-Ullauri, Rasheed Hussain, Dimitra Simeonidou

Federated Learning (FL) is a privacy-enhancing technology for distributed ML. By training models locally and aggregating updates - a federation learns together, while bypassing centralised data collection. FL is increasingly popular in healthcare, finance and personal computing. However, it inherits fairness challenges from classical ML and introduces new ones, resulting from differences in data quality, client participation, communication constraints, aggregation methods and underlying hardware. Fairness remains an unresolved issue in FL and the community has identified an absence of succinct definitions and metrics to quantify fairness; to address this, we propose Federated Fairness Analytics - a methodology for measuring fairness. Our definition of fairness comprises four notions with novel, corresponding metrics. They are symptomatically defined and leverage techniques originating from XAI, cooperative game-theory and networking engineering. We tested a range of experimental settings, varying the FL approach, ML task and data settings. The results show that statistical heterogeneity and client participation affect fairness and fairness conscious approaches such as Ditto and q-FedAvg marginally improve fairness-performance trade-offs. Using our techniques, FL practitioners can uncover previously unobtainable insights into their system's fairness, at differing levels of granularity in order to address fairness challenges in FL. We have open-sourced our work at: https://github.com/oscardilley/federated-fairness.

8/16/2024

PUFFLE: Balancing Privacy, Utility, and Fairness in Federated Learning

Luca Corbucci, Mikko A Heikkila, David Solans Noguero, Anna Monreale, Nicolas Kourtellis

Training and deploying Machine Learning models that simultaneously adhere to principles of fairness and privacy while ensuring good utility poses a significant challenge. The interplay between these three factors of trustworthiness is frequently underestimated and remains insufficiently explored. Consequently, many efforts focus on ensuring only two of these factors, neglecting one in the process. The decentralization of the datasets and the variations in distributions among the clients exacerbate the complexity of achieving this ethical trade-off in the context of Federated Learning (FL). For the first time in FL literature, we address these three factors of trustworthiness. We introduce PUFFLE, a high-level parameterised approach that can help in the exploration of the balance between utility, privacy, and fairness in FL scenarios. We prove that PUFFLE can be effective across diverse datasets, models, and data distributions, reducing the model unfairness up to 75%, with a maximum reduction in the utility of 17% in the worst-case scenario, while maintaining strict privacy guarantees during the FL training.

7/23/2024