Differentially-Private Hierarchical Federated Learning

2401.11592

Published 5/17/2024 by Frank Po-Chen Lin, Christopher Brinton

Differentially-Private Hierarchical Federated Learning

Abstract

While federated learning (FL) eliminates the transmission of raw data over a network, it is still vulnerable to privacy breaches from the communicated model parameters. In this work, we propose underline{H}ierarchical underline{F}ederated Learning with underline{H}ierarchical underline{D}ifferential underline{P}rivacy ({tt H$^2$FDP}), a DP-enhanced FL methodology for jointly optimizing privacy and performance in hierarchical networks. Building upon recent proposals for Hierarchical Differential Privacy (HDP), one of the key concepts of {tt H$^2$FDP} is adapting DP noise injection at different layers of an established FL hierarchy -- edge devices, edge servers, and cloud servers -- according to the trust models within particular subnetworks. We conduct a comprehensive analysis of the convergence behavior of {tt H$^2$FDP}, revealing conditions on parameter tuning under which the training process converges sublinearly to a finite stationarity gap that depends on the network hierarchy, trust model, and target privacy level. Leveraging these relationships, we develop an adaptive control algorithm for {tt H$^2$FDP} that tunes properties of local model training to minimize communication energy, latency, and the stationarity gap while striving to maintain a sub-linear convergence rate and meet desired privacy criteria. Subsequent numerical evaluations demonstrate that {tt H$^2$FDP} obtains substantial improvements in these metrics over baselines for different privacy budgets, and validate the impact of different system configurations.

Create account to get full access

Overview

This paper proposes a formal analysis and evaluation of differential privacy in hierarchical federated learning.
It examines how to effectively apply differential privacy to protect the privacy of participants in a hierarchical federated learning system.
The authors provide a theoretical framework and empirical evaluation to understand the privacy-utility tradeoffs and optimize the differential privacy parameters.

Plain English Explanation

The paper looks at how to protect the privacy of people's data in a type of machine learning called "federated learning." In federated learning, many different devices or organizations collaborate to train a shared machine learning model, without needing to share all of their private data.

The authors focus on a more complex version of federated learning, called "hierarchical federated learning." In this setup, the devices or organizations are organized into a hierarchical structure, with multiple levels of coordination. The researchers investigate how to add "differential privacy" to this hierarchical system in order to further protect people's privacy.

Differential privacy is a mathematical technique that adds controlled noise or randomness to the data, making it hard for anyone to identify individual people's information. The key is to find the right balance - adding enough noise to protect privacy, but not so much that it ruins the usefulness of the machine learning model.

The paper provides a theoretical framework to analyze this privacy-utility tradeoff, as well as empirical experiments to see how different settings of the differential privacy parameters affect the model's performance. This helps the researchers understand how to best apply differential privacy in hierarchical federated learning systems.

Technical Explanation

The paper proposes a formal analysis and evaluation of differential privacy in the context of hierarchical federated learning.

The authors first provide a theoretical framework to analyze the privacy-utility tradeoffs when applying differential privacy to hierarchical federated learning systems. This involves modeling the aggregation of noisy updates from devices at multiple levels of the hierarchy, and deriving expressions for the privacy loss and utility degradation.

The paper then presents an empirical evaluation using both synthetic and real-world datasets. The experiments compare the performance of hierarchical federated learning with and without differential privacy, exploring how different settings of the differential privacy parameters (e.g. noise scale) impact model accuracy. The results shed light on how to effectively balance privacy and utility in these hierarchical systems.

The analysis and experiments demonstrate that differential privacy can be successfully incorporated into hierarchical federated learning, providing provable privacy guarantees while maintaining reasonable model performance. The paper's theoretical framework and empirical insights can guide the design of practical federated learning systems that prioritize both privacy and utility.

Critical Analysis

The paper provides a rigorous theoretical and empirical treatment of the privacy-utility tradeoffs in hierarchical federated learning with differential privacy. The authors have carefully modeled the multi-level aggregation process and derived informative expressions quantifying the privacy loss and utility degradation.

That said, the analysis makes some simplifying assumptions, such as independent and identically distributed data across devices, which may not always hold in real-world federated learning scenarios. Additionally, the empirical evaluation is limited to a few specific datasets and models - more comprehensive testing on diverse applications would further strengthen the conclusions.

The paper also does not address some practical challenges of implementing differential privacy in federated learning, such as the computational overhead, communication costs, and potential negative impact on model convergence. Exploring these engineering considerations could yield important insights for deploying such privacy-preserving techniques in production systems.

Overall, the work represents an important step forward in understanding how to effectively balance privacy and utility in hierarchical federated learning. The theoretical framework and experimental results provided in the paper can serve as a valuable foundation for future research in this area.

Conclusion

This paper presents a formal analysis and evaluation of applying differential privacy to hierarchical federated learning systems. The authors develop a theoretical model to understand the privacy-utility tradeoffs, and conduct empirical experiments to validate their approach.

The results show that differential privacy can be successfully incorporated into hierarchical federated learning, providing provable privacy guarantees while maintaining reasonable model performance. This work contributes important insights that can guide the design of practical federated learning systems that prioritize both privacy and utility.

The paper's theoretical framework and empirical findings represent a significant advancement in our understanding of how to effectively balance these competing objectives in the context of federated learning with hierarchical structures. This research paves the way for the development of more secure and privacy-preserving machine learning systems that can be deployed at scale.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Enhancing Federated Learning with Adaptive Differential Privacy and Priority-Based Aggregation

Mahtab Talaei, Iman Izadi

Federated learning (FL), a novel branch of distributed machine learning (ML), develops global models through a private procedure without direct access to local datasets. However, it is still possible to access the model updates (gradient updates of deep neural networks) transferred between clients and servers, potentially revealing sensitive local information to adversaries using model inversion attacks. Differential privacy (DP) offers a promising approach to addressing this issue by adding noise to the parameters. On the other hand, heterogeneities in data structure, storage, communication, and computational capabilities of devices can cause convergence problems and delays in developing the global model. A personalized weighted averaging of local parameters based on the resources of each device can yield a better aggregated model in each round. In this paper, to efficiently preserve privacy, we propose a personalized DP framework that injects noise based on clients' relative impact factors and aggregates parameters while considering heterogeneities and adjusting properties. To fulfill the DP requirements, we first analyze the convergence boundary of the FL algorithm when impact factors are personalized and fixed throughout the learning process. We then further study the convergence property considering time-varying (adaptive) impact factors.

6/27/2024

cs.LG cs.CR cs.DC

Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning

Saber Malekmohammadi, Yaoliang Yu, Yang Cao

High utility and rigorous data privacy are of the main goals of a federated learning (FL) system, which learns a model from the data distributed among some clients. The latter has been tried to achieve by using differential privacy in FL (DPFL). There is often heterogeneity in clients privacy requirements, and existing DPFL works either assume uniform privacy requirements for clients or are not applicable when server is not fully trusted (our setting). Furthermore, there is often heterogeneity in batch and/or dataset size of clients, which as shown, results in extra variation in the DP noise level across clients model updates. With these sources of heterogeneity, straightforward aggregation strategies, e.g., assigning clients aggregation weights proportional to their privacy parameters will lead to lower utility. We propose Robust-HDP, which efficiently estimates the true noise level in clients model updates and reduces the noise-level in the aggregated model updates considerably. Robust-HDP improves utility and convergence speed, while being safe to the clients that may maliciously send falsified privacy parameter to server. Extensive experimental results on multiple datasets and our theoretical analysis confirm the effectiveness of Robust-HDP. Our code can be found here.

6/7/2024

cs.LG cs.CR cs.DC

Mitigating Disparate Impact of Differential Privacy in Federated Learning through Robust Clustering

Saber Malekmohammadi, Afaf Taik, Golnoosh Farnadi

Federated Learning (FL) is a decentralized machine learning (ML) approach that keeps data localized and often incorporates Differential Privacy (DP) to enhance privacy guarantees. Similar to previous work on DP in ML, we observed that differentially private federated learning (DPFL) introduces performance disparities, particularly affecting minority groups. Recent work has attempted to address performance fairness in vanilla FL through clustering, but this method remains sensitive and prone to errors, which are further exacerbated by the DP noise in DPFL. To fill this gap, in this paper, we propose a novel clustered DPFL algorithm designed to effectively identify clients' clusters in highly heterogeneous settings while maintaining high accuracy with DP guarantees. To this end, we propose to cluster clients based on both their model updates and training loss values. Our proposed approach also addresses the server's uncertainties in clustering clients' model updates by employing larger batch sizes along with Gaussian Mixture Model (GMM) to alleviate the impact of noise and potential clustering errors, especially in privacy-sensitive scenarios. We provide theoretical analysis of the effectiveness of our proposed approach. We also extensively evaluate our approach across diverse data distributions and privacy budgets and show its effectiveness in mitigating the disparate impact of DP in FL settings with a small computational cost.

5/30/2024

cs.LG cs.CR cs.DC

QMGeo: Differentially Private Federated Learning via Stochastic Quantization with Mixed Truncated Geometric Distribution

Zixi Wang, M. Cenk Gursoy

Federated learning (FL) is a framework which allows multiple users to jointly train a global machine learning (ML) model by transmitting only model updates under the coordination of a parameter server, while being able to keep their datasets local. One key motivation of such distributed frameworks is to provide privacy guarantees to the users. However, preserving the users' datasets locally is shown to be not sufficient for privacy. Several differential privacy (DP) mechanisms have been proposed to provide provable privacy guarantees by introducing randomness into the framework, and majority of these mechanisms rely on injecting additive noise. FL frameworks also face the challenge of communication efficiency, especially as machine learning models grow in complexity and size. Quantization is a commonly utilized method, reducing the communication cost by transmitting compressed representation of the underlying information. Although there have been several studies on DP and quantization in FL, the potential contribution of the quantization method alone in providing privacy guarantees has not been extensively analyzed yet. We in this paper present a novel stochastic quantization method, utilizing a mixed geometric distribution to introduce the randomness needed to provide DP, without any additive noise. We provide convergence analysis for our framework and empirically study its performance.

6/12/2024

cs.LG cs.AI