Communication-efficient Vertical Federated Learning via Compressed Error Feedback

Read original: arXiv:2406.14420 - Published 6/21/2024 by Pedro Valdeira, Jo~ao Xavier, Cl'audia Soares, Yuejie Chi

Communication-efficient Vertical Federated Learning via Compressed Error Feedback

Overview

This paper proposes a communication-efficient approach to vertical federated learning, where data is distributed across different organizations.
The key idea is to use compressed error feedback, which reduces the amount of data that needs to be transmitted between the server and clients during model training.
The proposed method is shown to achieve superior performance compared to existing federated learning techniques, while significantly reducing communication overhead.

Plain English Explanation

Vertical federated learning is a way for different organizations to work together on a machine learning model without sharing their private data. In this setting, each organization has its own unique data that it wants to keep private, but they can still benefit from collaborating on the model.

The main challenge in vertical federated learning is that a lot of data needs to be sent back and forth between the organizations and the central server during the training process. This can be slow and expensive, especially if the organizations have a lot of data.

The researchers in this paper have come up with a new technique called "compressed error feedback" to address this problem. The key idea is to only send a compressed, or summarized, version of the errors or mistakes the model is making, rather than sending all the raw data. This significantly reduces the amount of data that needs to be transmitted, making the overall process much more efficient.

Their experiments show that this compressed error feedback approach can achieve better model performance than other federated learning methods, while also dramatically reducing the communication costs. This is an important advance that could make vertical federated learning much more practical and widely adopted in real-world applications.

Technical Explanation

The paper proposes a communication-efficient vertical federated learning approach using compressed error feedback. In this setting, the data is vertically partitioned across multiple organizations, each with their own private dataset.

The key innovation is to only transmit a compressed version of the model's errors or gradients, rather than the full raw data, during the iterative training process. This is achieved by leveraging side information about the data distribution to adaptively compress the error feedback.

The authors design a compression scheme that exploits the low-rank structure of the error matrices, allowing for significant reduction in communication costs without sacrificing model performance. This is in contrast to prior work that used fixed compression rates or relied on computationally expensive eigen-decompositions.

Extensive experiments on real-world datasets demonstrate that the proposed approach outperforms existing federated learning baselines in terms of both test accuracy and communication efficiency.

Critical Analysis

The paper provides a thorough theoretical analysis of the proposed compressed error feedback approach, including convergence guarantees and bounds on the communication complexity. However, the authors do not explore the impact of factors like data heterogeneity or system failures on the practical performance of their method.

Additionally, while the experiments cover a range of datasets and models, they focus primarily on the vertical federated learning setting. It would be valuable to see how the compressed error feedback technique compares to other communication-efficient federated learning approaches, such as those that leverage adaptive compression or hierarchical structures, in a more general federated learning context.

Conclusion

This paper presents a novel communication-efficient approach to vertical federated learning using compressed error feedback. By only transmitting a compressed version of the model's errors, the proposed method is able to achieve superior performance while significantly reducing the overall communication costs.

The theoretical analysis and experimental results demonstrate the effectiveness of this technique, which could have important practical implications for making vertical federated learning more scalable and widely applicable. Future work could explore extensions to other federated learning settings and further optimizations to the compression scheme.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Communication-efficient Vertical Federated Learning via Compressed Error Feedback

Pedro Valdeira, Jo~ao Xavier, Cl'audia Soares, Yuejie Chi

Communication overhead is a known bottleneck in federated learning (FL). To address this, lossy compression is commonly used on the information communicated between the server and clients during training. In horizontal FL, where each client holds a subset of the samples, such communication-compressed training methods have recently seen significant progress. However, in their vertical FL counterparts, where each client holds a subset of the features, our understanding remains limited. To address this, we propose an error feedback compressed vertical federated learning (EFVFL) method to train split neural networks. In contrast with previous communication-compressed methods for vertical FL, EFVFL does not require a vanishing compression error for the gradient norm to converge to zero for smooth nonconvex problems. By leveraging error feedback, our method can achieve a $mathcal{O}(1/T)$ convergence rate in the full-batch case, improving over the state-of-the-art $mathcal{O}(1/sqrt{T})$ rate under $mathcal{O}(1/sqrt{T})$ compression error, and matching the rate of uncompressed methods. Further, when the objective function satisfies the Polyak-{L}ojasiewicz inequality, our method converges linearly. In addition to improving convergence rates, our method also supports the use of private labels. Numerical experiments show that EFVFL significantly improves over the prior art, confirming our theoretical results.

6/21/2024

Efficient Model Compression for Hierarchical Federated Learning

Xi Zhu, Songcan Yu, Junbo Wang, Qinglin Yang

Federated learning (FL), as an emerging collaborative learning paradigm, has garnered significant attention due to its capacity to preserve privacy within distributed learning systems. In these systems, clients collaboratively train a unified neural network model using their local datasets and share model parameters rather than raw data, enhancing privacy. Predominantly, FL systems are designed for mobile and edge computing environments where training typically occurs over wireless networks. Consequently, as model sizes increase, the conventional FL frameworks increasingly consume substantial communication resources. To address this challenge and improve communication efficiency, this paper introduces a novel hierarchical FL framework that integrates the benefits of clustered FL and model compression. We present an adaptive clustering algorithm that identifies a core client and dynamically organizes clients into clusters. Furthermore, to enhance transmission efficiency, each core client implements a local aggregation with compression (LC aggregation) algorithm after collecting compressed models from other clients within the same cluster. Simulation results affirm that our proposed algorithms not only maintain comparable predictive accuracy but also significantly reduce energy consumption relative to existing FL mechanisms.

5/29/2024

⚙️

Improving Privacy-Preserving Vertical Federated Learning by Efficient Communication with ADMM

Chulin Xie, Pin-Yu Chen, Qinbin Li, Arash Nourian, Ce Zhang, Bo Li

Federated learning (FL) enables distributed resource-constrained devices to jointly train shared models while keeping the training data local for privacy purposes. Vertical FL (VFL), which allows each client to collect partial features, has attracted intensive research efforts recently. We identified the main challenges that existing VFL frameworks are facing: the server needs to communicate gradients with the clients for each training step, incurring high communication cost that leads to rapid consumption of privacy budgets. To address these challenges, in this paper, we introduce a VFL framework with multiple heads (VIM), which takes the separate contribution of each client into account, and enables an efficient decomposition of the VFL optimization objective to sub-objectives that can be iteratively tackled by the server and the clients on their own. In particular, we propose an Alternating Direction Method of Multipliers (ADMM)-based method to solve our optimization problem, which allows clients to conduct multiple local updates before communication, and thus reduces the communication cost and leads to better performance under differential privacy (DP). We provide the user-level DP mechanism for our framework to protect user privacy. Moreover, we show that a byproduct of VIM is that the weights of learned heads reflect the importance of local clients. We conduct extensive evaluations and show that on four vertical FL datasets, VIM achieves significantly higher performance and faster convergence compared with the state-of-the-art. We also explicitly evaluate the importance of local clients and show that VIM enables functionalities such as client-level explanation and client denoising. We hope this work will shed light on a new way of effective VFL training and understanding.

4/9/2024

📊

Communication-Efficient Hybrid Federated Learning for E-health with Horizontal and Vertical Data Partitioning

Chong Yu, Shuaiqi Shen, Shiqiang Wang, Kuan Zhang, Hai Zhao

E-health allows smart devices and medical institutions to collaboratively collect patients' data, which is trained by Artificial Intelligence (AI) technologies to help doctors make diagnosis. By allowing multiple devices to train models collaboratively, federated learning is a promising solution to address the communication and privacy issues in e-health. However, applying federated learning in e-health faces many challenges. First, medical data is both horizontally and vertically partitioned. Since single Horizontal Federated Learning (HFL) or Vertical Federated Learning (VFL) techniques cannot deal with both types of data partitioning, directly applying them may consume excessive communication cost due to transmitting a part of raw data when requiring high modeling accuracy. Second, a naive combination of HFL and VFL has limitations including low training efficiency, unsound convergence analysis, and lack of parameter tuning strategies. In this paper, we provide a thorough study on an effective integration of HFL and VFL, to achieve communication efficiency and overcome the above limitations when data is both horizontally and vertically partitioned. Specifically, we propose a hybrid federated learning framework with one intermediate result exchange and two aggregation phases. Based on this framework, we develop a Hybrid Stochastic Gradient Descent (HSGD) algorithm to train models. Then, we theoretically analyze the convergence upper bound of the proposed algorithm. Using the convergence results, we design adaptive strategies to adjust the training parameters and shrink the size of transmitted data. Experimental results validate that the proposed HSGD algorithm can achieve the desired accuracy while reducing communication cost, and they also verify the effectiveness of the adaptive strategies.

4/17/2024