Split Learning without Local Weight Sharing to Enhance Client-side Data Privacy

Read original: arXiv:2212.00250 - Published 7/23/2024 by Ngoc Duy Pham, Tran Khoa Phan, Alsharif Abuadbba, Yansong Gao, Doan Nguyen, Naveen Chilamkurti

📊

Overview

Split learning (SL) aims to protect user data privacy by distributing deep models between client-server and keeping private data locally.
In SL training with multiple clients, the local model weights are shared among the clients for local model update.
This paper reveals data privacy leakage from local weight sharing among clients in SL through model inversion attacks.
To reduce the data privacy leakage, the paper proposes and analyzes privacy-enhanced SL (P-SL) without local weight sharing.
The paper also proposes parallelized P-SL to expedite the training process by duplicating multiple server-side model instances without compromising accuracy.
The paper explores P-SL with late participating clients and devises a server-side cache-based training method to address the forgetting phenomenon in SL when late clients join.

Plain English Explanation

Split learning (SL) is a technique that aims to protect people's private data by dividing deep learning models between the client (e.g., your phone) and the server (e.g., a company's computer). This keeps the sensitive data on the client side instead of sending it to the server.

In SL training with multiple clients, the clients share their local model weights with each other to update their local models. However, this paper shows that this weight sharing can actually leak information about the clients' private data through a type of attack called "model inversion."

To address this privacy issue, the paper proposes "privacy-enhanced SL" (P-SL), which avoids sharing local weights among clients. The paper also introduces "parallelized P-SL," which speeds up the training process by using multiple copies of the server-side model without sacrificing accuracy.

Additionally, the paper explores how P-SL works when new clients join the training process late. It devises a method where the server caches past training data to help the new clients catch up without forgetting what was previously learned.

Overall, this research aims to make split learning more privacy-preserving while maintaining the performance benefits. The key ideas are to avoid sharing private data and weights, use parallelism to speed up training, and handle late-joining clients through caching.

Technical Explanation

This paper first reveals that the local weight sharing among clients in traditional split learning (SL) can lead to data privacy leakage through model inversion attacks. To address this issue, the authors propose a privacy-enhanced version of SL, called P-SL, which avoids local weight sharing.

In P-SL, the clients send their local gradients directly to the server, rather than sharing their local model weights. The server then updates a global model and sends the updated global model back to the clients. This decoupling of the local and global models prevents the leakage of private information through the shared weights.

To expedite the training process in P-SL, the authors further propose a parallelized version, where multiple instances of the server-side model are maintained and updated concurrently. This parallelization allows for faster convergence without compromising the model's accuracy.

The paper also explores the scenario of late-joining clients in P-SL. When new clients join the training process after the initial rounds, they may struggle to catch up due to the "forgetting" phenomenon, where the model forgets what it has learned from previous clients. To address this, the authors devise a server-side cache-based training method, where the server stores past training data and uses it to help the late-joining clients learn without negatively impacting the overall model performance.

The experimental results demonstrate that P-SL can reduce up to 50% of client-side data leakage compared to traditional SL, achieving a better privacy-accuracy trade-off. Additionally, P-SL and its cache-based version achieve comparable accuracy to the baseline SL under various data distributions, while requiring less computation and communication.

Critical Analysis

The paper presents a compelling solution to the privacy leakage issue in traditional split learning (SL) by introducing privacy-enhanced SL (P-SL). The key innovation of avoiding local weight sharing among clients is a significant step towards improving data privacy in distributed deep learning.

One potential limitation of the proposed approach is the increased computational and communication overhead compared to traditional SL, due to the need to send gradients instead of weights and maintain multiple server-side model instances. The authors have addressed this to some extent by introducing parallelized P-SL, but the overall efficiency and scalability of the approach could be further investigated.

Additionally, the paper focuses on mitigating the forgetting phenomenon when late-joining clients participate in the training process. While the proposed cache-based method seems effective, it may introduce additional complexity and storage requirements, especially in scenarios with a large number of late-joining clients. Exploring alternative approaches to handle the forgetting issue could be an area for further research.

Furthermore, the paper does not explicitly address the potential impact of the proposed techniques on the model's robustness or generalization performance. Investigating these aspects would provide a more comprehensive understanding of the trade-offs and practical implications of adopting P-SL in real-world applications.

Overall, the research presented in this paper represents a significant contribution to enhancing data privacy in split learning, and the proposed P-SL approach offers a promising direction for future developments in this field.

Conclusion

This paper tackles the data privacy leakage issue in traditional split learning (SL) by introducing privacy-enhanced SL (P-SL), which avoids local weight sharing among clients. The authors demonstrate that P-SL can reduce up to 50% of client-side data leakage, achieving a better privacy-accuracy trade-off compared to current approaches using differential privacy.

Additionally, the paper proposes parallelized P-SL to expedite the training process and a server-side cache-based training method to address the forgetting phenomenon when late-joining clients participate. These innovations maintain comparable accuracy to the baseline SL while reducing computational and communication costs.

The research presented in this paper represents a significant advancement in protecting user data privacy in distributed deep learning. By decoupling the local and global models and introducing parallelism and caching mechanisms, the authors have developed a more robust and practical split learning framework that can be valuable for a wide range of real-world applications where data privacy is a critical concern.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

📊

Split Learning without Local Weight Sharing to Enhance Client-side Data Privacy

Ngoc Duy Pham, Tran Khoa Phan, Alsharif Abuadbba, Yansong Gao, Doan Nguyen, Naveen Chilamkurti

Split learning (SL) aims to protect user data privacy by distributing deep models between client-server and keeping private data locally. In SL training with multiple clients, the local model weights are shared among the clients for local model update. This paper first reveals data privacy leakage exacerbated from local weight sharing among the clients in SL through model inversion attacks. Then, to reduce the data privacy leakage issue, we propose and analyze privacy-enhanced SL (P-SL) (or SL without local weight sharing). We further propose parallelized P-SL to expedite the training process by duplicating multiple server-side model instances without compromising accuracy. Finally, we explore P-SL with late participating clients and devise a server-side cache-based training method to address the forgetting phenomenon in SL when late clients join. Experimental results demonstrate that P-SL helps reduce up to 50% of client-side data leakage, which essentially achieves a better privacy-accuracy trade-off than the current trend by using differential privacy mechanisms. Moreover, P-SL and its cache-based version achieve comparable accuracy to baseline SL under various data distributions, while cost less computation and communication. Additionally, caching-based training in P-SL mitigates the negative effect of forgetting, stabilizes the learning, and enables practical and low-complexity training in a dynamic environment with late-arriving clients.

7/23/2024

👀

Make Split, not Hijack: Preventing Feature-Space Hijacking Attacks in Split Learning

Tanveer Khan, Mindaugas Budzys, Antonis Michalas

The popularity of Machine Learning (ML) makes the privacy of sensitive data more imperative than ever. Collaborative learning techniques like Split Learning (SL) aim to protect client data while enhancing ML processes. Though promising, SL has been proved to be vulnerable to a plethora of attacks, thus raising concerns about its effectiveness on data privacy. In this work, we introduce a hybrid approach combining SL and Function Secret Sharing (FSS) to ensure client data privacy. The client adds a random mask to the activation map before sending it to the servers. The servers cannot access the original function but instead work with shares generated using FSS. Consequently, during both forward and backward propagation, the servers cannot reconstruct the client's raw data from the activation map. Furthermore, through visual invertibility, we demonstrate that the server is incapable of reconstructing the raw image data from the activation map when using FSS. It enhances privacy by reducing privacy leakage compared to other SL-based approaches where the server can access client input information. Our approach also ensures security against feature space hijacking attack, protecting sensitive information from potential manipulation. Our protocols yield promising results, reducing communication overhead by over 2x and training time by over 7x compared to the same model with FSS, without any SL. Also, we show that our approach achieves >96% accuracy and remains equivalent to the plaintext models.

4/16/2024

New!Enhancing Privacy in ControlNet and Stable Diffusion via Split Learning

Dixi Yao

With the emerging trend of large generative models, ControlNet is introduced to enable users to fine-tune pre-trained models with their own data for various use cases. A natural question arises: how can we train ControlNet models while ensuring users' data privacy across distributed devices? Exploring different distributed training schemes, we find conventional federated learning and split learning unsuitable. Instead, we propose a new distributed learning structure that eliminates the need for the server to send gradients back. Through a comprehensive evaluation of existing threats, we discover that in the context of training ControlNet with split learning, most existing attacks are ineffective, except for two mentioned in previous literature. To counter these threats, we leverage the properties of diffusion models and design a new timestep sampling policy during forward processes. We further propose a privacy-preserving activation function and a method to prevent private text prompts from leaving clients, tailored for image generation with diffusion models. Our experimental results demonstrate that our algorithms and systems greatly enhance the efficiency of distributed training for ControlNet while ensuring users' data privacy without compromising image generation quality.

9/16/2024

A deep cut into Split Federated Self-supervised Learning

Marcin Przewik{e}'zlikowski, Marcin Osial, Bartosz Zieli'nski, Marek 'Smieja

Collaborative self-supervised learning has recently become feasible in highly distributed environments by dividing the network layers between client devices and a central server. However, state-of-the-art methods, such as MocoSFL, are optimized for network division at the initial layers, which decreases the protection of the client data and increases communication overhead. In this paper, we demonstrate that splitting depth is crucial for maintaining privacy and communication efficiency in distributed training. We also show that MocoSFL suffers from a catastrophic quality deterioration for the minimal communication overhead. As a remedy, we introduce Momentum-Aligned contrastive Split Federated Learning (MonAcoSFL), which aligns online and momentum client models during training procedure. Consequently, we achieve state-of-the-art accuracy while significantly reducing the communication overhead, making MonAcoSFL more practical in real-world scenarios.

6/13/2024