Mind the Privacy Unit! User-Level Differential Privacy for Language Model Fine-Tuning

2406.14322

Published 6/21/2024 by Lynn Chua, Badih Ghazi, Yangsibo Huang, Pritish Kamath, Daogao Liu, Pasin Manurangsi, Amer Sinha, Chiyuan Zhang

cs.CL cs.CR cs.LG

Mind the Privacy Unit! User-Level Differential Privacy for Language Model Fine-Tuning

Abstract

Large language models (LLMs) have emerged as powerful tools for tackling complex tasks across diverse domains, but they also raise privacy concerns when fine-tuned on sensitive data due to potential memorization. While differential privacy (DP) offers a promising solution by ensuring models are `almost indistinguishable' with or without any particular privacy unit, current evaluations on LLMs mostly treat each example (text record) as the privacy unit. This leads to uneven user privacy guarantees when contributions per user vary. We therefore study user-level DP motivated by applications where it necessary to ensure uniform privacy protection across users. We present a systematic evaluation of user-level DP for LLM fine-tuning on natural language generation tasks. Focusing on two mechanisms for achieving user-level DP guarantees, Group Privacy and User-wise DP-SGD, we investigate design choices like data selection strategies and parameter tuning for the best privacy-utility tradeoff.

Create account to get full access

Overview

Introduces a user-level differential privacy mechanism for fine-tuning language models to protect individual privacy.
Proposes two techniques: User-level Differential Privacy and Synthetic Query Generation.
Demonstrates effectiveness on a variety of language model benchmarks while preserving high performance.

Plain English Explanation

This paper focuses on preserving the privacy of individuals when fine-tuning large language models. Fine-tuning is the process of adapting a pre-trained model to a specific task or dataset, which can lead to concerns about information leakage about the individuals in the training data.

The researchers introduce two techniques to address this challenge. The first is user-level differential privacy, which adds noise to the model updates during fine-tuning to ensure that the final model cannot be used to identify individuals in the training data. This builds on prior work on differential privacy for machine learning.

The second technique is synthetic query generation, where the model is trained on synthetic data that mimics the statistical properties of the original data, but does not contain any real individual information. This allows the model to be fine-tuned without exposing private user data.

The researchers demonstrate that these techniques can be effectively applied to a variety of language model benchmarks, while still maintaining high performance on the target tasks. This is an important step towards developing privacy-preserving AI systems that can be deployed safely and ethically.

Technical Explanation

The paper introduces two key techniques for preserving user-level privacy when fine-tuning language models:

User-level Differential Privacy: The authors propose a user-level differential privacy mechanism that adds noise to the model updates during fine-tuning. This ensures that the final model cannot be used to identify individuals in the training data, even if an adversary has access to the model parameters. This builds on prior work on differential privacy for machine learning.
Synthetic Query Generation: The authors also introduce a technique to fine-tune language models on synthetic data that mimics the statistical properties of the original training data, but does not contain any real user information. This allows the model to be fine-tuned without exposing private user data, as in prior work on privacy-preserving deep retrieval.

The authors evaluate these techniques on a variety of language model benchmarks, including PrivLM-Bench, and demonstrate that they can achieve strong performance while preserving user-level privacy. They also show that their approach outperforms prior work on differentially private knowledge distillation.

Critical Analysis

The paper presents a promising approach to preserving user-level privacy in language model fine-tuning, but there are a few potential limitations and areas for further research:

The authors note that their user-level differential privacy mechanism may still leak some information about individual users, and further research is needed to quantify and minimize this leakage.
The synthetic data generation technique relies on the ability to accurately model the statistical properties of the original training data, which may be challenging for complex real-world datasets.
The paper focuses on language model benchmarks, and it's unclear how well the techniques would scale to larger, more diverse datasets or more complex machine learning tasks.

Despite these caveats, the work represents an important step towards developing privacy-preserving AI systems that can be deployed safely and ethically. Readers are encouraged to think critically about the research and consider how these techniques could be further improved and applied in real-world scenarios.

Conclusion

This paper introduces two novel techniques, user-level differential privacy and synthetic query generation, to preserve individual privacy when fine-tuning language models. The authors demonstrate the effectiveness of these approaches on a variety of benchmarks, while maintaining high performance on the target tasks.

This research is a significant contribution to the field of privacy-preserving machine learning, as it addresses a critical challenge in the deployment of large language models. By protecting individual privacy, these techniques can help enable the safe and ethical use of powerful AI systems in a wide range of real-world applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Learning with User-Level Local Differential Privacy

Puning Zhao, Li Shen, Rongfei Fan, Qingming Li, Huiwen Wu, Jiafei Wu, Zhe Liu

User-level privacy is important in distributed systems. Previous research primarily focuses on the central model, while the local models have received much less attention. Under the central model, user-level DP is strictly stronger than the item-level one. However, under the local model, the relationship between user-level and item-level LDP becomes more complex, thus the analysis is crucially different. In this paper, we first analyze the mean estimation problem and then apply it to stochastic optimization, classification, and regression. In particular, we propose adaptive strategies to achieve optimal performance at all privacy levels. Moreover, we also obtain information-theoretic lower bounds, which show that the proposed methods are minimax optimal up to logarithmic factors. Unlike the central DP model, where user-level DP always leads to slower convergence, our result shows that under the local model, the convergence rates are nearly the same between user-level and item-level cases for distributions with bounded support. For heavy-tailed distributions, the user-level rate is even faster than the item-level one.

5/28/2024

stat.ML cs.LG

LMO-DP: Optimizing the Randomization Mechanism for Differentially Private Fine-Tuning (Large) Language Models

Qin Yang, Meisam Mohammad, Han Wang, Ali Payani, Ashish Kundu, Kai Shu, Yan Yan, Yuan Hong

Differentially Private Stochastic Gradient Descent (DP-SGD) and its variants have been proposed to ensure rigorous privacy for fine-tuning large-scale pre-trained language models. However, they rely heavily on the Gaussian mechanism, which may overly perturb the gradients and degrade the accuracy, especially in stronger privacy regimes (e.g., the privacy budget $epsilon < 3$). To address such limitations, we propose a novel Language Model-based Optimal Differential Privacy (LMO-DP) mechanism, which takes the first step to enable the tight composition of accurately fine-tuning (large) language models with a sub-optimal DP mechanism, even in strong privacy regimes (e.g., $0.1leq epsilon<3$). Furthermore, we propose a novel offline optimal noise search method to efficiently derive the sub-optimal DP that significantly reduces the noise magnitude. For instance, fine-tuning RoBERTa-large (with 300M parameters) on the SST-2 dataset can achieve an accuracy of 92.20% (given $epsilon=0.3$, $delta=10^{-10}$) by drastically outperforming the Gaussian mechanism (e.g., $sim 50%$ for small $epsilon$ and $delta$). We also draw similar findings on the text generation tasks on GPT-2. Finally, to our best knowledge, LMO-DP is also the first solution to accurately fine-tune Llama-2 with strong differential privacy guarantees. The code will be released soon and available upon request.

5/30/2024

cs.CR cs.CL cs.LG

🛸

Synthetic Query Generation for Privacy-Preserving Deep Retrieval Systems using Differentially Private Language Models

Aldo Gael Carranza, Rezsa Farahani, Natalia Ponomareva, Alex Kurakin, Matthew Jagielski, Milad Nasr

We address the challenge of ensuring differential privacy (DP) guarantees in training deep retrieval systems. Training these systems often involves the use of contrastive-style losses, which are typically non-per-example decomposable, making them difficult to directly DP-train with since common techniques require per-example gradients. To address this issue, we propose an approach that prioritizes ensuring query privacy prior to training a deep retrieval system. Our method employs DP language models (LMs) to generate private synthetic queries representative of the original data. These synthetic queries can be used in downstream retrieval system training without compromising privacy. Our approach demonstrates a significant enhancement in retrieval quality compared to direct DP-training, all while maintaining query-level privacy guarantees. This work highlights the potential of harnessing LMs to overcome limitations in standard DP-training methods.

5/24/2024

cs.CL cs.CR cs.IR

💬

PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models

Haoran Li, Dadi Guo, Donghao Li, Wei Fan, Qi Hu, Xin Liu, Chunkit Chan, Duanyi Yao, Yuan Yao, Yangqiu Song

The rapid development of language models (LMs) brings unprecedented accessibility and usage for both models and users. On the one hand, powerful LMs achieve state-of-the-art performance over numerous downstream NLP tasks. On the other hand, more and more attention is paid to unrestricted model accesses that may bring malicious privacy risks of data leakage. To address these issues, many recent works propose privacy-preserving language models (PPLMs) with differential privacy (DP). Unfortunately, different DP implementations make it challenging for a fair comparison among existing PPLMs. In this paper, we present PrivLM-Bench, a multi-perspective privacy evaluation benchmark to empirically and intuitively quantify the privacy leakage of LMs. Instead of only reporting DP parameters, PrivLM-Bench sheds light on the neglected inference data privacy during actual usage. PrivLM-Bench first clearly defines multi-faceted privacy objectives. Then, PrivLM-Bench constructs a unified pipeline to perform private fine-tuning. Lastly, PrivLM-Bench performs existing privacy attacks on LMs with pre-defined privacy objectives as the empirical evaluation results. The empirical attack results are used to fairly and intuitively evaluate the privacy leakage of various PPLMs. We conduct extensive experiments on three datasets of GLUE for mainstream LMs.

6/4/2024

cs.CL cs.CR