Privacy Amplification for Matrix Mechanisms

Read original: arXiv:2310.15526 - Published 5/7/2024 by Christopher A. Choquette-Choo, Arun Ganesh, Thomas Steinke, Abhradeep Thakurta

🔎

Overview

The paper proposes a new algorithm called MMCC for analyzing privacy amplification via sampling in the context of the matrix mechanism, a technique used in newer differential privacy (DP) algorithms like DP-FTRL.
Previous work on privacy amplification focused on DP-SGD, but was not readily applicable to newer algorithms that use correlated noise rather than independent noise.
MMCC is nearly tight, meaning it closely approaches a lower bound on privacy loss as the privacy parameter (ε) approaches 0.
The paper also introduces a conditional composition theorem that allows analyzing correlated outputs as if they were independent, which has broader utility.
The proposed amplification algorithm leads to significant improvements in the privacy-utility trade-offs for DP-FTRL algorithms on standard benchmarks.

Plain English Explanation

Differential privacy (DP) is a technique used to protect the privacy of individuals in data-driven applications, like machine learning. DP-SGD is a popular DP algorithm that adds random noise to the data to obscure individual contributions and provide privacy guarantees.

However, newer DP algorithms, like DP-FTRL, use a different approach called the "matrix mechanism" to add correlated noise instead of independent noise. This makes the previous work on privacy amplification (which exploits randomness in data selection) not directly applicable.

To address this, the researchers propose a new algorithm called MMCC that can analyze privacy amplification for any generic matrix mechanism. MMCC is designed to be "nearly tight," meaning it closely matches the theoretical lower bound on privacy loss as the privacy parameter gets smaller.

The key innovation is a "conditional composition theorem" that allows the researchers to analyze the correlated outputs of the matrix mechanism as if they were independent, by conditioning them on the previous outputs. This theorem has broader applications beyond the specific algorithm.

By using MMCC, the researchers show that the noise added to a DP-FTRL algorithm called "binary-tree-DP-FTRL" can asymptotically match the noise added to DP-SGD with amplification. This leads to significant improvements in the privacy-utility trade-offs for DP-FTRL algorithms on standard benchmarks.

Technical Explanation

The paper proposes the MMCC algorithm, which is the first to analyze privacy amplification via sampling for any generic matrix mechanism. The matrix mechanism is used in newer differential privacy (DP) algorithms, such as DP-FTRL, to add correlated noise instead of the independent noise used in DP-SGD.

The key technical contribution is a conditional composition theorem that allows the researchers to analyze the correlated outputs of the matrix mechanism as if they were independent, by conditioning them on the previous outputs. This theorem has broader utility beyond the specific MMCC algorithm.

The MMCC algorithm is designed to be "nearly tight," meaning it closely approaches a lower bound on privacy loss as the privacy parameter (ε) approaches 0. This is an important property, as it ensures the algorithm provides strong privacy guarantees.

The researchers demonstrate the practical utility of MMCC by showing that it can be used to improve the privacy-utility trade-offs for DP-FTRL algorithms on standard benchmarks. Specifically, they show that the noise added to a DP-FTRL algorithm called "binary-tree-DP-FTRL" can asymptotically match the noise added to DP-SGD with amplification.

Critical Analysis

The paper presents a novel and technically sophisticated approach to analyzing privacy amplification for the matrix mechanism, which is a crucial component of newer DP algorithms like DP-FTRL. The conditional composition theorem introduced in the paper has broader applications beyond the specific MMCC algorithm.

However, the paper does not address the potential computational and memory overhead associated with the matrix mechanism, which can be a significant concern in practical applications. Additionally, the paper does not discuss the scalability of the MMCC algorithm to large-scale problems or its robustness to various data distributions and noise patterns.

Further research could explore the trade-offs between the privacy guarantees provided by MMCC and the computational and memory requirements of the algorithm, as well as its performance on a wider range of benchmarks and use cases. Additionally, it would be valuable to investigate the broader implications of the conditional composition theorem and its potential applications in other areas of differential privacy and machine learning.

Conclusion

The paper presents an important contribution to the field of differential privacy, proposing the MMCC algorithm as a way to analyze privacy amplification for the matrix mechanism used in newer DP algorithms like DP-FTRL. The key innovations include a conditional composition theorem that allows analyzing correlated outputs as independent, and the development of a nearly tight algorithm for privacy amplification.

This work has significant implications for improving the privacy-utility trade-offs in a wide range of data-driven applications, including machine learning, where differential privacy is becoming an increasingly important tool for protecting individual privacy. The insights and techniques introduced in this paper could also have broader applications in other areas of privacy-preserving computing and data analysis.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🔎

Privacy Amplification for Matrix Mechanisms

Christopher A. Choquette-Choo, Arun Ganesh, Thomas Steinke, Abhradeep Thakurta

Privacy amplification exploits randomness in data selection to provide tighter differential privacy (DP) guarantees. This analysis is key to DP-SGD's success in machine learning, but, is not readily applicable to the newer state-of-the-art algorithms. This is because these algorithms, known as DP-FTRL, use the matrix mechanism to add correlated noise instead of independent noise as in DP-SGD. In this paper, we propose MMCC, the first algorithm to analyze privacy amplification via sampling for any generic matrix mechanism. MMCC is nearly tight in that it approaches a lower bound as $epsilonto0$. To analyze correlated outputs in MMCC, we prove that they can be analyzed as if they were independent, by conditioning them on prior outputs. Our conditional composition theorem has broad utility: we use it to show that the noise added to binary-tree-DP-FTRL can asymptotically match the noise added to DP-SGD with amplification. Our amplification algorithm also has practical empirical utility: we show it leads to significant improvement in the privacy-utility trade-offs for DP-FTRL algorithms on standard benchmarks.

5/7/2024

🌐

Unified Mechanism-Specific Amplification by Subsampling and Group Privacy Amplification

Jan Schuchardt, Mihail Stoian, Arthur Kosmala, Stephan Gunnemann

Amplification by subsampling is one of the main primitives in machine learning with differential privacy (DP): Training a model on random batches instead of complete datasets results in stronger privacy. This is traditionally formalized via mechanism-agnostic subsampling guarantees that express the privacy parameters of a subsampled mechanism as a function of the original mechanism's privacy parameters. We propose the first general framework for deriving mechanism-specific guarantees, which leverage additional information beyond these parameters to more tightly characterize the subsampled mechanism's privacy. Such guarantees are of particular importance for privacy accounting, i.e., tracking privacy over multiple iterations. Overall, our framework based on conditional optimal transport lets us derive existing and novel guarantees for approximate DP, accounting with R'enyi DP, and accounting with dominating pairs in a unified, principled manner. As an application, we analyze how subsampling affects the privacy of groups of multiple users. Our tight mechanism-specific bounds outperform tight mechanism-agnostic bounds and classic group privacy results.

6/12/2024

Scaling up the Banded Matrix Factorization Mechanism for Differentially Private ML

Ryan McKenna

DP-BandMF offers a powerful approach to differentially private machine learning, balancing privacy amplification with noise correlation for optimal noise reduction. However, its scalability has been limited to settings where the number of training iterations is less than $10^4$. In this work, we present techniques that significantly extend DP-BandMF's reach, enabling use in settings with and over $10^6$ training iterations. Our enhanced implementation, coupled with extensive experiments, provides clear guidelines on selecting the optimal number of bands. These insights offer practitioners a deeper understanding of DP-BandMF's performance and how to maximize its utility for privacy-preserving machine learning.

5/28/2024

📉

Tractable MCMC for Private Learning with Pure and Gaussian Differential Privacy

Yingyu Lin, Yi-An Ma, Yu-Xiang Wang, Rachel Redberg, Zhiqi Bu

Posterior sampling, i.e., exponential mechanism to sample from the posterior distribution, provides $varepsilon$-pure differential privacy (DP) guarantees and does not suffer from potentially unbounded privacy breach introduced by $(varepsilon,delta)$-approximate DP. In practice, however, one needs to apply approximate sampling methods such as Markov chain Monte Carlo (MCMC), thus re-introducing the unappealing $delta$-approximation error into the privacy guarantees. To bridge this gap, we propose the Approximate SAample Perturbation (abbr. ASAP) algorithm which perturbs an MCMC sample with noise proportional to its Wasserstein-infinity ($W_infty$) distance from a reference distribution that satisfies pure DP or pure Gaussian DP (i.e., $delta=0$). We then leverage a Metropolis-Hastings algorithm to generate the sample and prove that the algorithm converges in $W_infty$ distance. We show that by combining our new techniques with a localization step, we obtain the first nearly linear-time algorithm that achieves the optimal rates in the DP-ERM problem with strongly convex and smooth losses.

5/2/2024