BadSampler: Harnessing the Power of Catastrophic Forgetting to Poison Byzantine-robust Federated Learning

Read original: arXiv:2406.12222 - Published 6/19/2024 by Yi Liu, Cong Wang, Xingliang Yuan

BadSampler: Harnessing the Power of Catastrophic Forgetting to Poison Byzantine-robust Federated Learning

Overview

This paper introduces a novel data poisoning attack called BadSampler that can effectively bypass Byzantine-robust aggregation methods in federated learning.
BadSampler leverages the phenomenon of catastrophic forgetting to craft malicious updates that are indistinguishable from clean updates, making them difficult to detect.
The attack is shown to be effective against state-of-the-art Byzantine-robust defenses, highlighting the need for further research into more robust federated learning systems.

Plain English Explanation

In the world of machine learning, federated learning has emerged as a powerful approach that allows multiple devices or organizations to collectively train a shared model without directly sharing their private data. This is especially useful in scenarios where data privacy is a concern, such as healthcare or finance.

However, this decentralized training process also introduces new vulnerabilities. One major threat is data poisoning attacks, where malicious actors try to inject corrupted data into the training process to undermine the model's performance. Relevance: Byzantine-robust Optimization Against Data Poisoning

The paper introduces a new type of data poisoning attack called BadSampler that can bypass existing defenses, even those designed to be robust against Byzantine (malicious) clients. Relevance: Byzantine-robust Decentralized Federated Learning

The key insight behind BadSampler is the concept of "catastrophic forgetting" - the tendency of neural networks to quickly forget previously learned information when trained on new tasks. By exploiting this phenomenon, the attacker can craft malicious updates that appear similar to clean updates, making them hard to detect. Relevance: GAN-based Data Poisoning Attack Against Federated

The paper demonstrates the effectiveness of BadSampler against state-of-the-art Byzantine-robust defenses, highlighting the need for further research into more robust federated learning systems. Relevance: Aggressive or Imperceptible or Both? Network Pruning, Relevance: Precision-Guided Approach to Mitigate Data Poisoning

Technical Explanation

The paper presents a novel data poisoning attack called BadSampler that can effectively bypass Byzantine-robust aggregation methods in federated learning. The key idea behind BadSampler is to leverage the phenomenon of catastrophic forgetting in neural networks to craft malicious updates that are indistinguishable from clean updates.

The authors first provide a formal definition of the federated learning setting and the threat model, where a subset of participating clients are assumed to be malicious and attempt to poison the global model. They then introduce the BadSampler attack, which involves three main steps:

Crafting Malicious Updates: The attacker first trains a "shadow model" on a small set of clean data to learn the target task. They then fine-tune this model on a carefully crafted dataset designed to cause catastrophic forgetting of the original task. The resulting malicious updates are designed to appear similar to clean updates, making them hard to detect.
Targeted Poisoning: The attacker strategically selects a subset of clients to participate in the federated learning process and sends the malicious updates from these clients during the aggregation step. By targeting a small number of clients, the attacker can maximize the impact of the poisoning while minimizing the risk of detection.
Bypassing Byzantine-robust Defenses: The authors show that existing Byzantine-robust aggregation methods, such as Krum and Bulyan, are ineffective against the BadSampler attack. This is because the malicious updates are crafted to be indistinguishable from clean updates, even though they are highly damaging to the global model's performance.

The paper presents extensive experimental results demonstrating the effectiveness of the BadSampler attack against various benchmark datasets and model architectures. The attack is shown to significantly degrade the global model's performance, even when the number of malicious clients is small.

Critical Analysis

The BadSampler attack highlights a concerning vulnerability in existing Byzantine-robust defenses for federated learning. While these defenses have been shown to be effective against traditional data poisoning attacks, the authors demonstrate that they are unable to detect and mitigate the more sophisticated BadSampler attack.

One key limitation of the paper is that it assumes the attacker has access to a small amount of clean data to train the "shadow model." In real-world scenarios, this may not always be the case, and the attacker may need to find other ways to obtain or generate the necessary data.

Additionally, the authors do not discuss potential countermeasures or defense strategies that could be employed to mitigate the BadSampler attack. Further research is needed to explore more robust federated learning architectures and aggregation methods that can effectively detect and defend against such advanced data poisoning attacks.

Conclusion

The BadSampler attack presented in this paper poses a significant threat to the security and reliability of federated learning systems. By leveraging the phenomenon of catastrophic forgetting, the attacker can craft malicious updates that are indistinguishable from clean updates, allowing them to bypass existing Byzantine-robust defenses.

This research highlights the need for continued innovation in the field of federated learning, with a focus on developing more robust and secure aggregation methods that can reliably detect and mitigate advanced data poisoning attacks. As federated learning continues to gain traction in various domains, ensuring the integrity and trustworthiness of the shared model is of utmost importance.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

BadSampler: Harnessing the Power of Catastrophic Forgetting to Poison Byzantine-robust Federated Learning

Yi Liu, Cong Wang, Xingliang Yuan

Federated Learning (FL) is susceptible to poisoning attacks, wherein compromised clients manipulate the global model by modifying local datasets or sending manipulated model updates. Experienced defenders can readily detect and mitigate the poisoning effects of malicious behaviors using Byzantine-robust aggregation rules. However, the exploration of poisoning attacks in scenarios where such behaviors are absent remains largely unexplored for Byzantine-robust FL. This paper addresses the challenging problem of poisoning Byzantine-robust FL by introducing catastrophic forgetting. To fill this gap, we first formally define generalization error and establish its connection to catastrophic forgetting, paving the way for the development of a clean-label data poisoning attack named BadSampler. This attack leverages only clean-label data (i.e., without poisoned data) to poison Byzantine-robust FL and requires the adversary to selectively sample training data with high loss to feed model training and maximize the model's generalization error. We formulate the attack as an optimization problem and present two elegant adversarial sampling strategies, Top-$kappa$ sampling, and meta-sampling, to approximately solve it. Additionally, our formal error upper bound and time complexity analysis demonstrate that our design can preserve attack utility with high efficiency. Extensive evaluations on two real-world datasets illustrate the effectiveness and performance of our proposed attacks.

6/19/2024

👀

Advancing Hybrid Defense for Byzantine Attacks in Federated Learning

Kai Yue, Richeng Jin, Chau-Wai Wong, Huaiyu Dai

Federated learning (FL) enables multiple clients to collaboratively train a global model without sharing their local data. Recent studies have highlighted the vulnerability of FL to Byzantine attacks, where malicious clients send poisoned updates to degrade model performance. Notably, many attacks have been developed targeting specific aggregation rules, whereas various defense mechanisms have been designed for dedicated threat models. This paper studies the resilience of an attack-agnostic FL scenario, where the server lacks prior knowledge of both the attackers' strategies and the number of malicious clients involved. We first introduce a hybrid defense against state-of-the-art attacks. Our goal is to identify a general-purpose aggregation rule that performs well on average while also avoiding worst-case vulnerabilities. By adaptively selecting from available defenses, we demonstrate that the server remains robust even when confronted with a substantial proportion of poisoned updates. To better understand this resilience, we then assess the attackers' capability using a proxy called client heterogeneity. We also emphasize that the existing FL defenses should not be regarded as secure, as demonstrated through the newly proposed Trapsetter attack. The proposed attack outperforms other state-of-the-art attacks by further reducing the model test accuracy by 8-10%. Our findings highlight the ongoing need for the development of Byzantine-resilient aggregation algorithms in FL.

9/11/2024

🛠️

On the Relevance of Byzantine Robust Optimization Against Data Poisoning

Sadegh Farhadkhani, Rachid Guerraoui, Nirupam Gupta, Rafael Pinot

The success of machine learning (ML) has been intimately linked with the availability of large amounts of data, typically collected from heterogeneous sources and processed on vast networks of computing devices (also called {em workers}). Beyond accuracy, the use of ML in critical domains such as healthcare and autonomous driving calls for robustness against {em data poisoning}and some {em faulty workers}. The problem of {em Byzantine ML} formalizes these robustness issues by considering a distributed ML environment in which workers (storing a portion of the global dataset) can deviate arbitrarily from the prescribed algorithm. Although the problem has attracted a lot of attention from a theoretical point of view, its practical importance for addressing realistic faults (where the behavior of any worker is locally constrained) remains unclear. It has been argued that the seemingly weaker threat model where only workers' local datasets get poisoned is more reasonable. We prove that, while tolerating a wider range of faulty behaviors, Byzantine ML yields solutions that are, in a precise sense, optimal even under the weaker data poisoning threat model. Then, we study a generic data poisoning model wherein some workers have {em fully-poisonous local data}, i.e., their datasets are entirely corruptible, and the remainders have {em partially-poisonous local data}, i.e., only a fraction of their local datasets is corruptible. We prove that Byzantine-robust schemes yield optimal solutions against both these forms of data poisoning, and that the former is more harmful when workers have {em heterogeneous} local data.

5/2/2024

Byzantine-Robust Decentralized Federated Learning

Minghong Fang, Zifan Zhang, Hairi, Prashant Khanduri, Jia Liu, Songtao Lu, Yuchen Liu, Neil Gong

Federated learning (FL) enables multiple clients to collaboratively train machine learning models without revealing their private training data. In conventional FL, the system follows the server-assisted architecture (server-assisted FL), where the training process is coordinated by a central server. However, the server-assisted FL framework suffers from poor scalability due to a communication bottleneck at the server, and trust dependency issues. To address challenges, decentralized federated learning (DFL) architecture has been proposed to allow clients to train models collaboratively in a serverless and peer-to-peer manner. However, due to its fully decentralized nature, DFL is highly vulnerable to poisoning attacks, where malicious clients could manipulate the system by sending carefully-crafted local models to their neighboring clients. To date, only a limited number of Byzantine-robust DFL methods have been proposed, most of which are either communication-inefficient or remain vulnerable to advanced poisoning attacks. In this paper, we propose a new algorithm called BALANCE (Byzantine-robust averaging through local similarity in decentralization) to defend against poisoning attacks in DFL. In BALANCE, each client leverages its own local model as a similarity reference to determine if the received model is malicious or benign. We establish the theoretical convergence guarantee for BALANCE under poisoning attacks in both strongly convex and non-convex settings. Furthermore, the convergence rate of BALANCE under poisoning attacks matches those of the state-of-the-art counterparts in Byzantine-free settings. Extensive experiments also demonstrate that BALANCE outperforms existing DFL methods and effectively defends against poisoning attacks.

7/16/2024