Let's Focus: Focused Backdoor Attack against Federated Transfer Learning

Read original: arXiv:2404.19420 - Published 5/1/2024 by Marco Arazzi, Stefanos Koffas, Antonino Nocera, Stjepan Picek

Let's Focus: Focused Backdoor Attack against Federated Transfer Learning

Overview

• This paper introduces a new type of backdoor attack, called a "Focused Backdoor Attack," against federated transfer learning systems.

• Federated transfer learning allows machine learning models to be trained across multiple devices or organizations without sharing the underlying data. However, this approach can be vulnerable to backdoor attacks, where malicious actors inject hidden triggers into the model during training.

• The researchers propose a novel "Focused Backdoor Attack" that strategically targets specific features or components of the model, rather than indiscriminately poisoning the entire training data.

• The attack aims to degrade the model's performance on specific tasks or inputs while maintaining high overall accuracy, making it harder to detect.

Plain English Explanation

The paper describes a new way for hackers to secretly sabotage machine learning models that are trained across multiple devices or organizations, a process called federated transfer learning.

Normally, these federated models are trained without sharing the underlying data, which protects people's privacy. However, the researchers found a clever way for attackers to sneak in "backdoors" that can make the model perform poorly on certain tasks or inputs, while still maintaining high overall accuracy.

Instead of just randomly poisoning the training data, the attackers focus their backdoor on specific features or components of the model. This makes the attack harder to detect, since the model may still work well for most users, but can be sabotaged in targeted ways.

The paper demonstrates how this "Focused Backdoor Attack" can be used to degrade a model's performance on certain tasks, while leaving its overall performance intact. This is a concerning development, as it shows how federated learning systems may be vulnerable to sophisticated, targeted attacks.

Technical Explanation

The researchers propose a new type of backdoor attack called a "Focused Backdoor Attack" against federated transfer learning systems. In federated learning, machine learning models are trained across multiple devices or organizations without directly sharing the underlying data. This protects user privacy, but can make the models vulnerable to backdoor attacks, where malicious actors inject hidden triggers into the model during training.

The Focused Backdoor Attack strategically targets specific features or components of the model, rather than indiscriminately poisoning the entire training data. This allows the attack to degrade the model's performance on particular tasks or inputs, while maintaining high overall accuracy. The researchers demonstrate this attack on a text classification task, where they are able to make the model misclassify certain politically sensitive phrases while retaining high accuracy on most inputs.

The key innovation is that the Focused Backdoor Attack selectively poisons the model, rather than broadly contaminating the training data. This makes the attack harder to detect, since the model may still perform well for most users. The researchers explore different techniques for identifying the most critical model components to target, and show how the attack can be carried out effectively in a federated learning setting.

Critical Analysis

The paper presents a concerning new type of attack that highlights the potential vulnerabilities of federated transfer learning systems. By strategically targeting specific model components, the Focused Backdoor Attack can degrade performance on particular tasks or inputs in a hard-to-detect way.

However, the paper does not fully address how this attack could be mitigated in practice. The authors suggest potential defenses like anomaly detection or model verification, but do not provide a comprehensive solution. Federated learning systems will need to develop robust safeguards against these types of targeted, evasive attacks.

Additionally, the paper focuses on a specific text classification task and does not explore the attack's broader applicability. Further research is needed to understand how the Focused Backdoor Attack could manifest in other machine learning domains and use cases.

Overall, this paper makes an important contribution by exposing a new type of backdoor vulnerability in federated learning. It serves as a timely warning for the machine learning community to prioritize the security and robustness of these distributed training systems.

Conclusion

The "Focused Backdoor Attack" introduced in this paper represents a concerning development in the ongoing battle against backdoor vulnerabilities in machine learning. By strategically targeting specific model components, attackers can degrade a federated learning system's performance on particular tasks or inputs, while maintaining high overall accuracy to avoid detection.

This attack highlights the need for federated learning systems to develop robust safeguards against sophisticated, evasive attacks. As machine learning becomes more widely deployed in high-stakes applications, ensuring the security and integrity of these systems will be of paramount importance. The insights from this paper can help drive further research and progress in this critical area.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Let's Focus: Focused Backdoor Attack against Federated Transfer Learning

Marco Arazzi, Stefanos Koffas, Antonino Nocera, Stjepan Picek

Federated Transfer Learning (FTL) is the most general variation of Federated Learning. According to this distributed paradigm, a feature learning pre-step is commonly carried out by only one party, typically the server, on publicly shared data. After that, the Federated Learning phase takes place to train a classifier collaboratively using the learned feature extractor. Each involved client contributes by locally training only the classification layers on a private training set. The peculiarity of an FTL scenario makes it hard to understand whether poisoning attacks can be developed to craft an effective backdoor. State-of-the-art attack strategies assume the possibility of shifting the model attention toward relevant features introduced by a forged trigger injected in the input data by some untrusted clients. Of course, this is not feasible in FTL, as the learned features are fixed once the server performs the pre-training step. Consequently, in this paper, we investigate this intriguing Federated Learning scenario to identify and exploit a vulnerability obtained by combining eXplainable AI (XAI) and dataset distillation. In particular, the proposed attack can be carried out by one of the clients during the Federated Learning phase of FTL by identifying the optimal local for the trigger through XAI and encapsulating compressed information of the backdoor class. Due to its behavior, we refer to our approach as a focused backdoor approach (FB-FTL for short) and test its performance by explicitly referencing an image classification scenario. With an average 80% attack success rate, obtained results show the effectiveness of our attack also against existing defenses for Federated Learning.

5/1/2024

Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning

Xiaoting Lyu, Yufei Han, Wei Wang, Jingkai Liu, Yongsheng Zhu, Guangquan Xu, Jiqiang Liu, Xiangliang Zhang

Federated Learning (FL) is a collaborative machine learning technique where multiple clients work together with a central server to train a global model without sharing their private data. However, the distribution shift across non-IID datasets of clients poses a challenge to this one-model-fits-all method hindering the ability of the global model to effectively adapt to each client's unique local data. To echo this challenge, personalized FL (PFL) is designed to allow each client to create personalized local models tailored to their private data. While extensive research has scrutinized backdoor risks in FL, it has remained underexplored in PFL applications. In this study, we delve deep into the vulnerabilities of PFL to backdoor attacks. Our analysis showcases a tale of two cities. On the one hand, the personalization process in PFL can dilute the backdoor poisoning effects injected into the personalized local models. Furthermore, PFL systems can also deploy both server-end and client-end defense mechanisms to strengthen the barrier against backdoor attacks. On the other hand, our study shows that PFL fortified with these defense methods may offer a false sense of security. We propose textit{PFedBA}, a stealthy and effective backdoor attack strategy applicable to PFL systems. textit{PFedBA} ingeniously aligns the backdoor learning task with the main learning task of PFL by optimizing the trigger generation process. Our comprehensive experiments demonstrate the effectiveness of textit{PFedBA} in seamlessly embedding triggers into personalized local models. textit{PFedBA} yields outstanding attack performance across 10 state-of-the-art PFL algorithms, defeating the existing 6 defense mechanisms. Our study sheds light on the subtle yet potent backdoor threats to PFL systems, urging the community to bolster defenses against emerging backdoor challenges.

6/11/2024

Non-Cooperative Backdoor Attacks in Federated Learning: A New Threat Landscape

Tuan Nguyen, Dung Thuy Nguyen, Khoa D Doan, Kok-Seng Wong

Despite the promise of Federated Learning (FL) for privacy-preserving model training on distributed data, it remains susceptible to backdoor attacks. These attacks manipulate models by embedding triggers (specific input patterns) in the training data, forcing misclassification as predefined classes during deployment. Traditional single-trigger attacks and recent work on cooperative multiple-trigger attacks, where clients collaborate, highlight limitations in attack realism due to coordination requirements. We investigate a more alarming scenario: non-cooperative multiple-trigger attacks. Here, independent adversaries introduce distinct triggers targeting unique classes. These parallel attacks exploit FL's decentralized nature, making detection difficult. Our experiments demonstrate the alarming vulnerability of FL to such attacks, where individual backdoors can be successfully learned without impacting the main task. This research emphasizes the critical need for robust defenses against diverse backdoor attacks in the evolving FL landscape. While our focus is on empirical analysis, we believe it can guide backdoor research toward more realistic settings, highlighting the crucial role of FL in building robust defenses against diverse backdoor threats. The code is available at url{https://anonymous.4open.science/r/nba-980F/}.

7/12/2024

Dual Model Replacement:invisible Multi-target Backdoor Attack based on Federal Learning

Rong Wang, Guichen Zhou, Mingjun Gao, Yunpeng Xiao

In recent years, the neural network backdoor hidden in the parameters of the federated learning model has been proved to have great security risks. Considering the characteristics of trigger generation, data poisoning and model training in backdoor attack, this paper designs a backdoor attack method based on federated learning. Firstly, aiming at the concealment of the backdoor trigger, a TrojanGan steganography model with encoder-decoder structure is designed. The model can encode specific attack information as invisible noise and attach it to the image as a backdoor trigger, which improves the concealment and data transformations of the backdoor trigger.Secondly, aiming at the problem of single backdoor trigger mode, an image poisoning attack method called combination trigger attack is proposed. This method realizes multi-backdoor triggering by multiplexing combined triggers and improves the robustness of backdoor attacks. Finally, aiming at the problem that the local training mechanism leads to the decrease of the success rate of backdoor attack, a dual model replacement backdoor attack algorithm based on federated learning is designed. This method can improve the success rate of backdoor attack while maintaining the performance of the federated learning aggregation model. Experiments show that the attack strategy in this paper can not only achieve high backdoor concealment and diversification of trigger forms under federated learning, but also achieve good attack success rate in multi-target attacks.door concealment and diversification of trigger forms but also achieve good results in multi-target attacks.

4/23/2024