Rethinking Graph Backdoor Attacks: A Distribution-Preserving Perspective

2405.10757

Published 6/24/2024 by Zhiwei Zhang, Minhua Lin, Enyan Dai, Suhang Wang

Rethinking Graph Backdoor Attacks: A Distribution-Preserving Perspective

Abstract

Graph Neural Networks (GNNs) have shown remarkable performance in various tasks. However, recent works reveal that GNNs are vulnerable to backdoor attacks. Generally, backdoor attack poisons the graph by attaching backdoor triggers and the target class label to a set of nodes in the training graph. A GNN trained on the poisoned graph will then be misled to predict test nodes attached with trigger to the target class. Despite their effectiveness, our empirical analysis shows that triggers generated by existing methods tend to be out-of-distribution (OOD), which significantly differ from the clean data. Hence, these injected triggers can be easily detected and pruned with widely used outlier detection methods in real-world applications. Therefore, in this paper, we study a novel problem of unnoticeable graph backdoor attacks with in-distribution (ID) triggers. To generate ID triggers, we introduce an OOD detector in conjunction with an adversarial learning strategy to generate the attributes of the triggers within distribution. To ensure a high attack success rate with ID triggers, we introduce novel modules designed to enhance trigger memorization by the victim model trained on poisoned graph. Extensive experiments on real-world datasets demonstrate the effectiveness of the proposed method in generating in distribution triggers that can by-pass various defense strategies while maintaining a high attack success rate.

Create account to get full access

Overview

This paper rethinks the approach to graph backdoor attacks, proposing a distribution-preserving perspective that aims to maintain the original graph structure and data distribution.
Backdoor attacks on graph neural networks (GNNs) have emerged as a significant security concern, as they can allow attackers to manipulate the model's behavior without being detected.
The authors introduce a new attack method called Clean Graph Backdoor Attack that aims to preserve the original graph distribution and structure, making the attack harder to detect.

Plain English Explanation

The paper looks at a type of security vulnerability in graph neural networks (GNNs) called backdoor attacks. In a backdoor attack, an attacker can secretly alter a GNN model to behave in a certain way, even if the model appears to be working correctly on the surface.

The researchers propose a new approach to these backdoor attacks that tries to keep the original structure and data distribution of the graph intact. This makes the backdoor attack harder to detect, as the modified graph will still look similar to the original.

The key idea is to carefully craft the backdoor trigger in a way that preserves the overall characteristics of the graph, rather than making obvious changes that would raise suspicion. This "distribution-preserving" approach is a new perspective on how to make backdoor attacks more stealthy and effective.

Technical Explanation

The paper introduces a new backdoor attack method called the Clean Graph Backdoor Attack. Unlike previous graph backdoor attacks that significantly alter the graph structure, this approach aims to preserve the original graph distribution and topology.

The authors develop a optimization-based framework to craft the backdoor trigger while constraining the changes to the graph. This ensures the modified graph remains statistically similar to the original, making the attack harder to detect.

Experiments on real-world graph datasets show the Clean Graph Backdoor Attack can achieve high attack success rates while leaving the graph structure and distribution largely intact. This demonstrates the effectiveness of their distribution-preserving perspective on graph backdoor attacks.

The paper also discusses connections to other related attack methods, such as Efficient Backdoor Attacks on Deep Neural Networks, Poisoning-based Backdoor Attacks with Arbitrary Target Label, and Concealing Backdoor Model Updates in Federated Learning.

Critical Analysis

The key contribution of this paper is the distribution-preserving perspective on graph backdoor attacks, which represents a more realistic and stealthy attack vector compared to previous methods. By preserving the graph structure and data distribution, the proposed attack is harder to detect through manual inspection or statistical anomaly detection.

However, the paper does not fully address the potential countermeasures that could be developed to detect or mitigate such "clean" backdoor attacks. The authors mention the need for "more advanced defense mechanisms" but do not provide details on what those might entail.

Additionally, the experimental evaluation is limited to relatively small-scale graph datasets. Applying the Clean Graph Backdoor Attack to larger, more complex real-world graphs would be an important next step to fully assess its practical implications and challenges.

Overall, this paper represents a significant advancement in the understanding of graph backdoor attacks, but further research is needed to develop comprehensive defenses and explore the scalability of the proposed attack method.

Conclusion

This paper introduces a new perspective on graph backdoor attacks, proposing a distribution-preserving approach that aims to maintain the original graph structure and data distribution. By carefully crafting the backdoor trigger, the Clean Graph Backdoor Attack can achieve high attack success rates while remaining stealthy and difficult to detect.

The distribution-preserving perspective represents an important advancement in the understanding of graph-based security vulnerabilities, and highlights the need for more sophisticated defense mechanisms to combat these types of "clean" backdoor attacks. As GNNs become more widely deployed in critical applications, this research provides valuable insights for improving the robustness and trustworthiness of these models.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Robustness-Inspired Defense Against Backdoor Attacks on Graph Neural Networks

Zhiwei Zhang, Minhua Lin, Junjie Xu, Zongyu Wu, Enyan Dai, Suhang Wang

Graph Neural Networks (GNNs) have achieved promising results in tasks such as node classification and graph classification. However, recent studies reveal that GNNs are vulnerable to backdoor attacks, posing a significant threat to their real-world adoption. Despite initial efforts to defend against specific graph backdoor attacks, there is no work on defending against various types of backdoor attacks where generated triggers have different properties. Hence, we first empirically verify that prediction variance under edge dropping is a crucial indicator for identifying poisoned nodes. With this observation, we propose using random edge dropping to detect backdoors and theoretically show that it can efficiently distinguish poisoned nodes from clean ones. Furthermore, we introduce a novel robust training strategy to efficiently counteract the impact of the triggers. Extensive experiments on real-world datasets show that our framework can effectively identify poisoned nodes, significantly degrade the attack success rate, and maintain clean accuracy when defending against various types of graph backdoor attacks with different properties.

6/17/2024

cs.LG cs.CR

Graph Neural Backdoor: Fundamentals, Methodologies, Applications, and Future Directions

Xiao Yang, Gaolei Li, Jianhua Li

Graph Neural Networks (GNNs) have significantly advanced various downstream graph-relevant tasks, encompassing recommender systems, molecular structure prediction, social media analysis, etc. Despite the boosts of GNN, recent research has empirically demonstrated its potential vulnerability to backdoor attacks, wherein adversaries employ triggers to poison input samples, inducing GNN to adversary-premeditated malicious outputs. This is typically due to the controlled training process, or the deployment of untrusted models, such as delegating model training to third-party service, leveraging external training sets, and employing pre-trained models from online sources. Although there's an ongoing increase in research on GNN backdoors, comprehensive investigation into this field is lacking. To bridge this gap, we propose the first survey dedicated to GNN backdoors. We begin by outlining the fundamental definition of GNN, followed by the detailed summarization and categorization of current GNN backdoor attacks and defenses based on their technical characteristics and application scenarios. Subsequently, the analysis of the applicability and use cases of GNN backdoors is undertaken. Finally, the exploration of potential research directions of GNN backdoors is presented. This survey aims to explore the principles of graph backdoors, provide insights to defenders, and promote future security research.

6/18/2024

cs.LG cs.AI cs.CR

A Clean-graph Backdoor Attack against Graph Convolutional Networks with Poisoned Label Only

Jiazhu Dai, Haoyu Sun

Graph Convolutional Networks (GCNs) have shown excellent performance in dealing with various graph structures such as node classification, graph classification and other tasks. However,recent studies have shown that GCNs are vulnerable to a novel threat known as backdoor attacks. However, all existing backdoor attacks in the graph domain require modifying the training samples to accomplish the backdoor injection, which may not be practical in many realistic scenarios where adversaries have no access to modify the training samples and may leads to the backdoor attack being detected easily. In order to explore the backdoor vulnerability of GCNs and create a more practical and stealthy backdoor attack method, this paper proposes a clean-graph backdoor attack against GCNs (CBAG) in the node classification task,which only poisons the training labels without any modification to the training samples, revealing that GCNs have this security vulnerability. Specifically, CBAG designs a new trigger exploration method to find important feature dimensions as the trigger patterns to improve the attack performance. By poisoning the training labels, a hidden backdoor is injected into the GCNs model. Experimental results show that our clean graph backdoor can achieve 99% attack success rate while maintaining the functionality of the GCNs model on benign samples.

4/22/2024

cs.AI

Efficient Backdoor Attacks for Deep Neural Networks in Real-world Scenarios

Ziqiang Li, Hong Sun, Pengfei Xia, Heng Li, Beihao Xia, Yi Wu, Bin Li

Recent deep neural networks (DNNs) have came to rely on vast amounts of training data, providing an opportunity for malicious attackers to exploit and contaminate the data to carry out backdoor attacks. However, existing backdoor attack methods make unrealistic assumptions, assuming that all training data comes from a single source and that attackers have full access to the training data. In this paper, we introduce a more realistic attack scenario where victims collect data from multiple sources, and attackers cannot access the complete training data. We refer to this scenario as data-constrained backdoor attacks. In such cases, previous attack methods suffer from severe efficiency degradation due to the entanglement between benign and poisoning features during the backdoor injection process. To tackle this problem, we introduce three CLIP-based technologies from two distinct streams: Clean Feature Suppression and Poisoning Feature Augmentation.effective solution for data-constrained backdoor attacks. The results demonstrate remarkable improvements, with some settings achieving over 100% improvement compared to existing attacks in data-constrained scenarios. Code is available at https://github.com/sunh1113/Efficient-backdoor-attacks-for-deep-neural-networks-in-real-world-scenarios

4/22/2024

cs.CR cs.CV