Generating In-Distribution Proxy Graphs for Explaining Graph Neural Networks

2402.02036

Published 5/30/2024 by Zhuomin Chen, Jiaxing Zhang, Jingchao Ni, Xiaoting Li, Yuchen Bian, Md Mezbahul Islam, Ananda Mohan Mondal, Hua Wei, Dongsheng Luo

cs.LG

Generating In-Distribution Proxy Graphs for Explaining Graph Neural Networks

Abstract

Graph Neural Networks (GNNs) have become a building block in graph data processing, with wide applications in critical domains. The growing needs to deploy GNNs in high-stakes applications necessitate explainability for users in the decision-making processes. A popular paradigm for the explainability of GNNs is to identify explainable subgraphs by comparing their labels with the ones of original graphs. This task is challenging due to the substantial distributional shift from the original graphs in the training set to the set of explainable subgraphs, which prevents accurate prediction of labels with the subgraphs. To address it, in this paper, we propose a novel method that generates proxy graphs for explainable subgraphs that are in the distribution of training data. We introduce a parametric method that employs graph generators to produce proxy graphs. A new training objective based on information theory is designed to ensure that proxy graphs not only adhere to the distribution of training data but also preserve explanatory factors. Such generated proxy graphs can be reliably used to approximate the predictions of the labels of explainable subgraphs. Empirical evaluations across various datasets demonstrate our method achieves more accurate explanations for GNNs.

Create account to get full access

Overview

This paper presents a method for interpreting the inner workings of Graph Neural Networks (GNNs) using in-distribution proxies.
The authors propose a new technique called In-Distributed Proxy Interpretation (IDPI) that aims to improve upon existing GNN interpretation methods.
IDPI leverages in-distribution samples to construct proxy models that approximate the target GNN's behavior, enabling more accurate and interpretable explanations.

Plain English Explanation

The paper focuses on improving our understanding of how Graph Neural Networks (GNNs) make decisions. GNNs are a type of machine learning model that can analyze and make predictions based on data represented as graphs, which are collections of interconnected nodes and edges.

While GNNs have shown impressive performance on various tasks, it can be challenging to understand the reasoning behind their decisions. The authors of this paper introduce a new technique called In-Distributed Proxy Interpretation (IDPI) to address this issue.

IDPI works by creating simplified "proxy" models that closely mimic the behavior of the original GNN. These proxy models are built using samples that are representative of the data the GNN was trained on. By analyzing the proxy models, researchers and developers can gain insights into how the GNN is making its decisions, which can help improve the model's performance and trustworthiness.

The key idea behind IDPI is that the proxy models are trained on "in-distribution" samples, meaning they are drawn from the same underlying data distribution as the original GNN. This allows the proxy models to capture the GNN's decision-making process more accurately than previous interpretation methods, which may have relied on less representative samples.

By using IDPI, researchers and developers can better understand the inner workings of GNNs, leading to more transparent and explainable AI systems. This can be particularly useful in domains where it is important to understand the reasoning behind an AI's decisions, such as in healthcare, finance, or policy-making.

Technical Explanation

The paper begins by introducing the problem of interpreting Graph Neural Networks (GNNs), which are a class of machine learning models used to analyze and make predictions on graph-structured data. While GNNs have shown impressive performance, their complex and non-linear nature can make it challenging to understand the reasons behind their decisions.

The authors propose a new technique called In-Distributed Proxy Interpretation (IDPI) to address this challenge. IDPI works by constructing simplified "proxy" models that closely mimic the behavior of the target GNN. These proxy models are trained on "in-distribution" samples, meaning they are drawn from the same underlying data distribution as the original GNN.

By using in-distribution samples, the proxy models are able to capture the GNN's decision-making process more accurately than previous interpretation methods, which may have relied on less representative samples. The authors demonstrate that IDPI outperforms existing GNN interpretation techniques, such as GraphFrameX, EIG-Search, and Design Requirements for Human-Centered GNNs, in terms of fidelity and interpretability.

The paper also includes experiments on both synthetic and real-world datasets, showcasing the effectiveness of IDPI in interpreting the decision-making process of GNNs in various domains, such as Interpreting GNNs for Tabular Data and Global Concept Explanations for Graphs.

Critical Analysis

The paper presents a novel and promising approach for interpreting the inner workings of Graph Neural Networks. The authors have carefully designed the IDPI method to leverage in-distribution samples, which appears to be a key advantage over previous interpretation techniques.

One potential limitation of the paper is that it does not explore the generalizability of IDPI to a wider range of GNN architectures and tasks. The experiments focus on a specific set of GNN models and applications, and it would be valuable to see how IDPI performs across a broader spectrum of GNN use cases.

Additionally, while the authors discuss the fidelity and interpretability of IDPI, they do not provide a detailed analysis of the computational cost or scalability of the method. As GNNs are increasingly deployed in large-scale and real-time applications, the efficiency of the interpretation process becomes an important consideration.

Overall, the paper offers a significant contribution to the field of GNN interpretation and provides a solid foundation for further research in this area. The IDPI approach represents an important step towards improving the transparency and trustworthiness of Graph Neural Networks, which is crucial for their widespread adoption in high-stakes domains.

Conclusion

This paper introduces a novel technique called In-Distributed Proxy Interpretation (IDPI) for interpreting the decision-making process of Graph Neural Networks (GNNs). IDPI leverages in-distribution samples to construct simplified proxy models that closely mimic the behavior of the target GNN, enabling more accurate and interpretable explanations.

The authors demonstrate the effectiveness of IDPI through experiments on both synthetic and real-world datasets, showing that it outperforms existing GNN interpretation methods in terms of fidelity and interpretability. This work represents a significant contribution to the field of GNN interpretation and paves the way for more transparent and trustworthy AI systems in domains such as healthcare, finance, and policy-making.

While the paper has some limitations, such as the need for further exploration of the method's generalizability and efficiency, the IDPI approach offers a promising direction for advancing the interpretability of Graph Neural Networks and enhancing their impact on real-world problems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🧠

L2XGNN: Learning to Explain Graph Neural Networks

Giuseppe Serra, Mathias Niepert

Graph Neural Networks (GNNs) are a popular class of machine learning models. Inspired by the learning to explain (L2X) paradigm, we propose L2XGNN, a framework for explainable GNNs which provides faithful explanations by design. L2XGNN learns a mechanism for selecting explanatory subgraphs (motifs) which are exclusively used in the GNNs message-passing operations. L2XGNN is able to select, for each input graph, a subgraph with specific properties such as being sparse and connected. Imposing such constraints on the motifs often leads to more interpretable and effective explanations. Experiments on several datasets suggest that L2XGNN achieves the same classification accuracy as baseline methods using the entire input graph while ensuring that only the provided explanations are used to make predictions. Moreover, we show that L2XGNN is able to identify motifs responsible for the graph's properties it is intended to predict.

6/17/2024

cs.LG cs.AI

Explainable Graph Neural Networks Under Fire

Zhong Li, Simon Geisler, Yuhang Wang, Stephan Gunnemann, Matthijs van Leeuwen

Predictions made by graph neural networks (GNNs) usually lack interpretability due to their complex computational behavior and the abstract nature of graphs. In an attempt to tackle this, many GNN explanation methods have emerged. Their goal is to explain a model's predictions and thereby obtain trust when GNN models are deployed in decision critical applications. Most GNN explanation methods work in a post-hoc manner and provide explanations in the form of a small subset of important edges and/or nodes. In this paper we demonstrate that these explanations can unfortunately not be trusted, as common GNN explanation methods turn out to be highly susceptible to adversarial perturbations. That is, even small perturbations of the original graph structure that preserve the model's predictions may yield drastically different explanations. This calls into question the trustworthiness and practical utility of post-hoc explanation methods for GNNs. To be able to attack GNN explanation models, we devise a novel attack method dubbed textit{GXAttack}, the first textit{optimization-based} adversarial attack method for post-hoc GNN explanations under such settings. Due to the devastating effectiveness of our attack, we call for an adversarial evaluation of future GNN explainers to demonstrate their robustness.

6/11/2024

cs.LG cs.AI

Graph Neural Network Explanations are Fragile

Jiate Li, Meng Pang, Yun Dong, Jinyuan Jia, Binghui Wang

Explainable Graph Neural Network (GNN) has emerged recently to foster the trust of using GNNs. Existing GNN explainers are developed from various perspectives to enhance the explanation performance. We take the first step to study GNN explainers under adversarial attack--We found that an adversary slightly perturbing graph structure can ensure GNN model makes correct predictions, but the GNN explainer yields a drastically different explanation on the perturbed graph. Specifically, we first formulate the attack problem under a practical threat model (i.e., the adversary has limited knowledge about the GNN explainer and a restricted perturbation budget). We then design two methods (i.e., one is loss-based and the other is deduction-based) to realize the attack. We evaluate our attacks on various GNN explainers and the results show these explainers are fragile.

6/6/2024

cs.CR cs.LG

GNNAnatomy: Systematic Generation and Evaluation of Multi-Level Explanations for Graph Neural Networks

Hsiao-Ying Lu, Yiran Li, Ujwal Pratap Krishna Kaluvakolanu Thyagarajan, Kwan-Liu Ma

Graph Neural Networks (GNNs) have proven highly effective in various machine learning (ML) tasks involving graphs, such as node/graph classification and link prediction. However, explaining the decisions made by GNNs poses challenges because of the aggregated relational information based on graph structure, leading to complex data transformations. Existing methods for explaining GNNs often face limitations in systematically exploring diverse substructures and evaluating results in the absence of ground truths. To address this gap, we introduce GNNAnatomy, a model- and dataset-agnostic visual analytics system designed to facilitate the generation and evaluation of multi-level explanations for GNNs. In GNNAnatomy, we employ graphlets to elucidate GNN behavior in graph-level classification tasks. By analyzing the associations between GNN classifications and graphlet frequencies, we formulate hypothesized factual and counterfactual explanations. To validate a hypothesized graphlet explanation, we introduce two metrics: (1) the correlation between its frequency and the classification confidence, and (2) the change in classification confidence after removing this substructure from the original graph. To demonstrate the effectiveness of GNNAnatomy, we conduct case studies on both real-world and synthetic graph datasets from various domains. Additionally, we qualitatively compare GNNAnatomy with a state-of-the-art GNN explainer, demonstrating the utility and versatility of our design.

6/10/2024

cs.LG cs.IR cs.SI