On GNN explainability with activation rules

2406.11594

Published 6/18/2024 by Luca Veyrin-Forrer, Ataollah Kamal, Stefan Duffner, Marc Plantevit, Céline Robardet

Abstract

GNNs are powerful models based on node representation learning that perform particularly well in many machine learning problems related to graphs. The major obstacle to the deployment of GNNs is mostly a problem of societal acceptability and trustworthiness, properties which require making explicit the internal functioning of such models. Here, we propose to mine activation rules in the hidden layers to understand how the GNNs perceive the world. The problem is not to discover activation rules that are individually highly discriminating for an output of the model. Instead, the challenge is to provide a small set of rules that cover all input graphs. To this end, we introduce the subjective activation pattern domain. We define an effective and principled algorithm to enumerate activation rules in each hidden layer. The proposed approach for quantifying the interest of these rules is rooted in information theory and is able to account for background knowledge on the input graph data. The activation rules can then be redescribed thanks to pattern languages involving interpretable features. We show that the activation rules provide insights on the characteristics used by the GNN to classify the graphs. In particular, this allows identifying the hidden features built by the GNN through its different layers. Also, these rules can subsequently be used for explaining GNN decisions. Experiments on both synthetic and real-life datasets show highly competitive performance, with up to 200% improvement in fidelity on explaining graph classification over the SOTA methods.

Overview

This paper explores new methods for interpreting and explaining the inner workings of graph neural networks (GNNs), a powerful type of machine learning model used for analyzing graph-structured data.
The key idea is to analyze the "activation patterns" of individual neurons within the GNN to gain insights into how the model is making decisions.
The authors propose an "introspection-based" approach that involves mining frequent activation patterns and using them to generate explanations for the GNN's behavior.

Plain English Explanation

The paper addresses the challenge of understanding how graph neural networks (GNNs) work under the hood. GNNs are a type of machine learning model that can analyze data organized in the form of graphs, with nodes and connections between them. They have become very powerful at tasks like predicting relationships in social networks or classifying molecules.

However, GNNs are often treated as "black boxes": it's not always clear how they arrived at a particular prediction or classification. The authors of this paper wanted to open up this black box and shed light on the inner workings of GNNs. Their key insight was to look at the "activation patterns" of individual neurons in the neural network.

Neurons in a neural network fire (or activate) based on the input they receive. By analyzing which neurons are firing and how they're connected, the researchers believed they could gain valuable insights into how the GNN is making decisions. They developed a method to automatically identify common activation patterns and then use those patterns to explain the GNN's behavior.
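
To make the idea of a neuron "firing" concrete, here is a minimal sketch in plain Python. It is not the authors' code: the toy graph, the features, the weights, and the function names (message_passing_layer, binarize) are all made up for illustration, and the layer is a generic mean-aggregation layer with a ReLU, which may differ from the architecture used in the paper. A neuron is treated as activated for a node when its output is strictly positive.

```python
# Hypothetical toy example: one mean-aggregation message-passing layer with ReLU.
# The graph, features, and weights below are invented for illustration only.

def relu(x):
    return x if x > 0.0 else 0.0

def message_passing_layer(adjacency, features, weights):
    """For each node, average its own features with its neighbors' features,
    apply one weight row per hidden neuron, then apply ReLU."""
    hidden = []
    for node, neighbors in enumerate(adjacency):
        group = [node] + list(neighbors)
        dim = len(features[node])
        mean = [sum(features[n][d] for n in group) / len(group) for d in range(dim)]
        hidden.append([relu(sum(w * m for w, m in zip(row, mean))) for row in weights])
    return hidden

def binarize(hidden):
    """Mark a neuron as activated (1) when its output is strictly positive."""
    return [[1 if value > 0.0 else 0 for value in vector] for vector in hidden]

adjacency = [[1], [0, 2], [1]]                      # path graph 0 - 1 - 2
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]]     # two input features per node
weights = [[1.0, -1.0], [-1.0, 1.0], [1.0, 1.0]]    # three hidden neurons

hidden = message_passing_layer(adjacency, features, weights)
activations = binarize(hidden)
print(activations)
```

Each row of activations is one node and each column is one hidden neuron, so the binary matrix records which neurons fired for which node. Matrices of this kind are the raw material on which activation patterns can be searched.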

For example, the researchers might find that a certain set of neurons tend to activate together when the GNN is classifying a particular type of molecule. This activation pattern could reveal important features the GNN is focusing on to make its prediction. The goal is to provide more transparency and interpretability for these powerful but often opaque machine learning models.

Technical Explanation

The paper proposes an "introspection-based" approach to explaining the behavior of graph neural networks (GNNs). The core idea is to analyze the "activation patterns" of individual neurons within the GNN model to gain insights into how it is making predictions or classifications.

The authors develop a two-stage process:

Activation pattern mining: They first mine the GNN to identify frequently occurring activation patterns - specific sets of neurons that tend to fire together. This is done using frequent subgraph mining techniques adapted for the neural network setting.
Explanation generation: The researchers then use these mined activation patterns as the basis for generating explanations of the GNN's outputs. For example, they can identify which activation patterns are most strongly associated with a particular class prediction.
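
The two stages above can be sketched as follows. This is a simplified illustration under stated assumptions, not the paper's algorithm: it swaps in plain support counting over small neuron sets and a class-confidence score, whereas the abstract describes an information-theoretic measure of interest that accounts for background knowledge. The function names (mine_frequent_patterns, class_confidence), the toy activation rows, and the labels are hypothetical.

```python
from itertools import combinations

def mine_frequent_patterns(activation_rows, min_support, max_size=2):
    """Stage 1 (simplified): enumerate sets of neurons that are active
    together in at least min_support rows. Returns {neuron_set: row_indices}."""
    n_neurons = len(activation_rows[0])
    patterns = {}
    for size in range(1, max_size + 1):
        for neurons in combinations(range(n_neurons), size):
            support = [i for i, row in enumerate(activation_rows)
                       if all(row[j] for j in neurons)]
            if len(support) >= min_support:
                patterns[neurons] = support
    return patterns

def class_confidence(support, labels, target):
    """Stage 2 (simplified): fraction of supporting rows whose predicted
    label equals the target class."""
    return sum(1 for i in support if labels[i] == target) / len(support)

# Hypothetical binary activations (rows = graphs, columns = hidden neurons)
# and the class the model predicted for each graph.
rows = [
    [1, 1, 0],
    [1, 1, 0],
    [0, 1, 1],
    [0, 0, 1],
]
labels = ["A", "A", "B", "B"]

patterns = mine_frequent_patterns(rows, min_support=2)
# Rank patterns by how strongly they are associated with class "A".
ranked = sorted(patterns, key=lambda p: (-class_confidence(patterns[p], labels, "A"), p))
for neurons in ranked:
    print(neurons, patterns[neurons], class_confidence(patterns[neurons], labels, "A"))
```

Exhaustive enumeration like this grows combinatorially with the number of neurons, which is why max_size is capped here. A real implementation would need pruning and a more selective interest measure to keep the rule set small.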

The authors evaluate their approach on a range of GNN benchmarks, showing that the mined activation patterns can provide meaningful and interpretable explanations for the model's decisions. This includes being able to highlight the most important input features and graph structures that the GNN is focusing on.

The work builds on prior research into understanding the expressivity of GNNs, generating multi-level GNN explanations, and the role of activation functions in GNNs. It also relates to other efforts to learn to explain GNNs and leverage symbolic methods for interpreting hidden neurons.

Critical Analysis

The proposed approach provides a novel and potentially valuable way to open up the "black box" of graph neural networks and gain deeper insights into their inner workings. By focusing on the activation patterns of individual neurons, the method offers a principled way to highlight the key features and graph structures that are driving the GNN's predictions.

However, the paper also acknowledges some limitations and areas for further research. For example, the activation pattern mining process can be computationally expensive, especially for larger and more complex GNN models. The researchers suggest exploring more efficient mining algorithms or potentially focusing the analysis on a subset of "important" neurons.

Additionally, while the mined activation patterns can provide useful explanations, it's not clear how generalizable or robust these explanations are. The paper does not deeply explore whether the insights gleaned from the activation patterns would hold up across different datasets or model architectures.

There is also the broader question of how much we can really trust these types of "introspection-based" explanations of neural networks. Even if we can identify the specific neurons and activation patterns behind a prediction, it's still challenging to know whether these patterns truly capture the model's full decision-making process or if there are deeper, more complex relationships at play.

Overall, the work represents an important step forward in GNN interpretability, but more research is likely needed to fully understand the strengths, limitations, and broader implications of this approach. Readers are encouraged to think critically about the tradeoffs and consider how this type of technique could be combined with other explanation methods to paint a more complete picture of GNN behavior.

Conclusion

This paper introduces a novel technique for explaining the inner workings of graph neural networks (GNNs) by analyzing the activation patterns of individual neurons in the model. The key idea is to mine the GNN for frequently occurring activation patterns and then use those patterns to generate interpretable explanations for the model's predictions or classifications.

The authors demonstrate that this "introspection-based" approach can provide meaningful insights, highlighting the important input features and graph structures that are driving the GNN's decision-making. While the technique has some limitations, it represents an important step forward in efforts to open up the "black box" of these powerful machine learning models and make their behavior more transparent and understandable.

As GNNs continue to see widespread adoption across a range of domains, techniques like the one proposed in this paper will become increasingly important for building trust, accountability, and transparency in these systems. The findings presented here lay the groundwork for further research into GNN interpretability and its real-world applications.

Related Papers

Understanding Expressivity of GNN in Rule Learning

Haiquan Qiu, Yongqi Zhang, Yong Li, Quanming Yao

Rule learning is critical to improving knowledge graph (KG) reasoning due to its ability to provide logical and interpretable explanations. Recently, Graph Neural Networks (GNNs) with tail entity scoring have achieved state-of-the-art performance on KG reasoning. However, theoretical understanding of these GNNs is either lacking or focused on single-relational graphs, leaving open the question of what kind of rules these GNNs can learn. We propose to fill the above gap in this paper. Specifically, GNNs with tail entity scoring are unified into a common framework. Then, we analyze their expressivity by formally describing the rule structures they can learn and theoretically demonstrating their superiority. These results further inspire us to propose a novel labeling strategy to learn more rules in KG reasoning. Experimental results are consistent with our theoretical findings and verify the effectiveness of our proposed method. The code is publicly available at https://github.com/LARS-research/Rule-learning-expressivity.

4/11/2024

cs.LG cs.AI

On the power of graph neural networks and the role of the activation function

Sammy Khalife, Amitabh Basu

In this article we present new results about the expressivity of Graph Neural Networks (GNNs). We prove that for any GNN with piecewise polynomial activations, whose architecture size does not grow with the graph input sizes, there exists a pair of non-isomorphic rooted trees of depth two such that the GNN cannot distinguish their root vertex up to an arbitrary number of iterations. The proof relies on tools from the algebra of symmetric polynomials. In contrast, it was already known that unbounded GNNs (those whose size is allowed to change with the graph sizes) with piecewise polynomial activations can distinguish these vertices in only two iterations. It was also known prior to our work that with ReLU (piecewise linear) activations, bounded GNNs are weaker than unbounded GNNs [Aamand et al., 2022]. Our approach adds to this result by extending it to handle any piecewise polynomial activation function, which goes towards answering an open question formulated by Grohe [Grohe, 2021] more completely. Our second result states that if one allows activations that are not piecewise polynomial, then in two iterations a single neuron perceptron can distinguish the root vertices of any pair of non-isomorphic trees of depth two (our results hold for activations like the sigmoid, hyperbolic tangent and others). This shows how the power of graph neural networks can change drastically if one changes the activation function of the neural networks. The proof of this result utilizes the Lindemann-Weierstrass theorem from transcendental number theory.

5/8/2024

cs.LG

GNNAnatomy: Systematic Generation and Evaluation of Multi-Level Explanations for Graph Neural Networks

Hsiao-Ying Lu, Yiran Li, Ujwal Pratap Krishna Kaluvakolanu Thyagarajan, Kwan-Liu Ma

Graph Neural Networks (GNNs) have proven highly effective in various machine learning (ML) tasks involving graphs, such as node/graph classification and link prediction. However, explaining the decisions made by GNNs poses challenges because of the aggregated relational information based on graph structure, leading to complex data transformations. Existing methods for explaining GNNs often face limitations in systematically exploring diverse substructures and evaluating results in the absence of ground truths. To address this gap, we introduce GNNAnatomy, a model- and dataset-agnostic visual analytics system designed to facilitate the generation and evaluation of multi-level explanations for GNNs. In GNNAnatomy, we employ graphlets to elucidate GNN behavior in graph-level classification tasks. By analyzing the associations between GNN classifications and graphlet frequencies, we formulate hypothesized factual and counterfactual explanations. To validate a hypothesized graphlet explanation, we introduce two metrics: (1) the correlation between its frequency and the classification confidence, and (2) the change in classification confidence after removing this substructure from the original graph. To demonstrate the effectiveness of GNNAnatomy, we conduct case studies on both real-world and synthetic graph datasets from various domains. Additionally, we qualitatively compare GNNAnatomy with a state-of-the-art GNN explainer, demonstrating the utility and versatility of our design.

6/10/2024

cs.LG cs.IR cs.SI

L2XGNN: Learning to Explain Graph Neural Networks

Giuseppe Serra, Mathias Niepert

Graph Neural Networks (GNNs) are a popular class of machine learning models. Inspired by the learning to explain (L2X) paradigm, we propose L2XGNN, a framework for explainable GNNs which provides faithful explanations by design. L2XGNN learns a mechanism for selecting explanatory subgraphs (motifs) which are exclusively used in the GNN's message-passing operations. L2XGNN is able to select, for each input graph, a subgraph with specific properties such as being sparse and connected. Imposing such constraints on the motifs often leads to more interpretable and effective explanations. Experiments on several datasets suggest that L2XGNN achieves the same classification accuracy as baseline methods using the entire input graph while ensuring that only the provided explanations are used to make predictions. Moreover, we show that L2XGNN is able to identify motifs responsible for the graph's properties it is intended to predict.

6/17/2024

cs.LG cs.AI