Explaining Graph Neural Networks via Structure-aware Interaction Index

Read original: arXiv:2405.14352 - Published 5/24/2024 by Ngoc Bui, Hieu Trung Nguyen, Viet Anh Nguyen, Rex Ying

🧠

Overview

The Shapley value is a popular tool for interpreting black-box machine learning models, but it has limitations when dealing with structured inputs like graph neural networks.
This paper introduces the Myerson-Taylor interaction index, which incorporates the graph structure into the attribution of node values and node interactions.
The authors prove that the Myerson-Taylor index is the unique index that satisfies a set of five natural axioms accounting for graph structure and high-order interactions.
The paper proposes Myerson-Taylor Structure-Aware Graph Explainer (MAGE), which uses the second-order Myerson-Taylor index to identify the most important motifs influencing the model prediction.
Experiments show that MAGE provides superior subgraph explanations compared to state-of-the-art methods.

Plain English Explanation

Machine learning models are often treated as "black boxes," meaning it can be difficult to understand how they make their predictions. The Shapley value is a popular tool for interpreting these black-box models, but it has limitations when dealing with complex, structured data like graph neural networks.

This paper introduces a new approach called the Myerson-Taylor interaction index, which is designed to better account for the graph structure of the input data. Unlike the Shapley value, the Myerson-Taylor index decomposes the input into components that satisfy a specific connectivity criterion, allowing it to capture the importance of different subgraphs or "motifs" within the larger graph.

The authors prove that the Myerson-Taylor index is the unique index that satisfies a set of five natural axioms, which ensure that it properly accounts for the graph structure and high-order interactions between nodes. This strong theoretical foundation is what makes the Myerson-Taylor index a promising tool for interpreting graph-based machine learning models.

Building on this, the researchers propose a novel explainer called MAGE (Myerson-Taylor Structure-Aware Graph Explainer), which uses the second-order Myerson-Taylor index to identify the most important motifs that are influencing the model's predictions, both positively and negatively. Through extensive experiments, they show that MAGE consistently provides better explanations than other state-of-the-art methods, especially for graph-based models.

Technical Explanation

The key innovation in this paper is the introduction of the Myerson-Taylor interaction index, which addresses limitations of the Shapley value when applied to graph-structured data.

Unlike the Shapley value, which either focuses solely on node-wise importance or neglects the graph structure when perturbing the input, the Myerson-Taylor index internalizes the graph structure into the attribution of node values and interaction values. Specifically, it decomposes coalitions (sets of nodes) into components that satisfy a pre-chosen connectivity criterion, rather than treating each node independently.

The authors prove that the Myerson-Taylor index is the unique index that satisfies a system of five natural axioms accounting for graph structure and high-order interaction among nodes. These axioms ensure properties like linearity, efficiency, and the ability to capture both node importance and higher-order interactions.

Leveraging these desirable properties, the researchers propose MAGE, a novel explainer that uses the second-order Myerson-Taylor index to identify the most important motifs (subgraphs) influencing the model's predictions. MAGE outperforms state-of-the-art methods like KernelSHAP-IQ and Error Analysis on various graph datasets and models, providing superior subgraph explanations.

Critical Analysis

The Myerson-Taylor interaction index and the MAGE explainer presented in this paper offer a promising approach to interpreting graph-based machine learning models. The strong theoretical foundation, with the index satisfying a set of natural axioms, is a compelling aspect of this work.

However, the paper does not extensively discuss the limitations or potential downsides of the Myerson-Taylor index. For example, the computational complexity of computing the index is not explored, which could be a practical concern for large-scale graphs. Additionally, the paper does not delve into the robustness of the index to noise or perturbations in the input graph, which would be an important consideration for real-world applications.

Furthermore, while the experiments demonstrate the superiority of MAGE over other state-of-the-art methods, the paper does not provide a deep analysis of the specific cases where MAGE excels or where it may fall short. A more nuanced discussion of the strengths and weaknesses of the proposed approach would help readers better assess its overall merits and limitations.

Conclusion

This paper introduces the Myerson-Taylor interaction index, a novel tool for interpreting graph-based machine learning models. By incorporating the graph structure into the attribution of node values and interactions, the Myerson-Taylor index addresses key limitations of the Shapley value and other existing explainability approaches.

The authors prove that the Myerson-Taylor index is the unique index satisfying a set of natural axioms, ensuring its strong theoretical grounding. Building on this, they propose the MAGE explainer, which leverages the second-order Myerson-Taylor index to identify the most influential subgraph motifs for a model's predictions.

Empirical results demonstrate that MAGE consistently outperforms state-of-the-art explainability methods, highlighting the potential of the Myerson-Taylor index to advance the interpretability of graph neural networks and other graph-based models. As the use of such models continues to grow, tools like MAGE could play a crucial role in enhancing the transparency and trustworthiness of these powerful machine learning techniques.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🧠

Explaining Graph Neural Networks via Structure-aware Interaction Index

Ngoc Bui, Hieu Trung Nguyen, Viet Anh Nguyen, Rex Ying

The Shapley value is a prominent tool for interpreting black-box machine learning models thanks to its strong theoretical foundation. However, for models with structured inputs, such as graph neural networks, existing Shapley-based explainability approaches either focus solely on node-wise importance or neglect the graph structure when perturbing the input instance. This paper introduces the Myerson-Taylor interaction index that internalizes the graph structure into attributing the node values and the interaction values among nodes. Unlike the Shapley-based methods, the Myerson-Taylor index decomposes coalitions into components satisfying a pre-chosen connectivity criterion. We prove that the Myerson-Taylor index is the unique one that satisfies a system of five natural axioms accounting for graph structure and high-order interaction among nodes. Leveraging these properties, we propose Myerson-Taylor Structure-Aware Graph Explainer (MAGE), a novel explainer that uses the second-order Myerson-Taylor index to identify the most important motifs influencing the model prediction, both positively and negatively. Extensive experiments on various graph datasets and models demonstrate that our method consistently provides superior subgraph explanations compared to state-of-the-art methods.

5/24/2024

ShapG: new feature importance method based on the Shapley value

Chi Zhao, Jing Liu, Elena Parilina

With wide application of Artificial Intelligence (AI), it has become particularly important to make decisions of AI systems explainable and transparent. In this paper, we proposed a new Explainable Artificial Intelligence (XAI) method called ShapG (Explanations based on Shapley value for Graphs) for measuring feature importance. ShapG is a model-agnostic global explanation method. At the first stage, it defines an undirected graph based on the dataset, where nodes represent features and edges are added based on calculation of correlation coefficients between features. At the second stage, it calculates an approximated Shapley value by sampling the data taking into account this graph structure. The sampling approach of ShapG allows to calculate the importance of features efficiently, i.e. to reduce computational complexity. Comparison of ShapG with other existing XAI methods shows that it provides more accurate explanations for two examined datasets. We also compared other XAI methods developed based on cooperative game theory with ShapG in running time, and the results show that ShapG exhibits obvious advantages in its running time, which further proves efficiency of ShapG. In addition, extensive experiments demonstrate a wide range of applicability of the ShapG method for explaining complex models. We find ShapG an important tool in improving explainability and transparency of AI systems and believe it can be widely used in various fields.

7/2/2024

🤔

Succinct Interaction-Aware Explanations

Sascha Xu, Joscha Cuppers, Jilles Vreeken

SHAP is a popular approach to explain black-box models by revealing the importance of individual features. As it ignores feature interactions, SHAP explanations can be confusing up to misleading. NSHAP, on the other hand, reports the additive importance for all subsets of features. While this does include all interacting sets of features, it also leads to an exponentially sized, difficult to interpret explanation. In this paper, we propose to combine the best of these two worlds, by partitioning the features into parts that significantly interact, and use these parts to compose a succinct, interpretable, additive explanation. We derive a criterion by which to measure the representativeness of such a partition for a models behavior, traded off against the complexity of the resulting explanation. To efficiently find the best partition out of super-exponentially many, we show how to prune sub-optimal solutions using a statistical test, which not only improves runtime but also helps to detect spurious interactions. Experiments on synthetic and real world data show that our explanations are both more accurate resp. more easily interpretable than those of SHAP and NSHAP.

4/22/2024

Shaping Up SHAP: Enhancing Stability through Layer-Wise Neighbor Selection

Gwladys Kelodjou, Laurence Roz'e, V'eronique Masson, Luis Gal'arraga, Romaric Gaudel, Maurice Tchuente, Alexandre Termier

Machine learning techniques, such as deep learning and ensemble methods, are widely used in various domains due to their ability to handle complex real-world tasks. However, their black-box nature has raised multiple concerns about the fairness, trustworthiness, and transparency of computer-assisted decision-making. This has led to the emergence of local post-hoc explainability methods, which offer explanations for individual decisions made by black-box algorithms. Among these methods, Kernel SHAP is widely used due to its model-agnostic nature and its well-founded theoretical framework. Despite these strengths, Kernel SHAP suffers from high instability: different executions of the method with the same inputs can lead to significantly different explanations, which diminishes the relevance of the explanations. The contribution of this paper is two-fold. On the one hand, we show that Kernel SHAP's instability is caused by its stochastic neighbor selection procedure, which we adapt to achieve full stability without compromising explanation fidelity. On the other hand, we show that by restricting the neighbors generation to perturbations of size 1 -- which we call the coalitions of Layer 1 -- we obtain a novel feature-attribution method that is fully stable, computationally efficient, and still meaningful.

6/18/2024