A Survey on Graph Counterfactual Explanations: Definitions, Methods, Evaluation, and Research Challenges

2210.12089

Published 6/12/2024 by Mario Alfonso Prado-Romero, Bardh Prenkaj, Giovanni Stilo, Fosca Giannotti

👁️

Abstract

Graph Neural Networks (GNNs) perform well in community detection and molecule classification. Counterfactual Explanations (CE) provide counter-examples to overcome the transparency limitations of black-box models. Due to the growing attention in graph learning, we focus on the concepts of CE for GNNs. We analysed the SoA to provide a taxonomy, a uniform notation, and the benchmarking datasets and evaluation metrics. We discuss fourteen methods, their evaluation protocols, twenty-two datasets, and nineteen metrics. We integrated the majority of methods into the GRETEL library to conduct an empirical evaluation to understand their strengths and pitfalls. We highlight open challenges and future work.

Create account to get full access

Overview

Graph Neural Networks (GNNs) are effective for community detection and molecule classification tasks.
Counterfactual Explanations (CE) can help overcome the transparency limitations of black-box models like GNNs.
This paper focuses on the concepts of CE for GNNs, providing a taxonomy, notation, and an analysis of evaluation datasets and metrics.
The paper discusses 14 CE methods, their evaluation protocols, 22 datasets, and 19 metrics.
The majority of the methods were integrated into the GRETEL library for empirical evaluation to understand their strengths and limitations.
The paper highlights open challenges and future work in this area.

Plain English Explanation

Graph Neural Networks (GNNs) are a type of machine learning model that can effectively analyze data represented as networks or graphs. These models have shown great performance in tasks like community detection and molecule classification.

However, GNNs, like many other advanced machine learning models, can be considered "black boxes" - their inner workings are not always transparent or easy to understand. This can be a problem, as users may want to know why a model made a particular decision.

Counterfactual Explanations (CE) are a way to address this issue. CEs provide counter-examples that show how a model's output would change if certain input features were different. This can help users understand the model's reasoning and decision-making process.

This paper focuses on the concepts of CEs for GNNs, which are becoming increasingly important as graph-based machine learning continues to grow. The researchers provide a detailed taxonomy, a standardized notation, and an analysis of the datasets and evaluation metrics used to assess CE methods for GNNs.

They also discuss 14 different CE methods, how they are evaluated, and the strengths and limitations of each approach. To help researchers and practitioners, the team integrated most of these methods into a library called GRETEL, allowing for a comprehensive empirical evaluation.

Finally, the paper highlights several open challenges and areas for future research in this field, which will be important as CE techniques continue to evolve and become more widely used with GNNs and other complex machine learning models.

Technical Explanation

The paper begins by highlighting the success of Graph Neural Networks (GNNs) in tasks like community detection and molecule classification. However, the authors note that GNNs, like many other advanced machine learning models, suffer from a lack of transparency, as their inner workings are not always easily interpretable.

To address this, the researchers focus on the concept of Counterfactual Explanations (CE), which can provide counter-examples to help users understand why a model made a particular decision. The paper presents a taxonomy, a uniform notation, and an analysis of the benchmarking datasets and evaluation metrics used in the CE for GNN literature.

The researchers then discuss 14 different CE methods for GNNs, covering their evaluation protocols, the 22 datasets used, and the 19 metrics employed. To facilitate further research, the team integrated the majority of these methods into the GRETEL library, allowing for a comprehensive empirical evaluation to understand the strengths and limitations of each approach.

The paper concludes by highlighting several open challenges and areas for future work in this field, such as improving the plausibility and robustness of CEs, as well as quantifying the uncertainty associated with the explanations provided.

Critical Analysis

The paper provides a thorough and well-structured overview of the current state of Counterfactual Explanations (CE) for Graph Neural Networks (GNNs). The authors' decision to integrate the majority of the discussed methods into the GRETEL library is particularly commendable, as it will facilitate further research and collaboration in this field.

One potential limitation of the paper is the sheer number of methods, datasets, and evaluation metrics covered, which could make it challenging for readers to fully grasp the nuances of each approach. The authors acknowledge this issue and suggest that future work could focus on a more in-depth analysis of a smaller subset of the most promising CE methods for GNNs.

Additionally, while the paper highlights several open challenges, such as improving the plausibility and robustness of CEs, it would have been valuable to discuss these issues in greater depth. For example, the authors could have delved into the specific technical hurdles that need to be overcome to address these challenges, or suggested potential research directions to tackle them.

Overall, this paper serves as a comprehensive and valuable resource for researchers and practitioners working on interpretability and explainability in the context of graph-based machine learning. The detailed taxonomy, standardized notation, and thorough empirical evaluation provide a solid foundation for future advancements in this rapidly evolving field.

Conclusion

This paper presents a detailed analysis of the current state of Counterfactual Explanations (CE) for Graph Neural Networks (GNNs). The researchers provide a taxonomy, a uniform notation, and an evaluation of the benchmarking datasets and metrics used in this field.

By discussing 14 different CE methods for GNNs, their evaluation protocols, and the strengths and limitations of each approach, the paper offers valuable insights for researchers and practitioners working on interpretability and explainability in graph-based machine learning. The integration of these methods into the GRETEL library further enhances the accessibility and usability of the research.

While the paper covers a large number of techniques, datasets, and metrics, it also highlights several open challenges, such as improving the plausibility and robustness of CEs, as well as quantifying the uncertainty associated with the explanations. Addressing these issues will be crucial for the continued advancement of CE methods and their widespread adoption in real-world applications.

Overall, this paper provides a comprehensive and well-structured resource for understanding the current state of the art in CE for GNNs, and serves as a solid foundation for future research in this rapidly evolving field.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Graph Edits for Counterfactual Explanations: A comparative study

Angeliki Dimitriou, Nikolaos Chaidos, Maria Lymperaiou, Giorgos Stamou

Counterfactuals have been established as a popular explainability technique which leverages a set of minimal edits to alter the prediction of a classifier. When considering conceptual counterfactuals on images, the edits requested should correspond to salient concepts present in the input data. At the same time, conceptual distances are defined by knowledge graphs, ensuring the optimality of conceptual edits. In this work, we extend previous endeavors on graph edits as counterfactual explanations by conducting a comparative study which encompasses both supervised and unsupervised Graph Neural Network (GNN) approaches. To this end, we pose the following significant research question: should we represent input data as graphs, which is the optimal GNN approach in terms of performance and time efficiency to generate minimal and meaningful counterfactual explanations for black-box image classifiers?

4/19/2024

cs.LG cs.AI

Unifying Perspectives: Plausible Counterfactual Explanations on Global, Group-wise, and Local Levels

Patryk Wielopolski, Oleksii Furman, Jerzy Stefanowski, Maciej Zik{e}ba

Growing regulatory and societal pressures demand increased transparency in AI, particularly in understanding the decisions made by complex machine learning models. Counterfactual Explanations (CFs) have emerged as a promising technique within Explainable AI (xAI), offering insights into individual model predictions. However, to understand the systemic biases and disparate impacts of AI models, it is crucial to move beyond local CFs and embrace global explanations, which offer a~holistic view across diverse scenarios and populations. Unfortunately, generating Global Counterfactual Explanations (GCEs) faces challenges in computational complexity, defining the scope of global, and ensuring the explanations are both globally representative and locally plausible. We introduce a novel unified approach for generating Local, Group-wise, and Global Counterfactual Explanations for differentiable classification models via gradient-based optimization to address these challenges. This framework aims to bridge the gap between individual and systemic insights, enabling a deeper understanding of model decisions and their potential impact on diverse populations. Our approach further innovates by incorporating a probabilistic plausibility criterion, enhancing actionability and trustworthiness. By offering a cohesive solution to the optimization and plausibility challenges in GCEs, our work significantly advances the interpretability and accountability of AI models, marking a step forward in the pursuit of transparent AI.

5/29/2024

cs.LG cs.AI

🧠

Utilizing Description Logics for Global Explanations of Heterogeneous Graph Neural Networks

Dominik Kohler, Stefan Heindorf

Graph Neural Networks (GNNs) are effective for node classification in graph-structured data, but they lack explainability, especially at the global level. Current research mainly utilizes subgraphs of the input as local explanations or generates new graphs as global explanations. However, these graph-based methods are limited in their ability to explain classes with multiple sufficient explanations. To provide more expressive explanations, we propose utilizing class expressions (CEs) from the field of description logic (DL). Our approach explains heterogeneous graphs with different types of nodes using CEs in the EL description logic. To identify the best explanation among multiple candidate explanations, we employ and compare two different scoring functions: (1) For a given CE, we construct multiple graphs, have the GNN make a prediction for each graph, and aggregate the predicted scores. (2) We score the CE in terms of fidelity, i.e., we compare the predictions of the GNN to the predictions by the CE on a separate validation set. Instead of subgraph-based explanations, we offer CE-based explanations.

5/22/2024

cs.AI cs.LO

🧠

Provably Robust and Plausible Counterfactual Explanations for Neural Networks via Robust Optimisation

Junqi Jiang, Jianglin Lan, Francesco Leofante, Antonio Rago, Francesca Toni

Counterfactual Explanations (CEs) have received increasing interest as a major methodology for explaining neural network classifiers. Usually, CEs for an input-output pair are defined as data points with minimum distance to the input that are classified with a different label than the output. To tackle the established problem that CEs are easily invalidated when model parameters are updated (e.g. retrained), studies have proposed ways to certify the robustness of CEs under model parameter changes bounded by a norm ball. However, existing methods targeting this form of robustness are not sound or complete, and they may generate implausible CEs, i.e., outliers wrt the training dataset. In fact, no existing method simultaneously optimises for closeness and plausibility while preserving robustness guarantees. In this work, we propose Provably RObust and PLAusible Counterfactual Explanations (PROPLACE), a method leveraging on robust optimisation techniques to address the aforementioned limitations in the literature. We formulate an iterative algorithm to compute provably robust CEs and prove its convergence, soundness and completeness. Through a comparative experiment involving six baselines, five of which target robustness, we show that PROPLACE achieves state-of-the-art performances against metrics on three evaluation aspects.

4/5/2024

cs.LG cs.AI