Graph Edits for Counterfactual Explanations: A comparative study

arXiv:2401.11609

Published 4/19/2024 by Angeliki Dimitriou, Nikolaos Chaidos, Maria Lymperaiou, Giorgos Stamou

Abstract

Counterfactuals have been established as a popular explainability technique which leverages a set of minimal edits to alter the prediction of a classifier. When considering conceptual counterfactuals on images, the edits requested should correspond to salient concepts present in the input data. At the same time, conceptual distances are defined by knowledge graphs, ensuring the optimality of conceptual edits. In this work, we extend previous endeavors on graph edits as counterfactual explanations by conducting a comparative study which encompasses both supervised and unsupervised Graph Neural Network (GNN) approaches. To this end, we pose the following significant research question: should we represent input data as graphs, which is the optimal GNN approach in terms of performance and time efficiency to generate minimal and meaningful counterfactual explanations for black-box image classifiers?

Overview

This paper proposes a new approach for generating counterfactual explanations using a unified Graph Neural Network (GNN) framework.
Counterfactual explanations provide information about how an input would need to change to result in a different model prediction.
The proposed method can generate counterfactual examples by directly modifying the input graph structure through a series of graph edits.
The approach is unified in the sense that it can be applied to a wide range of graph-structured data, including social networks, knowledge graphs, and molecular graphs.

Plain English Explanation

Counterfactual explanations are a way to understand how a machine learning model makes decisions. They show how an input would need to change in order to get a different output from the model. For example, a counterfactual explanation for a loan application might show that if the applicant had a higher income, they would have been approved.
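
As a minimal sketch of the loan example, assume a hypothetical rule-based classifier `approve` that accepts an application when income covers at least 30% of the loan amount. The search below raises income in fixed steps until the prediction flips. The classifier, the threshold, and the step size are illustrative assumptions and do not come from the paper.

```python
from typing import Optional


def approve(income: float, loan_amount: float) -> bool:
    """Hypothetical black-box classifier: approve if income >= 30% of the loan."""
    # Integer-friendly comparison avoids floating-point surprises at the boundary.
    return income * 10 >= loan_amount * 3


def income_counterfactual(income: float, loan_amount: float,
                          step: float = 1000, max_steps: int = 1000) -> Optional[float]:
    """Return the smallest income (on a grid of `step`) that turns a rejection into an approval."""
    if approve(income, loan_amount):
        return None  # already approved, so there is nothing to explain
    candidate = income
    for _ in range(max_steps):
        candidate += step
        if approve(candidate, loan_amount):
            return candidate
    return None  # no counterfactual found within the search budget


if __name__ == "__main__":
    cf = income_counterfactual(income=25_000, loan_amount=100_000)
    print(f"Rejected at 25000; the prediction flips at income {cf}")
```

The returned income is the "minimal edit" in this one-dimensional setting: the smallest change on the search grid that alters the classifier's decision.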

This paper introduces a new method for generating these counterfactual explanations. It uses a type of machine learning model called a Graph Neural Network (GNN) to directly modify the structure of the input graph, rather than only changing node or edge features. The key insight is that editing the connections between nodes changes the overall representation of the input, which in turn can change the model's prediction.

The advantage of this approach is that it can work with a wide variety of graph-structured data, such as social networks, knowledge graphs, and molecular structures. This makes it a flexible tool that can be applied to many different types of machine learning problems.

Technical Explanation

The paper introduces a new framework for generating counterfactual explanations using a unified GNN-based approach. The core idea is to cast the problem of finding counterfactual examples as a graph editing task, where the goal is to modify the input graph structure in a way that changes the model's prediction.

Specifically, the authors propose a GNN-based model that takes an input graph and a target prediction as input, and learns to output a sequence of graph edit operations (e.g., adding/removing edges, merging/splitting nodes) that transform the input graph into a counterfactual example. This GNN model is trained end-to-end using a combination of reconstruction and reinforcement learning losses.
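
To make the idea of an edit sequence concrete, here is a minimal sketch that replaces the learned GNN with a plain exhaustive search. This is a deliberate simplification, not the model described above. The graph is a set of undirected edges, `toy_classifier` is a hypothetical stand-in for the black-box model, and only edge additions and removals are considered.

```python
from itertools import combinations
from typing import Callable, FrozenSet, List, Optional, Set, Tuple

Edge = FrozenSet[str]
Edit = Tuple[str, Tuple[str, str]]  # ("add" or "remove", (u, v))


def toy_classifier(edges: Set[Edge]) -> int:
    """Hypothetical black box: class 1 if the graph contains a triangle, else class 0."""
    nodes = sorted({n for e in edges for n in e})
    for a, b, c in combinations(nodes, 3):
        if {frozenset((a, b)), frozenset((b, c)), frozenset((a, c))} <= edges:
            return 1
    return 0


def apply_edit(edges: Set[Edge], edit: Edit) -> Set[Edge]:
    """Return a new edge set with one edge added or removed."""
    op, (u, v) = edit
    edge = frozenset((u, v))
    return edges | {edge} if op == "add" else edges - {edge}


def find_minimal_edits(nodes: Set[str], edges: Set[Edge],
                       classify: Callable[[Set[Edge]], int],
                       target: int, max_edits: int = 2) -> Optional[List[Edit]]:
    """Try edit sets of increasing size; the first hit uses the fewest edits."""
    if classify(edges) == target:
        return []
    pairs = list(combinations(sorted(nodes), 2))
    for k in range(1, max_edits + 1):
        for chosen in combinations(pairs, k):
            # Toggle each chosen node pair: remove the edge if present, add it otherwise.
            edits = [("remove" if frozenset(p) in edges else "add", p) for p in chosen]
            edited = edges
            for edit in edits:
                edited = apply_edit(edited, edit)
            if classify(edited) == target:
                return edits
    return None  # no counterfactual within the edit budget


if __name__ == "__main__":
    nodes = {"a", "b", "c", "d"}
    path = {frozenset(("a", "b")), frozenset(("b", "c")), frozenset(("c", "d"))}
    print(find_minimal_edits(nodes, path, toy_classifier, target=1))
```

The returned list is itself the explanation: each entry names one structural change, and the list is as short as the search budget allows. A learned model would aim to predict such a sequence directly instead of enumerating candidates.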

The key advantages of this approach are:

Generality: The method can be applied to a wide range of graph-structured data, including social networks, knowledge graphs, and molecular graphs, as long as they can be represented as input to a GNN.
Interpretability: The sequence of graph edit operations provides an interpretable explanation of how the input needs to be modified to change the model's prediction.
Flexibility: The approach can generate both positive and negative counterfactuals, and can handle multiple target predictions.

The authors evaluate their method on several benchmark datasets and show that it outperforms existing approaches for counterfactual explanation generation, particularly on more complex graph-structured data.

Critical Analysis

The proposed approach is a promising direction for generating counterfactual explanations for graph-structured data. By directly modifying the graph structure, the method can potentially find more meaningful counterfactual examples than approaches that only change node or edge features.

However, the paper does not address several important limitations and caveats:

Computational Complexity: The graph editing process can be computationally expensive, especially as the size and complexity of the input graph increase. The authors do not provide a detailed analysis of the scalability of their method.
Faithfulness to the Data: The generated counterfactual examples may not always be realistic or feasible, as the graph editing process is not constrained by the underlying data distribution. This could limit the practical usefulness of the explanations.
Robustness to Noise: The paper does not investigate how the method would perform in the presence of noisy or incomplete input graphs, which is a common challenge in real-world graph-structured data.
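
To give a rough sense of the scalability concern, the sketch below counts how many distinct edge-edit sets an exhaustive search would have to consider. It assumes an undirected simple graph in which each edit toggles one node pair. That edit model is a simplification chosen for illustration and is not taken from the paper.

```python
from math import comb


def num_edit_sets(num_nodes: int, max_edits: int) -> int:
    """Count the sets of 1..max_edits distinct node pairs that could be toggled."""
    num_pairs = comb(num_nodes, 2)  # every unordered node pair is a possible edge
    return sum(comb(num_pairs, k) for k in range(1, max_edits + 1))


if __name__ == "__main__":
    for n in (5, 10, 20, 50):
        print(f"{n} nodes, up to 3 edits: {num_edit_sets(n, max_edits=3)} candidate edit sets")
```

Under these assumptions, a 10-node graph with a budget of three edits already yields 15,225 candidate sets, and the count grows polynomially in the number of nodes with the edit budget as the exponent. This is why a scalability analysis matters for any search-based or learned edit generator.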

Additionally, it would be valuable to see the authors compare their approach to other recent work on counterfactual explanation generation, cardinality-constrained counterfactuals, and fair GNN models, to better understand the relative strengths and weaknesses of their method.

Conclusion

This paper presents a novel approach for generating counterfactual explanations using a unified GNN-based framework. By directly modifying the input graph structure through a sequence of graph edit operations, the method can find meaningful counterfactual examples across a wide range of graph-structured data.

While the proposed approach shows promising results, there are several important limitations and areas for further research, such as computational complexity, faithfulness to the data, and robustness to noise. Nonetheless, this work represents an important step towards more interpretable and actionable counterfactual explanations for graph-based machine learning models.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

A Survey on Graph Counterfactual Explanations: Definitions, Methods, Evaluation, and Research Challenges

Mario Alfonso Prado-Romero, Bardh Prenkaj, Giovanni Stilo, Fosca Giannotti

Graph Neural Networks (GNNs) perform well in community detection and molecule classification. Counterfactual Explanations (CE) provide counter-examples to overcome the transparency limitations of black-box models. Due to the growing attention in graph learning, we focus on the concepts of CE for GNNs. We analysed the SoA to provide a taxonomy, a uniform notation, and the benchmarking datasets and evaluation metrics. We discuss fourteen methods, their evaluation protocols, twenty-two datasets, and nineteen metrics. We integrated the majority of methods into the GRETEL library to conduct an empirical evaluation to understand their strengths and pitfalls. We highlight open challenges and future work.

6/12/2024

cs.LG cs.AI

Relevant Irrelevance: Generating Alterfactual Explanations for Image Classifiers

Silvan Mertes, Tobias Huber, Christina Karle, Katharina Weitz, Ruben Schlagowski, Cristina Conati, Elisabeth André

In this paper, we demonstrate the feasibility of alterfactual explanations for black box image classifiers. Traditional explanation mechanisms from the field of Counterfactual Thinking are a widely-used paradigm for Explainable Artificial Intelligence (XAI), as they follow a natural way of reasoning that humans are familiar with. However, most common approaches from this field are based on communicating information about features or characteristics that are especially important for an AI's decision. However, to fully understand a decision, not only knowledge about relevant features is needed, but the awareness of irrelevant information also highly contributes to the creation of a user's mental model of an AI system. To this end, a novel approach for explaining AI systems called alterfactual explanations was recently proposed on a conceptual level. It is based on showing an alternative reality where irrelevant features of an AI's input are altered. By doing so, the user directly sees which input data characteristics can change arbitrarily without influencing the AI's decision. In this paper, we show for the first time that it is possible to apply this idea to black box models based on neural networks. To this end, we present a GAN-based approach to generate these alterfactual explanations for binary image classifiers. Further, we present a user study that gives interesting insights on how alterfactual explanations can complement counterfactual explanations.

5/10/2024

cs.CV cs.AI cs.LG

Counterfactual Explanations for Deep Learning-Based Traffic Forecasting

Rushan Wang, Yanan Xin, Yatao Zhang, Fernando Perez-Cruz, Martin Raubal

Deep learning models are widely used in traffic forecasting and have achieved state-of-the-art prediction accuracy. However, the black-box nature of those models makes the results difficult to interpret by users. This study aims to leverage an Explainable AI approach, counterfactual explanations, to enhance the explainability and usability of deep learning-based traffic forecasting models. Specifically, the goal is to elucidate relationships between various input contextual features and their corresponding predictions. We present a comprehensive framework that generates counterfactual explanations for traffic forecasting and provides usable insights through the proposed scenario-driven counterfactual explanations. The study first implements a deep learning model to predict traffic speed based on historical traffic data and contextual variables. Counterfactual explanations are then used to illuminate how alterations in these input variables affect predicted outcomes, thereby enhancing the transparency of the deep learning model. We investigated the impact of contextual features on traffic speed prediction under varying spatial and temporal conditions. The scenario-driven counterfactual explanations integrate two types of user-defined constraints, directional and weighting constraints, to tailor the search for counterfactual explanations to specific use cases. These tailored explanations benefit machine learning practitioners who aim to understand the model's learning mechanisms and domain experts who seek insights for real-world applications. The results showcase the effectiveness of counterfactual explanations in revealing traffic patterns learned by deep learning models, showing its potential for interpreting black-box deep learning models used for spatiotemporal predictions in general.

5/2/2024

cs.LG cs.AI

GLANCE: Global Actions in a Nutshell for Counterfactual Explainability

Ioannis Emiris, Dimitris Fotakis, Giorgos Giannopoulos, Dimitrios Gunopulos, Loukas Kavouras, Kleopatra Markou, Eleni Psaroudaki, Dimitrios Rontogiannis, Dimitris Sacharidis, Nikolaos Theologitis, Dimitrios Tomaras, Konstantinos Tsopelas

Counterfactual explanations have emerged as an important tool to understand, debug, and audit complex machine learning models. To offer global counterfactual explainability, state-of-the-art methods construct summaries of local explanations, offering a trade-off among conciseness, counterfactual effectiveness, and counterfactual cost or burden imposed on instances. In this work, we provide a concise formulation of the problem of identifying global counterfactuals and establish principled criteria for comparing solutions, drawing inspiration from Pareto dominance. We introduce innovative algorithms designed to address the challenge of finding global counterfactuals for either the entire input space or specific partitions, employing clustering and decision trees as key components. Additionally, we conduct a comprehensive experimental evaluation, considering various instances of the problem and comparing our proposed algorithms with state-of-the-art methods. The results highlight the consistent capability of our algorithms to generate meaningful and interpretable global counterfactual explanations.

5/30/2024

cs.LG