Robust Ante-hoc Graph Explainer using Bilevel Optimization

Read original: arXiv:2305.15745 - Published 6/6/2024 by Kha-Dinh Luong, Mert Kosan, Arlei Lopes Da Silva, Ambuj Singh

Overview

Explaining the decisions made by machine learning models, especially for high-stakes applications, is crucial for increasing transparency and improving these decisions.
This is particularly important for models based on graph data, where decisions often depend on complex patterns combining structural and attribute information.
While recent research has focused on "post-hoc" explainers, the broader question of what makes a good explanation remains open.
One intuitive property is that explanations should be informative enough to reproduce the model's predictions given the data.
Post-hoc explainers often fail to achieve this, as their explanations depend heavily on fixed model parameters (for example, learned GNN weights).

Plain English Explanation

Machine learning models are increasingly being used to make important decisions, such as those related to healthcare, finance, or criminal justice. However, these "black box" models can be difficult to understand, making it hard to trust their decisions and know how to improve them.

This is especially true for models that work with graph data, which can represent complex relationships, like the connections between atoms in a molecule. The decisions these models make often depend on intricate patterns in the graph data, making them even harder to explain.

Researchers have tried to address this by developing "explainers", tools that break down the model's decision-making process. But these post-hoc explainers have a major limitation: they explain a model only after it has been trained, so their explanations rely heavily on the specific parameters the model ended up with.

To overcome this, the researchers propose a new type of explainer called RAGE (Robust Ante-hoc Graph Explainer). RAGE is designed to work with graph neural networks, a type of model well-suited for graph data. RAGE can identify the key substructures within a graph that contain all the information needed for the model to make a prediction. This allows users to understand the model's reasoning and suggest improvements.

The researchers tested RAGE on various tasks related to classifying molecules, and found that it outperformed existing post-hoc and ante-hoc explainers. This suggests RAGE could be a valuable tool for increasing transparency and trust in high-stakes machine learning applications involving graph data.

Technical Explanation

The researchers propose RAGE (Robust Ante-hoc Graph Explainer), a novel explainer designed to work with graph neural networks (GNNs). Unlike post-hoc explainers, which try to explain a model after it has been trained, RAGE is an ante-hoc explainer that learns its explanations jointly with the model during training.

The key idea behind RAGE is to formulate the task of finding explanations as a bilevel optimization problem. The upper-level objective aims to discover explanatory substructures that contain all the information needed to reproduce the model's predictions, while the lower-level objective trains the GNN model to make accurate predictions using only these substructures.
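As a rough guide, the two-level structure described above can be written in the generic bilevel form below. This is only a sketch of the general pattern, not the paper's exact objective: the symbols are assumptions introduced here, with \phi standing for the explainer parameters that select substructures, \theta for the GNN weights, and \mathcal{L}_{\text{upper}}, \mathcal{L}_{\text{lower}} for the two losses.

```latex
\begin{aligned}
\min_{\phi}\quad & \mathcal{L}_{\text{upper}}\bigl(\theta^{*}(\phi),\, \phi\bigr) \\
\text{s.t.}\quad & \theta^{*}(\phi) \in \arg\min_{\theta}\; \mathcal{L}_{\text{lower}}\bigl(\theta,\, \phi\bigr)
\end{aligned}
```

The constraint says that the GNN weights are always the ones obtained by training on the substructures chosen by \phi, so the upper level judges an explanation by how well a model trained on it can predict.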

By jointly optimizing these objectives, RAGE can identify the most relevant molecular substructures that drive the GNN's predictions. The researchers focused on the chemical domain, where the ability to explain GNN decisions is particularly important.
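To make the joint optimization more concrete, here is a minimal toy sketch of a bilevel loop in Python with NumPy. It is not RAGE itself and rests on several simplifying assumptions: plain feature vectors stand in for graphs, a soft feature mask stands in for the explanatory substructure, closed-form ridge regression stands in for GNN training, and finite differences stand in for whatever gradient scheme the paper actually uses. All function names (`fit_predictor`, `upper_loss`, `learn_mask`, `make_toy_data`) are hypothetical.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def fit_predictor(X, y, mask, ridge=10.0):
    """Lower level: closed-form ridge fit that sees only the masked inputs."""
    Xm = X * mask
    gram = Xm.T @ Xm + ridge * np.eye(X.shape[1])
    return np.linalg.solve(gram, Xm.T @ y)


def upper_loss(phi, train, val, sparsity=0.1):
    """Upper level: held-out error of the refit predictor plus a mask-size penalty."""
    (X_tr, y_tr), (X_val, y_val) = train, val
    mask = sigmoid(phi)
    w = fit_predictor(X_tr, y_tr, mask)  # the inner problem is re-solved for every mask
    err = (X_val * mask) @ w - y_val
    return float(np.mean(err ** 2) + sparsity * mask.sum())


def learn_mask(train, val, steps=150, lr=1.0, eps=1e-4):
    """Gradient descent on the mask logits, using central finite differences."""
    phi = np.zeros(train[0].shape[1])
    for _ in range(steps):
        grad = np.zeros_like(phi)
        for j in range(phi.size):
            bump = np.zeros_like(phi)
            bump[j] = eps
            hi = upper_loss(phi + bump, train, val)
            lo = upper_loss(phi - bump, train, val)
            grad[j] = (hi - lo) / (2 * eps)
        phi -= lr * grad
    return sigmoid(phi)


def make_toy_data(n=80, d=6, seed=0):
    """Synthetic data (assumption): only the first two features drive the target."""
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(2 * n, d))
    y = 2.0 * X[:, 0] - 2.0 * X[:, 1] + 0.1 * rng.normal(size=2 * n)
    return (X[:n], y[:n]), (X[n:], y[n:])


if __name__ == "__main__":
    train, val = make_toy_data()
    print(np.round(learn_mask(train, val), 2))
```

In this toy setting the learned mask should end up large on the two informative features and small on the rest, which mirrors the idea that a good explanation keeps just the parts of the input a retrained predictor needs. A real graph explainer would mask edges or nodes and would differentiate through GNN training rather than use finite differences.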

Experiments on various molecular classification tasks show that RAGE outperforms existing post-hoc and ante-hoc explainers, such as GNNExplainer and GLocalX. RAGE's explanations are more informative, as they can be used to effectively reproduce the model's predictions.

Critical Analysis

The researchers acknowledge that RAGE, like any explainer, has limitations. The bilevel optimization approach can be computationally intensive, and the discovered explanations may not always be intuitive or align with domain-specific knowledge. Additionally, the researchers note that RAGE's explanations are sensitive to the specific GNN architecture and training process, which could limit their generalizability.

Furthermore, the paper does not address potential issues with the quality or reliability of the data used to train the GNN models. If the training data contains biases or errors, these could be reflected in the explanations generated by RAGE, potentially leading to misleading or harmful conclusions.

It would also be valuable to see the researchers expand their evaluation beyond just molecular classification tasks, as the ability to explain GNN decisions is critical across a wide range of high-stakes applications, such as healthcare, finance, and criminal justice.

Overall, RAGE represents an important step forward in the quest for more transparent and trustworthy graph-based machine learning models. However, as with any technical solution, there are still open challenges and potential pitfalls that require further research and careful consideration.

Conclusion

The paper proposes RAGE, a novel ante-hoc explainer for graph neural networks that can identify the most relevant substructures driving a model's predictions. By formulating explanation discovery as a bilevel optimization problem, RAGE can generate informative explanations that can effectively reproduce the model's outputs.

This is a significant advancement over existing post-hoc explainers, which often fail to provide explanations that are sufficiently informative. RAGE's ability to explain GNN decisions is particularly valuable in high-stakes domains, such as chemistry, where transparency and trust are crucial.

While RAGE has limitations and further research is needed, this work represents an important contribution to the ongoing effort to make complex machine learning models more interpretable and accountable. As AI systems become increasingly integral to important decision-making processes, tools like RAGE will be essential for ensuring these systems are fair, reliable, and aligned with human values.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Robust Ante-hoc Graph Explainer using Bilevel Optimization

Kha-Dinh Luong, Mert Kosan, Arlei Lopes Da Silva, Ambuj Singh

Explaining the decisions made by machine learning models for high-stakes applications is critical for increasing transparency and guiding improvements to these decisions. This is particularly true in the case of models for graphs, where decisions often depend on complex patterns combining rich structural and attribute data. While recent work has focused on designing so-called post-hoc explainers, the broader question of what constitutes a good explanation remains open. One intuitive property is that explanations should be sufficiently informative to reproduce the predictions given the data. In other words, a good explainer can be repurposed as a predictor. Post-hoc explainers do not achieve this goal as their explanations are highly dependent on fixed model parameters (e.g., learned GNN weights). To address this challenge, we propose RAGE (Robust Ante-hoc Graph Explainer), a novel and flexible ante-hoc explainer designed to discover explanations for graph neural networks using bilevel optimization, with a focus on the chemical domain. RAGE can effectively identify molecular substructures that contain the full information needed for prediction while enabling users to rank these explanations in terms of relevance. Our experiments on various molecular classification tasks show that RAGE explanations are better than existing post-hoc and ante-hoc approaches.

6/6/2024

Explainable Graph Neural Networks Under Fire

Zhong Li, Simon Geisler, Yuhang Wang, Stephan Günnemann, Matthijs van Leeuwen

Predictions made by graph neural networks (GNNs) usually lack interpretability due to their complex computational behavior and the abstract nature of graphs. In an attempt to tackle this, many GNN explanation methods have emerged. Their goal is to explain a model's predictions and thereby obtain trust when GNN models are deployed in decision critical applications. Most GNN explanation methods work in a post-hoc manner and provide explanations in the form of a small subset of important edges and/or nodes. In this paper we demonstrate that these explanations can unfortunately not be trusted, as common GNN explanation methods turn out to be highly susceptible to adversarial perturbations. That is, even small perturbations of the original graph structure that preserve the model's predictions may yield drastically different explanations. This calls into question the trustworthiness and practical utility of post-hoc explanation methods for GNNs. To be able to attack GNN explanation models, we devise a novel attack method dubbed GXAttack, the first optimization-based adversarial attack method for post-hoc GNN explanations under such settings. Due to the devastating effectiveness of our attack, we call for an adversarial evaluation of future GNN explainers to demonstrate their robustness.

6/11/2024

From Model Explanation to Data Misinterpretation: Uncovering the Pitfalls of Post Hoc Explainers in Business Research

Ronilo Ragodos, Tong Wang, Lu Feng, Yu (Jeffrey) Hu

Machine learning models have been increasingly used in business research. However, most state-of-the-art machine learning models, such as deep neural networks and XGBoost, are black boxes in nature. Therefore, post hoc explainers that provide explanations for machine learning models by, for example, estimating numerical importance of the input features, have been gaining wide usage. Despite the intended use of post hoc explainers being explaining machine learning models, we found a growing trend in business research where post hoc explanations are used to draw inferences about the data. In this work, we investigate the validity of such use. Specifically, we investigate with extensive experiments whether the explanations obtained by the two most popular post hoc explainers, SHAP and LIME, provide correct information about the true marginal effects of X on Y in the data, which we call data-alignment. We then identify what factors influence the alignment of explanations. Finally, we propose a set of mitigation strategies to improve the data-alignment of explanations and demonstrate their effectiveness with real-world data in an econometric context. In spite of this effort, we nevertheless conclude that it is often not appropriate to infer data insights from post hoc explanations. We articulate appropriate alternative uses, the most important of which is to facilitate the proposition and subsequent empirical investigation of hypotheses. The ultimate goal of this paper is to caution business researchers against translating post hoc explanations of machine learning models into potentially false insights and understanding of data.

9/2/2024

GNNAnatomy: Systematic Generation and Evaluation of Multi-Level Explanations for Graph Neural Networks

Hsiao-Ying Lu, Yiran Li, Ujwal Pratap Krishna Kaluvakolanu Thyagarajan, Kwan-Liu Ma

Graph Neural Networks (GNNs) have proven highly effective in various machine learning (ML) tasks involving graphs, such as node/graph classification and link prediction. However, explaining the decisions made by GNNs poses challenges because of the aggregated relational information based on graph structure, leading to complex data transformations. Existing methods for explaining GNNs often face limitations in systematically exploring diverse substructures and evaluating results in the absence of ground truths. To address this gap, we introduce GNNAnatomy, a model- and dataset-agnostic visual analytics system designed to facilitate the generation and evaluation of multi-level explanations for GNNs. In GNNAnatomy, we employ graphlets to elucidate GNN behavior in graph-level classification tasks. By analyzing the associations between GNN classifications and graphlet frequencies, we formulate hypothesized factual and counterfactual explanations. To validate a hypothesized graphlet explanation, we introduce two metrics: (1) the correlation between its frequency and the classification confidence, and (2) the change in classification confidence after removing this substructure from the original graph. To demonstrate the effectiveness of GNNAnatomy, we conduct case studies on both real-world and synthetic graph datasets from various domains. Additionally, we qualitatively compare GNNAnatomy with a state-of-the-art GNN explainer, demonstrating the utility and versatility of our design.

6/10/2024