CoGS: Causality Constrained Counterfactual Explanations using goal-directed ASP

Read original: arXiv:2407.08179 - Published 7/12/2024 by Sopam Dasgupta, Joaquín Arias, Elmer Salazar, Gopal Gupta

Overview

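• This paper presents CoGS (Counterfactual Generation with s(CASP)), an approach for generating causality-constrained counterfactual explanations for decisions made by machine learning models. According to the paper's abstract, CoGS generates counterfactuals from rule-based models, specifically those produced by the FOLD-SE algorithm.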

• CoGS uses goal-directed Answer Set Programming (ASP) to generate counterfactual explanations that satisfy causal constraints, making them more plausible and actionable.

• The research is supported by US NSF Grants IIS 1910131, US DoD, and industry grants.

Plain English Explanation

Counterfactual explanations are a way to understand how machine learning models make predictions. They show what would have to change in the input for the model to predict a different outcome. For example, a counterfactual explanation for a loan denial might show that if the applicant's income were $5,000 higher, they would have been approved.
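
The loan example can be made concrete with a small sketch. The decision rule, the 50,000 threshold, and the 1,000 step size are assumptions chosen for illustration; none of them come from the paper.

```python
def approve(applicant):
    """Toy stand-in for a model: approve when income reaches a threshold."""
    return applicant["income"] >= 50_000  # hypothetical threshold


def income_counterfactual(applicant, step=1_000, max_steps=100):
    """Return the closest applicant (by income raise, in multiples of `step`) that is approved."""
    for k in range(max_steps + 1):
        candidate = {**applicant, "income": applicant["income"] + k * step}
        if approve(candidate):
            return candidate
    return None  # no counterfactual found within the search budget


applicant = {"income": 45_000}
counterfactual = income_counterfactual(applicant)
raise_needed = counterfactual["income"] - applicant["income"]
```

Here the denied applicant earns 45,000 and the search stops at the first income that flips the decision, so `raise_needed` is the size of the change the explanation would report.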

However, existing counterfactual explanation methods can produce explanations that are not realistic or actionable, because they don't take into account the causal relationships between the input features. The CoGS method addresses this by using a technique called goal-directed Answer Set Programming (ASP) to generate counterfactual explanations that satisfy causal constraints.

This means the explanations generated by CoGS are more plausible and actionable, because they respect the underlying causal structure of the problem. For example, a counterfactual explanation for the loan denial might show that if the applicant's education level were higher, their income would also be higher as a consequence, and they would then be approved for the loan.

By incorporating causal reasoning, CoGS can produce counterfactual explanations that are more useful for users to understand and act upon the model's decisions.

Technical Explanation

The CoGS method builds on previous work on plausible counterfactual explanations and counterfactual explanations for black-box models.

CoGS uses goal-directed Answer Set Programming (ASP) to generate counterfactual explanations that satisfy causal constraints. ASP is a declarative programming paradigm that allows the researchers to specify the causal relationships between input features as logical rules.
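
To illustrate what treating causal relationships as logical rules buys, here is a minimal Python sketch. CoGS itself expresses such rules in the s(CASP) system; the code below only imitates the idea, and the feature names, thresholds, and both rules are hypothetical.

```python
# Hypothetical causal rules: whenever `condition` holds, `consequence` must hold too.
CAUSAL_RULES = [
    # Assumed rule: a graduate degree implies an income of at least 50,000.
    (lambda s: s["education"] == "graduate", lambda s: s["income"] >= 50_000),
    # Assumed rule: having no debt implies a credit score of at least 600.
    (lambda s: s["debt"] == 0, lambda s: s["credit_score"] >= 600),
]


def is_causally_consistent(state):
    """True when every rule whose condition fires also has its consequence satisfied."""
    return all(
        consequence(state)
        for condition, consequence in CAUSAL_RULES
        if condition(state)
    )


# Raising education without the implied income change is rejected as implausible.
implausible = {"education": "graduate", "income": 30_000, "debt": 5_000, "credit_score": 550}
plausible = {"education": "graduate", "income": 55_000, "debt": 5_000, "credit_score": 550}
```

A candidate counterfactual that violates a rule, such as `implausible`, would be discarded, while `plausible` passes because the income change accompanies the education change.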

The CoGS framework first learns a causal representation of the problem domain. It then uses this causal model along with the target machine learning model to generate counterfactual explanations that respect the causal constraints.

The key insight is that by incorporating causal reasoning, CoGS can produce counterfactual explanations that are more plausible and actionable, because they reflect the underlying causal structure of the problem. The researchers demonstrate the effectiveness of CoGS through experiments on several real-world datasets.
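
The paper's abstract describes CoGS as finding a path from an undesired outcome to a desired one. The sketch below illustrates that idea with a breadth-first search over interventions. It is not the authors' s(CASP) implementation, and the features, the decision rule, and the single causal rule are all assumptions.

```python
from collections import deque

# Hypothetical features a person can act on directly, with their possible values.
ACTIONS = {
    "education": ["highschool", "graduate"],
    "debt": ["yes", "no"],
}


def approve(state):
    """Toy rule-based decision: approve high earners without debt."""
    return state["income"] == "high" and state["debt"] == "no"


def propagate(state):
    """Apply the assumed causal rule: a graduate degree raises income to high."""
    if state["education"] == "graduate":
        return {**state, "income": "high"}
    return state


def find_path(start):
    """Breadth-first search for a shortest sequence of interventions reaching approval."""
    def key(s):
        return tuple(sorted(s.items()))

    queue = deque([(start, [])])
    seen = {key(start)}
    while queue:
        state, path = queue.popleft()
        if approve(state):
            return path
        for feature, values in ACTIONS.items():
            for value in values:
                if value == state[feature]:
                    continue  # not a change, so not an intervention
                nxt = propagate({**state, feature: value})
                if key(nxt) not in seen:
                    seen.add(key(nxt))
                    queue.append((nxt, path + [(feature, value)]))
    return None  # no causally consistent path to approval


start = {"education": "highschool", "income": "low", "debt": "yes"}
path = find_path(start)
```

In this toy setting income is never changed directly; it changes only as a causal effect of education, which is why the path found consists of an education intervention followed by paying off the debt.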

Critical Analysis

The authors acknowledge several limitations of the CoGS approach. First, the method relies on having accurate causal models, which may be difficult to obtain in practice. Second, the computational complexity of the goal-directed ASP reasoning can be high, especially for large or complex models.

Additionally, the paper does not explore the potential biases or fairness issues that may arise when using causal reasoning to generate counterfactual explanations. There may be cases where the causal model itself reflects societal biases, which could then be amplified in the counterfactual explanations.

Further research is needed to address these limitations and explore the broader implications of incorporating causal reasoning into counterfactual explanations. Nonetheless, the CoGS approach represents an important step forward in making counterfactual explanations more plausible and actionable for users.

Conclusion

The CoGS method presented in this paper offers a novel approach to generating counterfactual explanations for decisions made by machine learning models. By incorporating causal reasoning using goal-directed ASP, CoGS can produce counterfactual explanations that are more plausible and actionable than those of methods that ignore causal dependencies between features.

This work has important implications for improving the interpretability and transparency of complex machine learning models, which is crucial for building trust and accountability. The causal-aware counterfactual explanations generated by CoGS can help users better understand and act upon the decisions made by these models.

While the method has some limitations, the overall contribution of this research is significant and represents an important advancement in the field of explainable AI.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

CoGS: Causality Constrained Counterfactual Explanations using goal-directed ASP

Sopam Dasgupta, Joaquín Arias, Elmer Salazar, Gopal Gupta

Machine learning models are increasingly used in areas such as loan approvals and hiring, yet they often function as black boxes, obscuring their decision-making processes. Transparency is crucial, and individuals need explanations to understand decisions, especially for the ones not desired by the user. Ethical and legal considerations require informing individuals of changes in input attribute values (features) that could lead to a desired outcome for the user. Our work aims to generate counterfactual explanations by considering causal dependencies between features. We present the CoGS (Counterfactual Generation with s(CASP)) framework that utilizes the goal-directed Answer Set Programming system s(CASP) to generate counterfactuals from rule-based machine learning models, specifically the FOLD-SE algorithm. CoGS computes realistic and causally consistent changes to attribute values taking causal dependencies between them into account. It finds a path from an undesired outcome to a desired one using counterfactuals. We present details of the CoGS framework along with its evaluation.

7/12/2024

CFGs: Causality Constrained Counterfactual Explanations using goal-directed ASP

Sopam Dasgupta, Joaquín Arias, Elmer Salazar, Gopal Gupta

Machine learning models that automate decision-making are increasingly used in consequential areas such as loan approvals, pretrial bail approval, and hiring. Unfortunately, most of these models are black boxes, i.e., they are unable to reveal how they reach these prediction decisions. A need for transparency demands justification for such predictions. An affected individual might also desire explanations to understand why a decision was made. Ethical and legal considerations require informing the individual of changes in the input attribute(s) that could be made to produce a desirable outcome. Our work focuses on the latter problem of generating counterfactual explanations by considering the causal dependencies between features. In this paper, we present the framework CFGs, CounterFactual Generation with s(CASP), which utilizes the goal-directed Answer Set Programming (ASP) system s(CASP) to automatically generate counterfactual explanations from models generated by rule-based machine learning algorithms in particular. We benchmark CFGs with the FOLD-SE model. Reaching the counterfactual state from the initial state is planned and achieved using a series of interventions. To validate our proposal, we show how counterfactual explanations are computed and justified by imagining worlds where some or all factual assumptions are altered/changed. More importantly, we show how CFGs navigates between these worlds, namely, go from our initial state where we obtain an undesired outcome to the imagined goal state where we obtain the desired decision, taking into account the causal relationships among features.

5/28/2024

A multi-criteria approach for selecting an explanation from the set of counterfactuals produced by an ensemble of explainers

Ignacy Stępka, Mateusz Lango, Jerzy Stefanowski

Counterfactuals are widely used to explain ML model predictions by providing alternative scenarios for obtaining the more desired predictions. They can be generated by a variety of methods that optimize different, sometimes conflicting, quality measures and produce quite different solutions. However, choosing the most appropriate explanation method and one of the generated counterfactuals is not an easy task. Instead of forcing the user to test many different explanation methods and analysing conflicting solutions, in this paper, we propose to use a multi-stage ensemble approach that will select single counterfactual based on the multiple-criteria analysis. It offers a compromise solution that scores well on several popular quality measures. This approach exploits the dominance relation and the ideal point decision aid method, which selects one counterfactual from the Pareto front. The conducted experiments demonstrated that the proposed approach generates fully actionable counterfactuals with attractive compromise values of the considered quality measures.

8/6/2024

A Framework for Feasible Counterfactual Exploration incorporating Causality, Sparsity and Density

Kleopatra Markou, Dimitrios Tomaras, Vana Kalogeraki, Dimitrios Gunopulos

The imminent need to interpret the output of a Machine Learning model with counterfactual (CF) explanations - via small perturbations to the input - has been notable in the research community. Although the variety of CF examples is important, the aspect of them being feasible at the same time, does not necessarily apply in their entirety. This work uses different benchmark datasets to examine through the preservation of the logical causal relations of their attributes, whether CF examples can be generated after a small amount of changes to the original input, be feasible and actually useful to the end-user in a real-world case. To achieve this, we used a black box model as a classifier, to distinguish the desired from the input class and a Variational Autoencoder (VAE) to generate feasible CF examples. As an extension, we also extracted two-dimensional manifolds (one for each dataset) that located the majority of the feasible examples, a representation that adequately distinguished them from infeasible ones. For our experimentation we used three commonly used datasets and we managed to generate feasible and at the same time sparse, CF examples that satisfy all possible predefined causal constraints, by confirming their importance with the attributes in a dataset.

4/23/2024