On the tractability of SHAP explanations under Markovian distributions

Read original: arXiv:2405.02936 - Published 5/28/2024 by Reda Marzouk, Colin de La Higuera

📉

Overview

The paper investigates the computational complexity of the Shapley Additive Explanations (SHAP) framework, a widely used method for explaining the predictions of machine learning models.
SHAP is known to be computationally challenging, with its exact computation being NP-Hard in various configurations.
The paper relaxes the assumption of feature independence, which is often simplistic in real-world scenarios, and introduces a Markovian perspective to the problem.

Plain English Explanation

The SHAP framework is a popular way to explain how machine learning models make their predictions. It assigns a "SHAP score" to each feature, showing how much that feature contributed to the final prediction. Despite its widespread use, calculating these SHAP scores can be very difficult, and has been proven to be a computationally complex problem.

Previous research has found that for certain types of machine learning models, like decision trees and random forests, the SHAP scores can be computed efficiently, as long as the features in the model are independent of each other. However, in real-world scenarios, this assumption of feature independence is often an oversimplification.

In this paper, the researchers investigate what happens when we relax this assumption of feature independence. They introduce a "Markovian" perspective, which allows for some level of dependence between the features. Surprisingly, they find that under this Markovian assumption, the SHAP scores can be computed efficiently for certain classes of machine learning models, like weighted automata, disjoint DNFs, and decision trees. This is the first positive complexity result for SHAP score computation that goes beyond the limitations of the feature independence assumption.

Technical Explanation

The paper focuses on the computational complexity of calculating the SHAP score, a widely used framework for explaining the predictions of machine learning models. The SHAP score quantifies the contribution of each feature to the final prediction of a model.

Previous work has shown that the exact computation of the SHAP score is NP-Hard in various configurations. However, recent research has unveiled positive complexity results for specific model families, such as decision trees, random forests, and some classes of boolean circuits, under the assumption of feature independence.

The key contribution of this paper is to relax the feature independence assumption, which is often overly simplistic in real-world scenarios, and introduce a Markovian perspective to the problem. The authors show that under the Markovian assumption, the SHAP score can be computed in polynomial time for the classes of weighted automata, disjoint DNFs, and decision trees. This is the first positive complexity result for SHAP score computation that transcends the limitations of the feature independence assumption.

Critical Analysis

The paper provides an important step forward in understanding the computational complexity of the SHAP framework, a widely used technique for explaining the predictions of machine learning models. By relaxing the assumption of feature independence, which is often an oversimplification in real-world scenarios, the authors have shown that efficient computation of SHAP scores is possible for certain classes of models under a Markovian assumption.

However, it is important to note that the Markovian assumption may still be restrictive in some domains, and there may be cases where even this assumption does not hold. Additionally, the paper focuses on specific model classes, and the complexity results may not generalize to more complex or diverse model families.

Further research is needed to understand the computational complexity of SHAP score computation in more general settings, potentially exploring alternative explanatory frameworks or approximation techniques that can handle a wider range of machine learning models and real-world data structures. Efforts to correct SHAP scores and develop alternative explanation methods could also be valuable in this context.

Conclusion

This paper makes an important contribution to the field of interpretable machine learning by investigating the computational complexity of the SHAP framework under more realistic assumptions. By relaxing the feature independence assumption and introducing a Markovian perspective, the authors have shown that efficient SHAP score computation is possible for certain model classes, offering a significant advancement beyond previous complexity results.

This work has implications for the practical application of SHAP and other explanation methods, as it suggests that it may be possible to explain the predictions of a broader range of machine learning models in a computationally tractable manner. As the demand for interpretable AI systems continues to grow, research like this will be crucial in developing scalable and effective explanation techniques for complex real-world problems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

📉

On the tractability of SHAP explanations under Markovian distributions

Reda Marzouk, Colin de La Higuera

Thanks to its solid theoretical foundation, the SHAP framework is arguably one the most widely utilized frameworks for local explainability of ML models. Despite its popularity, its exact computation is known to be very challenging, proven to be NP-Hard in various configurations. Recent works have unveiled positive complexity results regarding the computation of the SHAP score for specific model families, encompassing decision trees, random forests, and some classes of boolean circuits. Yet, all these positive results hinge on the assumption of feature independence, often simplistic in real-world scenarios. In this article, we investigate the computational complexity of the SHAP score by relaxing this assumption and introducing a Markovian perspective. We show that, under the Markovian assumption, computing the SHAP score for the class of Weighted automata, Disjoint DNFs and Decision Trees can be performed in polynomial time, offering a first positive complexity result for the problem of SHAP score computation that transcends the limitations of the feature independence assumption.

5/28/2024

The Distributional Uncertainty of the SHAP score in Explainable Machine Learning

Santiago Cifuentes, Leopoldo Bertossi, Nina Pardal, Sergio Abriola, Maria Vanina Martinez, Miguel Romero

Attribution scores reflect how important the feature values in an input entity are for the output of a machine learning model. One of the most popular attribution scores is the SHAP score, which is an instantiation of the general Shapley value used in coalition game theory. The definition of this score relies on a probability distribution on the entity population. Since the exact distribution is generally unknown, it needs to be assigned subjectively or be estimated from data, which may lead to misleading feature scores. In this paper, we propose a principled framework for reasoning on SHAP scores under unknown entity population distributions. In our framework, we consider an uncertainty region that contains the potential distributions, and the SHAP score of a feature becomes a function defined over this region. We study the basic problems of finding maxima and minima of this function, which allows us to determine tight ranges for the SHAP scores of all features. In particular, we pinpoint the complexity of these problems, and other related ones, showing them to be NP-complete. Finally, we present experiments on a real-world dataset, showing that our framework may contribute to a more robust feature scoring.

8/14/2024

🤔

Succinct Interaction-Aware Explanations

Sascha Xu, Joscha Cuppers, Jilles Vreeken

SHAP is a popular approach to explain black-box models by revealing the importance of individual features. As it ignores feature interactions, SHAP explanations can be confusing up to misleading. NSHAP, on the other hand, reports the additive importance for all subsets of features. While this does include all interacting sets of features, it also leads to an exponentially sized, difficult to interpret explanation. In this paper, we propose to combine the best of these two worlds, by partitioning the features into parts that significantly interact, and use these parts to compose a succinct, interpretable, additive explanation. We derive a criterion by which to measure the representativeness of such a partition for a models behavior, traded off against the complexity of the resulting explanation. To efficiently find the best partition out of super-exponentially many, we show how to prune sub-optimal solutions using a statistical test, which not only improves runtime but also helps to detect spurious interactions. Experiments on synthetic and real world data show that our explanations are both more accurate resp. more easily interpretable than those of SHAP and NSHAP.

4/22/2024

Shaping Up SHAP: Enhancing Stability through Layer-Wise Neighbor Selection

Gwladys Kelodjou, Laurence Roz'e, V'eronique Masson, Luis Gal'arraga, Romaric Gaudel, Maurice Tchuente, Alexandre Termier

Machine learning techniques, such as deep learning and ensemble methods, are widely used in various domains due to their ability to handle complex real-world tasks. However, their black-box nature has raised multiple concerns about the fairness, trustworthiness, and transparency of computer-assisted decision-making. This has led to the emergence of local post-hoc explainability methods, which offer explanations for individual decisions made by black-box algorithms. Among these methods, Kernel SHAP is widely used due to its model-agnostic nature and its well-founded theoretical framework. Despite these strengths, Kernel SHAP suffers from high instability: different executions of the method with the same inputs can lead to significantly different explanations, which diminishes the relevance of the explanations. The contribution of this paper is two-fold. On the one hand, we show that Kernel SHAP's instability is caused by its stochastic neighbor selection procedure, which we adapt to achieve full stability without compromising explanation fidelity. On the other hand, we show that by restricting the neighbors generation to perturbations of size 1 -- which we call the coalitions of Layer 1 -- we obtain a novel feature-attribution method that is fully stable, computationally efficient, and still meaningful.

6/18/2024