The Distributional Uncertainty of the SHAP score in Explainable Machine Learning

Read original: arXiv:2401.12731 - Published 8/14/2024 by Santiago Cifuentes, Leopoldo Bertossi, Nina Pardal, Sergio Abriola, Maria Vanina Martinez, Miguel Romero

The Distributional Uncertainty of the SHAP score in Explainable Machine Learning

Overview

The paper explores the distributional uncertainty of the SHAP score, a popular method for explaining the predictions of machine learning models.
It examines how the SHAP score can be affected by the choice of background data used in its computation.
The authors propose a new method, called Distributional SHAP, to address this issue and provide more robust feature importance estimates.

Plain English Explanation

The SHAP score is a way of understanding how much each feature of a machine learning model contributes to its predictions. It does this by calculating the change in the model's output when a feature is removed or changed.

However, the authors of this paper found that the SHAP score can be quite uncertain, meaning that it can give different results depending on the data used to compute it. This is because the SHAP score relies on using a "background" dataset to represent the distribution of possible feature values.

To address this issue, the authors propose a new method called Distributional SHAP. This method takes into account the full distribution of possible feature values, rather than just a single background dataset. This makes the SHAP score more robust and less dependent on the choice of background data.

The authors demonstrate the effectiveness of Distributional SHAP through experiments on several machine learning models and datasets. They show that it can provide more reliable and consistent feature importance estimates compared to the standard SHAP score.

Technical Explanation

The paper begins by discussing the related work on interpreting machine learning models, including the SHAP (Shapley Additive Explanations) method. SHAP is a powerful technique for computing feature importances, but the authors note that it can be sensitive to the choice of "background" data used in its computation.

To address this issue, the authors propose a new method called Distributional SHAP. This approach takes into account the full distribution of possible feature values, rather than just a single background dataset. By considering the uncertainty in the feature distributions, Distributional SHAP can provide more robust and reliable feature importance estimates.

The authors evaluate their method on several machine learning models and datasets, including regression and classification tasks. They compare the performance of Distributional SHAP to the standard SHAP score, as well as other feature importance techniques. The results show that Distributional SHAP can outperform the standard SHAP score, especially in cases where the feature distributions are complex or the background data is not representative of the true distribution.

Critical Analysis

The paper makes a valuable contribution by highlighting an important limitation of the SHAP score – its sensitivity to the choice of background data. The authors' proposed Distributional SHAP method is a promising solution that addresses this issue, and the experimental results demonstrate its effectiveness.

However, the paper does not discuss the computational complexity of Distributional SHAP, which could be a concern for large-scale or high-dimensional problems. Additionally, the authors only evaluate their method on a limited set of datasets and models, so further validation on a wider range of applications would be beneficial.

It would also be interesting to see the authors explore the potential trade-offs between the increased robustness of Distributional SHAP and its interpretability, as the inclusion of the full feature distributions may make the explanations less intuitive or easy to understand for human users.

Conclusion

This paper presents an important advancement in the field of explainable machine learning by addressing a key limitation of the SHAP score. The authors' Distributional SHAP method provides a more robust and reliable way of computing feature importances, which can be crucial for understanding and debugging complex machine learning models.

The proposed approach has the potential to significantly improve the trustworthiness and practical utility of feature attribution techniques, making them more suitable for high-stakes applications where model interpretability is a critical requirement. Further research and development in this direction could lead to even more powerful and versatile explainability tools for the machine learning community.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

The Distributional Uncertainty of the SHAP score in Explainable Machine Learning

Santiago Cifuentes, Leopoldo Bertossi, Nina Pardal, Sergio Abriola, Maria Vanina Martinez, Miguel Romero

Attribution scores reflect how important the feature values in an input entity are for the output of a machine learning model. One of the most popular attribution scores is the SHAP score, which is an instantiation of the general Shapley value used in coalition game theory. The definition of this score relies on a probability distribution on the entity population. Since the exact distribution is generally unknown, it needs to be assigned subjectively or be estimated from data, which may lead to misleading feature scores. In this paper, we propose a principled framework for reasoning on SHAP scores under unknown entity population distributions. In our framework, we consider an uncertainty region that contains the potential distributions, and the SHAP score of a feature becomes a function defined over this region. We study the basic problems of finding maxima and minima of this function, which allows us to determine tight ranges for the SHAP scores of all features. In particular, we pinpoint the complexity of these problems, and other related ones, showing them to be NP-complete. Finally, we present experiments on a real-world dataset, showing that our framework may contribute to a more robust feature scoring.

8/14/2024

🏋️

On Correcting SHAP Scores

Olivier Letoffe, Xuanxiang Huang, Joao Marques-Silva

Recent work uncovered examples of classifiers for which SHAP scores yield misleading feature attributions. While such examples might be perceived as suggesting the inadequacy of Shapley values for explainability, this paper shows that the source of the identified shortcomings of SHAP scores resides elsewhere. Concretely, the paper makes the case that the failings of SHAP scores result from the characteristic functions used in earlier works. Furthermore, the paper identifies a number of properties that characteristic functions ought to respect, and proposes several novel characteristic functions, each exhibiting one or more of the desired properties. More importantly, some of the characteristic functions proposed in this paper are guaranteed not to exhibit any of the shortcomings uncovered by earlier work. The paper also investigates the impact of the new characteristic functions on the complexity of computing SHAP scores. Finally, the paper proposes modifications to the tool SHAP to use instead one of our novel characteristic functions, thereby eliminating some of the limitations reported for SHAP scores.

5/2/2024

📉

On the tractability of SHAP explanations under Markovian distributions

Reda Marzouk, Colin de La Higuera

Thanks to its solid theoretical foundation, the SHAP framework is arguably one the most widely utilized frameworks for local explainability of ML models. Despite its popularity, its exact computation is known to be very challenging, proven to be NP-Hard in various configurations. Recent works have unveiled positive complexity results regarding the computation of the SHAP score for specific model families, encompassing decision trees, random forests, and some classes of boolean circuits. Yet, all these positive results hinge on the assumption of feature independence, often simplistic in real-world scenarios. In this article, we investigate the computational complexity of the SHAP score by relaxing this assumption and introducing a Markovian perspective. We show that, under the Markovian assumption, computing the SHAP score for the class of Weighted automata, Disjoint DNFs and Decision Trees can be performed in polynomial time, offering a first positive complexity result for the problem of SHAP score computation that transcends the limitations of the feature independence assumption.

5/28/2024

✨

From SHAP Scores to Feature Importance Scores

Olivier Letoffe, Xuanxiang Huang, Nicholas Asher, Joao Marques-Silva

A central goal of eXplainable Artificial Intelligence (XAI) is to assign relative importance to the features of a Machine Learning (ML) model given some prediction. The importance of this task of explainability by feature attribution is illustrated by the ubiquitous recent use of tools such as SHAP and LIME. Unfortunately, the exact computation of feature attributions, using the game-theoretical foundation underlying SHAP and LIME, can yield manifestly unsatisfactory results, that tantamount to reporting misleading relative feature importance. Recent work targeted rigorous feature attribution, by studying axiomatic aggregations of features based on logic-based definitions of explanations by feature selection. This paper shows that there is an essential relationship between feature attribution and a priori voting power, and that those recently proposed axiomatic aggregations represent a few instantiations of the range of power indices studied in the past. Furthermore, it remains unclear how some of the most widely used power indices might be exploited as feature importance scores (FISs), i.e. the use of power indices in XAI, and which of these indices would be the best suited for the purposes of XAI by feature attribution, namely in terms of not producing results that could be deemed as unsatisfactory. This paper proposes novel desirable properties that FISs should exhibit. In addition, the paper also proposes novel FISs exhibiting the proposed properties. Finally, the paper conducts a rigorous analysis of the best-known power indices in terms of the proposed properties.

5/21/2024