Black Box Model Explanations and the Human Interpretability Expectations -- An Analysis in the Context of Homicide Prediction

Read original: arXiv:2210.10849 - Published 7/8/2024 by Jos'e Ribeiro, N'ikolas Carneiro, Ronnie Alves

📈

Overview

Strategies based on Explainable Artificial Intelligence (XAI) can make the results of complex "black box" models more interpretable for humans.
This opens up questions about whether the explanations generated by XAI methods actually meet human expectations.
Various XAI methods (Ciu, Dalex, Eli5, Lofo, Shap, and Skater) provide different types of explanations, like rankings of feature importance.
This research examines the explanations generated by XAI methods and compares them to the expectations of human experts, using a real-world homicide prediction problem as a case study.

Plain English Explanation

Artificial intelligence (AI) models can sometimes be like "black boxes" - their inner workings are complex and difficult for humans to understand. Explainable AI (XAI) methods aim to make these models more interpretable by providing explanations for their results.

However, it's not clear whether the explanations generated by XAI methods actually match what human experts expect or want to see. This research dives into that question, using a real-world problem of predicting homicides as a case study.

The researchers took an existing AI model for homicide prediction and applied six different XAI methods to generate explanations. They then compared those explanations to the expectations of six human experts. The goal was to see how well the XAI explanations aligned with what the human experts thought was important.

The results showed that even though the underlying AI model was complex, about 75% of the human experts' expectations were met by the XAI explanations. There was also around 48% agreement between the XAI methods and the human experts on what the key factors were.

This suggests that XAI can be a useful tool for making AI models more understandable, but there's still work to be done to fully align the explanations with human intuition and needs. Continued research in this area could help improve the task performance of human-AI teams.

Technical Explanation

This research set out to investigate whether the explanations generated by various Explainable AI (XAI) methods meet the expectations of human experts. To do this, they:

Replicated an existing black box model for a real-world homicide prediction problem that had been peer-validated.
Applied six different XAI methods (Ciu, Dalex, Eli5, Lofo, Shap, and Skater) to generate explanations for the model's outputs.
Gathered input from six human experts on their expectations for how the model should be explained.
Analyzed the correlations, comparative analyses, and relationships between the feature rankings produced by the XAI methods and the human experts.

The results showed that even though the underlying model was difficult to explain, 75% of the human experts' expectations were met by the XAI explanations. There was also around 48% agreement between the XAI methods and the human experts on the relative importance of different features.

This suggests that current XAI methods can provide meaningful explanations that align reasonably well with human intuition, at least for this particular problem domain. However, the researchers note that there is still room for improvement, as the agreement was not perfect.

Further research could explore ways to make XAI explanations even more reliable, stable, and tailored to human needs. There are also potential privacy implications of XAI that would need to be considered.

Critical Analysis

The researchers make a solid effort to rigorously evaluate the alignment between XAI explanations and human expectations for a real-world problem. The use of multiple XAI methods and expert perspectives adds breadth and depth to the analysis.

However, a few limitations and potential issues are worth noting:

The study focused on a single problem domain (homicide prediction), so the findings may not generalize to other types of AI models and applications.
The sample size of human experts (6) is relatively small, and their backgrounds/expertise levels were not fully detailed. A larger and more diverse set of experts could provide more robust insights.
The researchers did not delve into the specific reasons why there was sometimes a mismatch between XAI explanations and human expectations. More qualitative analysis in this area could shed light on how to improve the alignment.
The paper does not address potential biases or blindspots in the human experts' own perspectives and intuitions. These could also be factors influencing the level of agreement.

Overall, this research represents a valuable contribution to understanding the current state of XAI and its alignment with human understanding. But there is still much work to be done to fully realize the potential of explainable AI and integrate it effectively with human decision-making.

Conclusion

This research explored whether the explanations generated by popular Explainable AI (XAI) methods align with the expectations of human experts, using a real-world problem of homicide prediction as a case study.

The results showed that even for a complex "black box" AI model, the XAI explanations were able to meet around 75% of the human experts' expectations. There was also a reasonable level of agreement (around 48%) between the XAI methods and the human experts on the relative importance of different factors.

This suggests that current XAI techniques can provide meaningful insights that resonate with human intuition, at least in this particular domain. However, there is still room for improvement to ensure XAI explanations are fully reliable, stable, and tailored to the needs of human users.

Continued research in this area could yield important advances in human-AI collaboration and the responsible development of explainable AI systems that build trust and transparency.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

📈

Black Box Model Explanations and the Human Interpretability Expectations -- An Analysis in the Context of Homicide Prediction

Jos'e Ribeiro, N'ikolas Carneiro, Ronnie Alves

Strategies based on Explainable Artificial Intelligence (XAI) have promoted better human interpretability of the results of black box models. This opens up the possibility of questioning whether explanations created by XAI methods meet human expectations. The XAI methods being currently used (Ciu, Dalex, Eli5, Lofo, Shap, and Skater) provide various forms of explanations, including global rankings of relevance of features, which allow for an overview of how the model is explained as a result of its inputs and outputs. These methods provide for an increase in the explainability of the model and a greater interpretability grounded on the context of the problem. Intending to shed light on the explanations generated by XAI methods and their interpretations, this research addresses a real-world classification problem related to homicide prediction, already peer-validated, replicated its proposed black box model and used 6 different XAI methods to generate explanations and 6 different human experts. The results were generated through calculations of correlations, comparative analysis and identification of relationships between all ranks of features produced. It was found that even though it is a model that is difficult to explain, 75% of the expectations of human experts were met, with approximately 48% agreement between results from XAI methods and human experts. The results allow for answering questions such as: Are the Expectation of Interpretation generated among different human experts similar?, Do the different XAI methods generate similar explanations for the proposed problem?, Can explanations generated by XAI methods meet human expectation of Interpretations?, and Can Explanations and Expectations of Interpretation work together?.

7/8/2024

The future of human-centric eXplainable Artificial Intelligence (XAI) is not post-hoc explanations

Vinitra Swamy, Jibril Frej, Tanja Kaser

Explainable Artificial Intelligence (XAI) plays a crucial role in enabling human understanding and trust in deep learning systems. As models get larger, more ubiquitous, and pervasive in aspects of daily life, explainability is necessary to minimize adverse effects of model mistakes. Unfortunately, current approaches in human-centric XAI (e.g. predictive tasks in healthcare, education, or personalized ads) tend to rely on a single post-hoc explainer, whereas recent work has identified systematic disagreement between post-hoc explainers when applied to the same instances of underlying black-box models. In this paper, we therefore present a call for action to address the limitations of current state-of-the-art explainers. We propose a shift from post-hoc explainability to designing interpretable neural network architectures. We identify five needs of human-centric XAI (real-time, accurate, actionable, human-interpretable, and consistent) and propose two schemes for interpretable-by-design neural network workflows (adaptive routing with InterpretCC and temporal diagnostics with I2MD). We postulate that the future of human-centric XAI is neither in explaining black-boxes nor in reverting to traditional, interpretable models, but in neural networks that are intrinsically interpretable.

5/29/2024

How Reliable and Stable are Explanations of XAI Methods?

Jos'e Ribeiro, Lucas Cardoso, Vitor Santos, Eduardo Carvalho, N'ikolas Carneiro, Ronnie Alves

Black box models are increasingly being used in the daily lives of human beings living in society. Along with this increase, there has been the emergence of Explainable Artificial Intelligence (XAI) methods aimed at generating additional explanations regarding how the model makes certain predictions. In this sense, methods such as Dalex, Eli5, eXirt, Lofo and Shap emerged as different proposals and methodologies for generating explanations of black box models in an agnostic way. Along with the emergence of these methods, questions arise such as How Reliable and Stable are XAI Methods?. With the aim of shedding light on this main question, this research creates a pipeline that performs experiments using the diabetes dataset and four different machine learning models (LGBM, MLP, DT and KNN), creating different levels of perturbations of the test data and finally generates explanations from the eXirt method regarding the confidence of the models and also feature relevances ranks from all XAI methods mentioned, in order to measure their stability in the face of perturbations. As a result, it was found that eXirt was able to identify the most reliable models among all those used. It was also found that current XAI methods are sensitive to perturbations, with the exception of one specific method.

7/4/2024

Explainable AI improves task performance in human-AI collaboration

Julian Senoner, Simon Schallmoser, Bernhard Kratzwald, Stefan Feuerriegel, Torbj{o}rn Netland

Artificial intelligence (AI) provides considerable opportunities to assist human work. However, one crucial challenge of human-AI collaboration is that many AI algorithms operate in a black-box manner where the way how the AI makes predictions remains opaque. This makes it difficult for humans to validate a prediction made by AI against their own domain knowledge. For this reason, we hypothesize that augmenting humans with explainable AI as a decision aid improves task performance in human-AI collaboration. To test this hypothesis, we analyze the effect of augmenting domain experts with explainable AI in the form of visual heatmaps. We then compare participants that were either supported by (a) black-box AI or (b) explainable AI, where the latter supports them to follow AI predictions when the AI is accurate or overrule the AI when the AI predictions are wrong. We conducted two preregistered experiments with representative, real-world visual inspection tasks from manufacturing and medicine. The first experiment was conducted with factory workers from an electronics factory, who performed $N=9,600$ assessments of whether electronic products have defects. The second experiment was conducted with radiologists, who performed $N=5,650$ assessments of chest X-ray images to identify lung lesions. The results of our experiments with domain experts performing real-world tasks show that task performance improves when participants are supported by explainable AI instead of black-box AI. For example, in the manufacturing setting, we find that augmenting participants with explainable AI (as opposed to black-box AI) leads to a five-fold decrease in the median error rate of human decisions, which gives a significant improvement in task performance.

6/13/2024