Understanding Prediction Discrepancies in Machine Learning Classifiers

Read original: arXiv:2104.05467 - Published 8/1/2024 by Xavier Renard, Thibault Laugel, Marcin Detyniecki

🤔

Overview

Multiple machine learning models can be trained on the same data to achieve similar test performance, despite learning significantly different classification patterns.
This phenomenon, called "prediction discrepancies," often leads to the arbitrary selection of one model over another with similar performance.
The machine learning practitioner may not understand the differences between the models, their limitations, and where they agree or disagree.
The chosen model's classification pattern can have concrete consequences for instances in the discrepancy zone, potentially resulting in unfairness or missed opportunities.

Plain English Explanation

The paper explains a common issue in machine learning called prediction discrepancies. When training multiple models on the same data, the models can learn very different ways of making classifications, even though they all perform similarly well on test data.

This can lead to a blind selection of one model over another, since the practitioner may not understand the differences between the models or where they disagree. The chosen model's classification pattern could then have real-world consequences for instances that fall in the discrepancy zone - for example, leading to unfair decisions or missed opportunities.

The paper proposes an algorithm called DIG to analyze and explain these prediction discrepancies. This allows practitioners to make more informed decisions when selecting a model, by anticipating potential issues with each model's classification patterns.

Technical Explanation

The paper explores the phenomenon of "prediction discrepancies," where multiple machine learning models trained on the same data can achieve similar test-time performance while learning significantly different classification patterns. The authors propose a model-agnostic algorithm called DIG to capture and explain these discrepancies locally.

The DIG algorithm works by analyzing a pool of best-performing models to identify regions where their predictions diverge. It then explains these discrepancies by highlighting the key features that lead the models to make different decisions in those regions. This enables the practitioner to understand the differences between the models, their limitations, and where they agree or disagree.

The authors evaluate DIG on several real-world datasets and show that it can effectively identify and explain prediction discrepancies. They also demonstrate how this information can help practitioners make more informed model selection decisions and anticipate potential undesired consequences of their choices.

Critical Analysis

The paper provides a valuable approach for addressing the important problem of prediction discrepancies in machine learning. By enabling a deeper understanding of model differences, the DIG algorithm can help practitioners make more informed decisions and avoid the pitfalls of arbitrary model selection.

One potential limitation of the research is that it focuses on explaining discrepancies rather than resolving them. While the explanations can guide practitioners, the paper does not propose methods for directly reconciling or combining the divergent models. Further research could explore techniques for model reconciliation or ensemble learning to leverage the strengths of multiple models.

Additionally, the paper does not address the potential systemic biases or fairness issues that may arise from the different classification patterns learned by the models. Future work could investigate how the DIG algorithm's insights could be used to mitigate bias and ensure fairer outcomes.

Conclusion

This paper tackles the important problem of prediction discrepancies in machine learning, where multiple models trained on the same data can learn significantly different classification patterns despite similar test-time performance. The proposed DIG algorithm provides a way to capture and explain these discrepancies, empowering practitioners to make more informed model selection decisions and anticipate potential issues with their choices.

By shedding light on the "black box" of model differences, this research represents an important step towards increasing the transparency and trustworthiness of machine learning systems. Further work is needed to address the limitations and expand the applications of this approach, but the paper's insights can already help practitioners navigate the complex landscape of model selection and deployment.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🤔

Understanding Prediction Discrepancies in Machine Learning Classifiers

Xavier Renard, Thibault Laugel, Marcin Detyniecki

A multitude of classifiers can be trained on the same data to achieve similar performances during test time, while having learned significantly different classification patterns. This phenomenon, which we call prediction discrepancies, is often associated with the blind selection of one model instead of another with similar performances. When making a choice, the machine learning practitioner has no understanding on the differences between models, their limits, where they agree and where they don't. But his/her choice will result in concrete consequences for instances to be classified in the discrepancy zone, since the final decision will be based on the selected classification pattern. Besides the arbitrary nature of the result, a bad choice could have further negative consequences such as loss of opportunity or lack of fairness. This paper proposes to address this question by analyzing the prediction discrepancies in a pool of best-performing models trained on the same data. A model-agnostic algorithm, DIG, is proposed to capture and explain discrepancies locally, to enable the practitioner to make the best educated decision when selecting a model by anticipating its potential undesired consequences. All the code to reproduce the experiments is available.

8/1/2024

⚙️

The Disagreement Problem in Explainable Machine Learning: A Practitioner's Perspective

Satyapriya Krishna, Tessa Han, Alex Gu, Steven Wu, Shahin Jabbari, Himabindu Lakkaraju

As various post hoc explanation methods are increasingly being leveraged to explain complex models in high-stakes settings, it becomes critical to develop a deeper understanding of if and when the explanations output by these methods disagree with each other, and how such disagreements are resolved in practice. However, there is little to no research that provides answers to these critical questions. In this work, we introduce and study the disagreement problem in explainable machine learning. More specifically, we formalize the notion of disagreement between explanations, analyze how often such disagreements occur in practice, and how practitioners resolve these disagreements. We first conduct interviews with data scientists to understand what constitutes disagreement between explanations generated by different methods for the same model prediction and introduce a novel quantitative framework to formalize this understanding. We then leverage this framework to carry out a rigorous empirical analysis with four real-world datasets, six state-of-the-art post hoc explanation methods, and six different predictive models, to measure the extent of disagreement between the explanations generated by various popular explanation methods. In addition, we carry out an online user study with data scientists to understand how they resolve the aforementioned disagreements. Our results indicate that (1) state-of-the-art explanation methods often disagree in terms of the explanations they output, and (2) machine learning practitioners often employ ad hoc heuristics when resolving such disagreements. These findings suggest that practitioners may be relying on misleading explanations when making consequential decisions. They also underscore the importance of developing principled frameworks for effectively evaluating and comparing explanations output by various explanation techniques.

7/9/2024

Mind the Gap: A Causal Perspective on Bias Amplification in Prediction & Decision-Making

Drago Plecko, Elias Bareinboim

Investigating fairness and equity of automated systems has become a critical field of inquiry. Most of the literature in fair machine learning focuses on defining and achieving fairness criteria in the context of prediction, while not explicitly focusing on how these predictions may be used later on in the pipeline. For instance, if commonly used criteria, such as independence or sufficiency, are satisfied for a prediction score $S$ used for binary classification, they need not be satisfied after an application of a simple thresholding operation on $S$ (as commonly used in practice). In this paper, we take an important step to address this issue in numerous statistical and causal notions of fairness. We introduce the notion of a margin complement, which measures how much a prediction score $S$ changes due to a thresholding operation. We then demonstrate that the marginal difference in the optimal 0/1 predictor $widehat Y$ between groups, written $P(hat y mid x_1) - P(hat y mid x_0)$, can be causally decomposed into the influences of $X$ on the $L_2$-optimal prediction score $S$ and the influences of $X$ on the margin complement $M$, along different causal pathways (direct, indirect, spurious). We then show that under suitable causal assumptions, the influences of $X$ on the prediction score $S$ are equal to the influences of $X$ on the true outcome $Y$. This yields a new decomposition of the disparity in the predictor $widehat Y$ that allows us to disentangle causal differences inherited from the true outcome $Y$ that exists in the real world vs. those coming from the optimization procedure itself. This observation highlights the need for more regulatory oversight due to the potential for bias amplification, and to address this issue we introduce new notions of weak and strong business necessity, together with an algorithm for assessing whether these notions are satisfied.

5/27/2024

🔮

Variation in prediction accuracy due to randomness in data division and fair evaluation using interval estimation

Isao Goto

This paper attempts to answer a simple question in building predictive models using machine learning algorithms. Although diagnostic and predictive models for various diseases have been proposed using data from large cohort studies and machine learning algorithms, challenges remain in their generalizability. Several causes for this challenge have been pointed out, and partitioning of the dataset with randomness is considered to be one of them. In this study, we constructed 33,600 diabetes diagnosis models with initial state dependent randomness using autoML (automatic machine learning framework) and open diabetes data, and evaluated their prediction accuracy. The results showed that the prediction accuracy had an initial state-dependent distribution. Since this distribution could follow a normal distribution, we estimated the expected interval of prediction accuracy using statistical interval estimation in order to fairly compare the accuracy of the prediction models.

9/4/2024