PCNN: Probable-Class Nearest-Neighbor Explanations Improve Fine-Grained Image Classification Accuracy for AIs and Humans

2308.13651

Published 4/23/2024 by Giang Nguyen, Valerie Chen, Mohammad Reza Taesiri, Anh Nguyen

Abstract

Nearest neighbors (NN) are traditionally used to compute final decisions, e.g., in Support Vector Machines or k-NN classifiers, and to provide users with explanations for the model's decision. In this paper, we show a novel utility of nearest neighbors: To improve predictions of a frozen, pretrained classifier C. We leverage an image comparator S that (1) compares the input image with NN images from the top-K most probable classes; and (2) uses S's output scores to weight the confidence scores of C. Our method consistently improves fine-grained image classification accuracy on CUB-200, Cars-196, and Dogs-120. Also, a human study finds that showing lay users our probable-class nearest neighbors (PCNN) improves their decision accuracy over prior work which shows only the top-1 class examples.

Overview

This paper explores a novel use of nearest neighbors (NN) to improve the predictions of a pre-trained image classifier.
The proposed method compares the input image to NN images from the top-K most probable classes, and uses the comparison scores to adjust the confidence scores of the classifier.
The method consistently improves fine-grained image classification accuracy on several datasets, and a human study finds it helps lay users make more accurate decisions compared to prior work.

Plain English Explanation

In machine learning, nearest neighbors (NN) are often used to make final decisions or provide explanations for a model's predictions. This paper introduces a new way to use NN to actually improve the predictions of an existing, pre-trained image classifier.

The key idea is to take the input image and compare it to the nearest neighbor images from the top-K classes that the classifier thinks are most likely. The results of this comparison are then used to adjust the confidence scores that the classifier gives to each possible class. This helps the classifier make more accurate predictions, especially for fine-grained classification tasks like telling different breeds of dogs apart.
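The re-weighting idea can be illustrated with a few hand-picked numbers. This is only a sketch: the values below are invented for illustration, and combining the two scores by elementwise multiplication is an assumption about how the comparator's output "weights" the classifier's confidence.

```python
import numpy as np

# Illustrative numbers only; they are not taken from the paper.
classifier_conf = np.array([0.50, 0.30, 0.20])   # C's confidence for its top-3 classes
comparator_score = np.array([0.20, 0.90, 0.40])  # S's similarity to each class's nearest neighbor

# Assumed combination rule: elementwise product of the two scores.
reweighted = classifier_conf * comparator_score

before = int(np.argmax(classifier_conf))  # class picked by C alone
after = int(np.argmax(reweighted))        # class picked after re-weighting
print(before, after, reweighted)
```

In this toy case the classifier alone prefers the first class, but the second class wins after re-weighting because its nearest neighbor looks much more like the input.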

The authors show that this approach consistently improves accuracy on several challenging image datasets. They also found that showing users the NN images from the top-K most probable classes helps lay people make more accurate decisions, compared to showing examples from only the top-1 class as in prior work.

Technical Explanation

The paper proposes a novel technique to leverage nearest neighbors (NN) to improve the predictions of a pre-trained image classifier C. The method works as follows:

For a given input image, the classifier C outputs a set of confidence scores, one for each possible class.
The method then compares the input image to the NN images from the top-K classes that C deemed most probable.
The output of this image comparator S is used to re-weight the confidence scores from C. Classes whose NN images are more similar to the input get a boost in confidence, while classes with less similar NNs get their scores reduced.
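The three steps above can be sketched end to end as follows. This is a minimal sketch under stated assumptions, not the authors' implementation: the function `pcnn_rerank` is a hypothetical name, nearest neighbors are retrieved by cosine similarity over precomputed feature vectors, the comparator S is replaced by a rescaled cosine similarity (a stand-in for whatever comparator the paper actually uses), and the scores are combined by multiplication. The toy features are constructed by hand and are not real data.

```python
import numpy as np

def l2_normalize(x):
    """Scale vectors to unit length along the last axis."""
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

def pcnn_rerank(query_feat, class_probs, train_feats, train_labels, k=3):
    """Hypothetical sketch: re-rank the top-k classes of a frozen classifier.

    Assumes every top-k class has at least one training feature.
    """
    q = l2_normalize(query_feat)
    # Step 1: take the k classes the classifier C considers most probable.
    top_classes = np.argsort(class_probs)[::-1][:k]
    scores = {}
    for c in top_classes:
        # Step 2: find the nearest training example of class c (cosine similarity).
        feats_c = l2_normalize(train_feats[train_labels == c])
        nn_sim = float((feats_c @ q).max())
        # Stand-in for the comparator S: map cosine from [-1, 1] to [0, 1].
        s = (nn_sim + 1.0) / 2.0
        # Step 3: weight C's confidence by the comparator score (assumed product).
        scores[int(c)] = float(class_probs[c]) * s
    best = max(scores, key=scores.get)
    return best, scores

# Toy setup: 5 classes with orthogonal feature directions, 2 examples each.
centers = np.eye(5, 8)
train_labels = np.repeat(np.arange(5), 2)
train_feats = centers[train_labels]

# A query that mostly resembles class 2, with a slight class-1 component.
query_feat = centers[2] + 0.1 * centers[1]
# The frozen classifier narrowly (and wrongly) prefers class 1.
class_probs = np.array([0.06, 0.45, 0.40, 0.05, 0.04])

best, scores = pcnn_rerank(query_feat, class_probs, train_feats, train_labels, k=3)
print(best, scores)
```

In this toy setup the classifier's top-1 class is 1, while the re-ranked prediction is class 2, because the query's nearest neighbor in class 2 is far more similar than its nearest neighbor in class 1. Only the top-k classes are compared, so the extra cost grows with k rather than with the total number of classes.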

The authors evaluate this "probable-class nearest neighbors" (PCNN) approach on fine-grained image classification tasks like distinguishing between different car models or dog breeds. They find consistent improvements in accuracy compared to the base classifier C.

Additionally, the authors conducted a human study comparing two kinds of explanation shown to lay users: nearest-neighbor examples from only the top-1 class, as in prior work, and PCNN examples drawn from the top-K most probable classes. Users who saw the PCNN examples made more accurate decisions, which suggests that these explanations are useful to people as well as to the model.

Critical Analysis

The paper presents a compelling and relatively simple way to leverage nearest neighbors to enhance the predictions of a pre-trained classifier. This is an interesting departure from the traditional use of NN for final decision-making or explanation.

That said, the paper does not explore the limits or failure modes of this approach. For example, it is unclear how the method would perform on more coarse-grained classification tasks, or how sensitive it is to hyperparameters such as K, the number of probable classes considered. The authors also do not compare against more sophisticated NN-based explanation methods or adversarial training approaches.

Overall, this is a promising line of research, but more work is needed to fully understand the strengths, weaknesses, and appropriate applications of the PCNN technique.

Conclusion

This paper introduces a novel method to leverage nearest neighbors to improve the predictions of a pre-trained image classifier. By comparing the input to NN images from the top predicted classes, the approach can boost the classifier's confidence in the correct predictions.

The authors demonstrate consistent accuracy gains on fine-grained classification tasks, and show that surfacing these NN explanations also helps human users make more accurate decisions. While more research is needed, this work points to an intriguing new use of NN in machine learning systems.

Related Papers

CoProNN: Concept-based Prototypical Nearest Neighbors for Explaining Vision Models

Teodor Chiaburu, Frank Haußer, Felix Bießmann

Mounting evidence in explainability for artificial intelligence (XAI) research suggests that good explanations should be tailored to individual tasks and should relate to concepts relevant to the task. However, building task specific explanations is time consuming and requires domain expertise which can be difficult to integrate into generic XAI methods. A promising approach towards designing useful task specific explanations with domain experts is based on compositionality of semantic concepts. Here, we present a novel approach that enables domain experts to quickly create concept-based explanations for computer vision tasks intuitively via natural language. Leveraging recent progress in deep generative methods we propose to generate visual concept-based prototypes via text-to-image methods. These prototypes are then used to explain predictions of computer vision models via a simple k-Nearest-Neighbors routine. The modular design of CoProNN is simple to implement, it is straightforward to adapt to novel tasks and allows for replacing the classification and text-to-image models as more powerful models are released. The approach can be evaluated offline against the ground-truth of predefined prototypes that can be easily communicated also to domain experts as they are based on visual concepts. We show that our strategy competes very well with other concept-based XAI approaches on coarse grained image classification tasks and may even outperform those methods on more demanding fine grained tasks. We demonstrate the effectiveness of our method for human-machine collaboration settings in qualitative and quantitative user studies. All code and experimental data can be found in our GitHub repository: https://github.com/TeodorChiaburu/beexplainable

4/24/2024

cs.CV cs.AI

On adversarial training and the 1 Nearest Neighbor classifier

Amir Hagai, Yair Weiss

The ability to fool deep learning classifiers with tiny perturbations of the input has led to the development of adversarial training in which the loss with respect to adversarial examples is minimized in addition to the training examples. While adversarial training improves the robustness of the learned classifiers, the procedure is computationally expensive, sensitive to hyperparameters and may still leave the classifier vulnerable to other types of small perturbations. In this paper we analyze the adversarial robustness of the 1 Nearest Neighbor (1NN) classifier and compare its performance to adversarial training. We prove that under reasonable assumptions, the 1NN classifier will be robust to any small image perturbation of the training images and will give high adversarial accuracy on test images as the number of training examples goes to infinity. In experiments with 45 different binary image classification problems taken from CIFAR10, we find that 1NN outperforms TRADES (a powerful adversarial training algorithm) in terms of average adversarial accuracy. In additional experiments with 69 pretrained robust models for CIFAR10, we find that 1NN outperforms almost all of them in terms of robustness to perturbations that are only slightly different from those seen during training. Taken together, our results suggest that modern adversarial training methods still fall short of the robustness of the simple 1NN classifier. Our code can be found at https://github.com/amirhagai/On-Adversarial-Training-And-The-1-Nearest-Neighbor-Classifier

4/12/2024

cs.LG

Local Neighborhood Features for 3D Classification

Shivanand Venkanna Sheshappanavar, Chandra Kambhamettu

With advances in deep learning model training strategies, the training of point cloud classification methods is significantly improving. For example, PointNeXt, which adopts prominent training techniques and InvResNet layers into PointNet++, achieves over 7% improvement on the real-world ScanObjectNN dataset. However, most of these models use point coordinate features of neighborhood points mapped to higher dimensional space while ignoring the neighborhood point features computed before feeding to the network layers. In this paper, we revisit the PointNeXt model to study the usage and benefit of such neighborhood point features. We train and evaluate PointNeXt on ModelNet40 (synthetic), ScanObjectNN (real-world), and a recent large-scale, real-world grocery dataset, i.e., 3DGrocery100. In addition, we provide an additional inference strategy of weight averaging the top two checkpoints of PointNeXt to improve classification accuracy. Together with the abovementioned ideas, we gain 0.5%, 1%, 4.8%, 3.4%, and 1.6% overall accuracy on the PointNeXt model with real-world datasets, ScanObjectNN (hardest variant), 3DGrocery100's Apple10, Fruits, Vegetables, and Packages subsets, respectively. We also achieve a comparable 0.2% accuracy gain on ModelNet40.

4/11/2024

cs.CV cs.MM

Information Modified K-Nearest Neighbor

Mohammad Ali Vahedifar, Azim Akhtarshenas, Maryam Sabbaghian, Mohammad Mohammadi Rafatpanah, Ramin Toosi

The fundamental concept underlying K-Nearest Neighbors (KNN) is the classification of samples based on the majority through their nearest neighbors. Although distance and neighbors' labels are critical in KNN, traditional KNN treats all samples equally. However, some KNN variants weigh neighbors differently based on a specific rule, considering each neighbor's distance and label. Many KNN methodologies introduce complex algorithms that do not significantly outperform the traditional KNN, often leading to less satisfactory outcomes. The gap in reliably extracting information for accurately predicting true weights remains an open research challenge. In our proposed method, information-modified KNN (IMKNN), we bridge the gap by presenting a straightforward algorithm that achieves effective results. To this end, we introduce a classification method to improve the performance of the KNN algorithm. By exploiting mutual information (MI) and incorporating ideas from Shapley's values, we improve the traditional KNN performance in accuracy, precision, and recall, offering a more refined and effective solution. To evaluate the effectiveness of our method, it is compared with eight variants of KNN. We conduct experiments on 12 widely-used datasets, achieving 11.05%, 12.42%, and 12.07% in accuracy, precision, and recall performance, respectively, compared to traditional KNN. Additionally, we compared IMKNN with traditional KNN across four large-scale datasets to highlight the distinct advantages of IMKNN in the impact of monotonicity, noise, density, subclusters, and skewed distributions. Our research indicates that IMKNN consistently surpasses other methods in diverse datasets.

5/15/2024

cs.LG cs.IT