Positive Label Is All You Need for Multi-Label Classification

2306.16016

Published 4/17/2024 by Zhixiang Yuan, Kaixin Zhang, Tao Huang

Positive Label Is All You Need for Multi-Label Classification

Abstract

Multi-label classification (MLC) faces challenges from label noise in training data due to annotating diverse semantic labels for each image. Current methods mainly target identifying and correcting label mistakes using trained MLC models, but still struggle with persistent noisy labels during training, resulting in imprecise recognition and reduced performance. Our paper addresses label noise in MLC by introducing a positive and unlabeled multi-label classification (PU-MLC) method. To counteract noisy labels, we directly discard negative labels, focusing on the abundance of negative labels and the origin of most noisy labels. PU-MLC employs positive-unlabeled learning, training the model with only positive labels and unlabeled data. The method incorporates adaptive re-balance factors and temperature coefficients in the loss function to address label distribution imbalance and prevent over-smoothing of probabilities during training. Additionally, we introduce a local-global convolution module to capture both local and global dependencies in the image without requiring backbone retraining. PU-MLC proves effective on MLC and MLC with partial labels (MLC-PL) tasks, demonstrating significant improvements on MS-COCO and PASCAL VOC datasets with fewer annotations. Code is available at: https://github.com/TAKELAMAG/PU-MLC.

Create account to get full access

Overview

This paper proposes a novel multi-label classification method that only requires positive labels, without the need for negative labels.
The method, called "Positive Label Is All You Need" (PLAN), leverages contrastive learning to effectively learn from positive-only data.
PLAN outperforms state-of-the-art multi-label classification methods on several benchmark datasets, demonstrating its effectiveness.

Plain English Explanation

In multi-label classification, the goal is to assign multiple labels to each input, such as identifying multiple objects in an image. Traditionally, this has required having both positive labels (the objects that are present) and negative labels (the objects that are not present). However, the authors of this paper propose a new method that can perform well even when only positive labels are available.

The key idea is to use a technique called "contrastive learning." Instead of trying to directly predict the labels, the model learns to identify which labels are present by comparing the features of the input to the features of other inputs that have different labels. This allows the model to learn useful representations without needing negative labels.

The authors show that their "Positive Label Is All You Need" (PLAN) method outperforms other state-of-the-art multi-label classification techniques on several benchmark datasets. This is an important advance, as in many real-world scenarios, it may be easier or more practical to only obtain positive labels, rather than both positive and negative labels.

Technical Explanation

The core of the PLAN method is a contrastive learning framework that learns representations from positive-only data. The model takes an input and its associated positive labels, and compares it to other inputs with different positive labels. By maximizing the similarity between the input and the positive labels, and minimizing the similarity between the input and the negative labels (i.e., all the other labels), the model learns to identify which labels are present in the input.

The authors also propose several techniques to further improve the performance of PLAN, including multi-level label correction and counterfactual reasoning. These techniques help the model better handle long-tailed label distributions and explore contrastive learning for multi-label classification.

The authors evaluate PLAN on several multi-label classification datasets, including MS-COCO, NUS-WIDE, and LVIS. They show that PLAN outperforms state-of-the-art methods, including those that require both positive and negative labels, demonstrating the effectiveness of their positive-only approach.

Critical Analysis

The authors provide a thorough analysis of the PLAN method and its performance on various benchmark datasets. They acknowledge that the method may be more suitable for certain types of multi-label classification tasks, where obtaining negative labels is challenging or infeasible.

One potential limitation of the PLAN method is that it may not be as effective in scenarios with highly imbalanced label distributions, where some labels are much more common than others. The authors mention that their proposed techniques, such as multi-level label correction, help mitigate this issue, but further research may be needed to fully address this challenge.

Additionally, the authors do not provide a comprehensive analysis of the computational complexity or training time requirements of the PLAN method compared to other multi-label classification approaches. This information would be valuable for understanding the practical applicability of the method, especially in real-world scenarios with large-scale datasets.

Overall, the PLAN method represents an interesting and promising approach to multi-label classification that can be particularly useful in situations where obtaining negative labels is difficult or infeasible. The authors have demonstrated its effectiveness through rigorous experiments, and the critical analysis suggests potential areas for further improvement and investigation.

Conclusion

The "Positive Label Is All You Need" (PLAN) method proposed in this paper offers a novel solution to the multi-label classification problem by leveraging contrastive learning to learn from positive-only data. This is a significant advancement, as obtaining negative labels can be challenging in many real-world applications.

The authors have shown that PLAN outperforms state-of-the-art multi-label classification techniques across several benchmark datasets, highlighting the effectiveness of their approach. While the method may have some limitations, particularly in handling highly imbalanced label distributions, the critical analysis suggests that PLAN represents an important step forward in the field of multi-label classification.

As the demand for accurate and efficient multi-label classification continues to grow, the PLAN method and its underlying principles could have far-reaching implications for a wide range of applications, from image recognition to text categorization and beyond.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

⛏️

Positive Unlabeled Contrastive Learning

Anish Acharya, Sujay Sanghavi, Li Jing, Bhargav Bhushanam, Dhruv Choudhary, Michael Rabbat, Inderjit Dhillon

Self-supervised pretraining on unlabeled data followed by supervised fine-tuning on labeled data is a popular paradigm for learning from limited labeled examples. We extend this paradigm to the classical positive unlabeled (PU) setting, where the task is to learn a binary classifier given only a few labeled positive samples, and (often) a large amount of unlabeled samples (which could be positive or negative). We first propose a simple extension of standard infoNCE family of contrastive losses, to the PU setting; and show that this learns superior representations, as compared to existing unsupervised and supervised approaches. We then develop a simple methodology to pseudo-label the unlabeled samples using a new PU-specific clustering scheme; these pseudo-labels can then be used to train the final (positive vs. negative) classifier. Our method handily outperforms state-of-the-art PU methods over several standard PU benchmark datasets, while not requiring a-priori knowledge of any class prior (which is a common assumption in other PU methods). We also provide a simple theoretical analysis that motivates our methods.

4/1/2024

cs.LG cs.AI

Meta-learning for Positive-unlabeled Classification

Atsutoshi Kumagai, Tomoharu Iwata, Yasuhiro Fujiwara

We propose a meta-learning method for positive and unlabeled (PU) classification, which improves the performance of binary classifiers obtained from only PU data in unseen target tasks. PU learning is an important problem since PU data naturally arise in real-world applications such as outlier detection and information retrieval. Existing PU learning methods require many PU data, but sufficient data are often unavailable in practice. The proposed method minimizes the test classification risk after the model is adapted to PU data by using related tasks that consist of positive, negative, and unlabeled data. We formulate the adaptation as an estimation problem of the Bayes optimal classifier, which is an optimal classifier to minimize the classification risk. The proposed method embeds each instance into a task-specific space using neural networks. With the embedded PU data, the Bayes optimal classifier is estimated through density-ratio estimation of PU densities, whose solution is obtained as a closed-form solution. The closed-form solution enables us to efficiently and effectively minimize the test classification risk. We empirically show that the proposed method outperforms existing methods with one synthetic and three real-world datasets.

6/7/2024

cs.LG stat.ML

🏷️

Boosting Single Positive Multi-label Classification with Generalized Robust Loss

Yanxi Chen, Chunxiao Li, Xinyang Dai, Jinhuan Li, Weiyu Sun, Yiming Wang, Renyuan Zhang, Tinghe Zhang, Bo Wang

Multi-label learning (MLL) requires comprehensive multi-semantic annotations that is hard to fully obtain, thus often resulting in missing labels scenarios. In this paper, we investigate Single Positive Multi-label Learning (SPML), where each image is associated with merely one positive label. Existing SPML methods only focus on designing losses using mechanisms such as hard pseudo-labeling and robust losses, mostly leading to unacceptable false negatives. To address this issue, we first propose a generalized loss framework based on expected risk minimization to provide soft pseudo labels, and point out that the former losses can be seamlessly converted into our framework. In particular, we design a novel robust loss based on our framework, which enjoys flexible coordination between false positives and false negatives, and can additionally deal with the imbalance between positive and negative samples. Extensive experiments show that our approach can significantly improve SPML performance and outperform the vast majority of state-of-the-art methods on all the four benchmarks.

5/7/2024

cs.LG cs.AI cs.CV

ProbMCL: Simple Probabilistic Contrastive Learning for Multi-label Visual Classification

Ahmad Sajedi, Samir Khaki, Yuri A. Lawryshyn, Konstantinos N. Plataniotis

Multi-label image classification presents a challenging task in many domains, including computer vision and medical imaging. Recent advancements have introduced graph-based and transformer-based methods to improve performance and capture label dependencies. However, these methods often include complex modules that entail heavy computation and lack interpretability. In this paper, we propose Probabilistic Multi-label Contrastive Learning (ProbMCL), a novel framework to address these challenges in multi-label image classification tasks. Our simple yet effective approach employs supervised contrastive learning, in which samples that share enough labels with an anchor image based on a decision threshold are introduced as a positive set. This structure captures label dependencies by pulling positive pair embeddings together and pushing away negative samples that fall below the threshold. We enhance representation learning by incorporating a mixture density network into contrastive learning and generating Gaussian mixture distributions to explore the epistemic uncertainty of the feature encoder. We validate the effectiveness of our framework through experimentation with datasets from the computer vision and medical imaging domains. Our method outperforms the existing state-of-the-art methods while achieving a low computational footprint on both datasets. Visualization analyses also demonstrate that ProbMCL-learned classifiers maintain a meaningful semantic topology.

4/15/2024

cs.CV cs.LG