Can Class-Priors Help Single-Positive Multi-Label Learning?

Read original: arXiv:2309.13886 - Published 5/28/2024 by Biao Liu, Ning Xu, Jie Wang, Xin Geng

🔮

Overview

The paper introduces a novel framework called "Class-pRiors Induced Single-Positive multi-label learning" (CRISP) to address the limitations of existing single-positive multi-label learning (SPMLL) methods.
SPMLL is a type of weakly supervised multi-label learning where each training example is annotated with only one positive label, and the other labels are assumed to be negative.
Existing SPMLL methods often make the unrealistic assumption that the prior probabilities of all classes are identical, which can lead to poor performance on real-world datasets where class priors may differ significantly.
CRISP introduces a class-priors estimator that can accurately estimate the true class priors, and an unbiased risk estimator for classification that can approximate the optimal risk minimizer on fully supervised data.

Plain English Explanation

In machine learning, there are situations where we have a dataset where each example is labeled with only one positive label, and the rest of the labels are assumed to be negative. This is called single-positive multi-label learning (SPMLL). Existing SPMLL methods often assume that the prior probabilities (the likelihood of each label occurring) are the same for all classes. However, in the real world, the prior probabilities of different labels can vary significantly.

The proposed framework, CRISP, addresses this issue by introducing a way to estimate the true prior probabilities of each class. This allows the model to make more accurate predictions, even in cases where the class priors are uneven. The paper also derives an unbiased risk estimator for classification, which means the model can better approximate the optimal performance that would be achieved with fully labeled data.

The researchers tested CRISP on several benchmark datasets and found that it outperformed existing SPMLL methods. This suggests that accounting for realistic class priors can significantly improve the performance of multi-label learning models, especially in real-world scenarios where the distribution of labels is not uniform.

Technical Explanation

The key elements of the CRISP framework are:

Class-Priors Estimator: CRISP introduces a class-priors estimator that can accurately estimate the true class priors, even in the absence of complete label information. This is achieved by leveraging the single positive label available for each training example.
Unbiased Risk Estimator: Based on the estimated class priors, CRISP derives an unbiased risk estimator for classification. This estimator can be used to train a predictive model that approximately converges to the optimal risk minimizer on fully supervised data.
Model Training: The predictive model in CRISP is trained by minimizing the unbiased risk estimator, which allows the model to learn effectively even with the limited label information available in SPMLL scenarios.

The key insight behind CRISP is that by accurately estimating the class priors, the model can make better predictions, even when it only has access to a single positive label per example. This is a significant improvement over existing SPMLL methods that make unrealistic assumptions about the class priors.

The paper presents a thorough experimental evaluation of CRISP on 10 multi-label learning benchmark datasets, demonstrating its effectiveness and superiority over state-of-the-art SPMLL approaches.

Critical Analysis

The CRISP framework addresses an important limitation of existing SPMLL methods, but there are a few potential areas for further research:

Scalability: The paper does not discuss the computational complexity of the CRISP framework, which could be a concern for large-scale real-world applications. Positive-label is all you need and ProbMCL have explored more scalable approaches to SPMLL, and combining their ideas with CRISP could be an interesting direction.
Robustness: The paper does not address the potential impact of noisy or inaccurate single-positive labels on the performance of CRISP. Positive Unlabeled Contrastive Learning has explored ways to make SPMLL more robust to label noise, and these techniques could be integrated with CRISP.
Generalization: The experiments in the paper are limited to standard multi-label learning benchmarks. It would be valuable to evaluate CRISP on real-world applications with more diverse and complex label distributions to assess its generalization capabilities.

Overall, the CRISP framework represents a significant advancement in SPMLL by addressing the critical issue of class priors. The technical insights and experimental results suggest that CRISP is a promising approach that could have a meaningful impact on real-world multi-label learning tasks.

Conclusion

The CRISP framework introduced in this paper addresses a key limitation of existing single-positive multi-label learning (SPMLL) methods by accurately estimating the class priors and deriving an unbiased risk estimator for classification. This allows CRISP to outperform state-of-the-art SPMLL approaches on a range of benchmark datasets.

The ability to handle realistic class priors is a crucial aspect of making multi-label learning systems more practical and effective in real-world applications. The insights and techniques presented in this paper represent an important step forward in this direction and could have significant implications for the development of more robust and reliable multi-label learning systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🔮

Can Class-Priors Help Single-Positive Multi-Label Learning?

Biao Liu, Ning Xu, Jie Wang, Xin Geng

Single-positive multi-label learning (SPMLL) is a typical weakly supervised multi-label learning problem, where each training example is annotated with only one positive label. Existing SPMLL methods typically assign pseudo-labels to unannotated labels with the assumption that prior probabilities of all classes are identical. However, the class-prior of each category may differ significantly in real-world scenarios, which makes the predictive model not perform as well as expected due to the unrealistic assumption on real-world application. To alleviate this issue, a novel framework named {proposed}, i.e., Class-pRiors Induced Single-Positive multi-label learning, is proposed. Specifically, a class-priors estimator is introduced, which could estimate the class-priors that are theoretically guaranteed to converge to the ground-truth class-priors. In addition, based on the estimated class-priors, an unbiased risk estimator for classification is derived, and the corresponding risk minimizer could be guaranteed to approximately converge to the optimal risk minimizer on fully supervised data. Experimental results on ten MLL benchmark datasets demonstrate the effectiveness and superiority of our method over existing SPMLL approaches.

5/28/2024

🏷️

Boosting Single Positive Multi-label Classification with Generalized Robust Loss

Yanxi Chen, Chunxiao Li, Xinyang Dai, Jinhuan Li, Weiyu Sun, Yiming Wang, Renyuan Zhang, Tinghe Zhang, Bo Wang

Multi-label learning (MLL) requires comprehensive multi-semantic annotations that is hard to fully obtain, thus often resulting in missing labels scenarios. In this paper, we investigate Single Positive Multi-label Learning (SPML), where each image is associated with merely one positive label. Existing SPML methods only focus on designing losses using mechanisms such as hard pseudo-labeling and robust losses, mostly leading to unacceptable false negatives. To address this issue, we first propose a generalized loss framework based on expected risk minimization to provide soft pseudo labels, and point out that the former losses can be seamlessly converted into our framework. In particular, we design a novel robust loss based on our framework, which enjoys flexible coordination between false positives and false negatives, and can additionally deal with the imbalance between positive and negative samples. Extensive experiments show that our approach can significantly improve SPML performance and outperform the vast majority of state-of-the-art methods on all the four benchmarks.

5/7/2024

Positive Label Is All You Need for Multi-Label Classification

Zhixiang Yuan, Kaixin Zhang, Tao Huang

Multi-label classification (MLC) faces challenges from label noise in training data due to annotating diverse semantic labels for each image. Current methods mainly target identifying and correcting label mistakes using trained MLC models, but still struggle with persistent noisy labels during training, resulting in imprecise recognition and reduced performance. Our paper addresses label noise in MLC by introducing a positive and unlabeled multi-label classification (PU-MLC) method. To counteract noisy labels, we directly discard negative labels, focusing on the abundance of negative labels and the origin of most noisy labels. PU-MLC employs positive-unlabeled learning, training the model with only positive labels and unlabeled data. The method incorporates adaptive re-balance factors and temperature coefficients in the loss function to address label distribution imbalance and prevent over-smoothing of probabilities during training. Additionally, we introduce a local-global convolution module to capture both local and global dependencies in the image without requiring backbone retraining. PU-MLC proves effective on MLC and MLC with partial labels (MLC-PL) tasks, demonstrating significant improvements on MS-COCO and PASCAL VOC datasets with fewer annotations. Code is available at: https://github.com/TAKELAMAG/PU-MLC.

4/17/2024

⛏️

Positive Unlabeled Contrastive Learning

Anish Acharya, Sujay Sanghavi, Li Jing, Bhargav Bhushanam, Dhruv Choudhary, Michael Rabbat, Inderjit Dhillon

Self-supervised pretraining on unlabeled data followed by supervised fine-tuning on labeled data is a popular paradigm for learning from limited labeled examples. We extend this paradigm to the classical positive unlabeled (PU) setting, where the task is to learn a binary classifier given only a few labeled positive samples, and (often) a large amount of unlabeled samples (which could be positive or negative). We first propose a simple extension of standard infoNCE family of contrastive losses, to the PU setting; and show that this learns superior representations, as compared to existing unsupervised and supervised approaches. We then develop a simple methodology to pseudo-label the unlabeled samples using a new PU-specific clustering scheme; these pseudo-labels can then be used to train the final (positive vs. negative) classifier. Our method handily outperforms state-of-the-art PU methods over several standard PU benchmark datasets, while not requiring a-priori knowledge of any class prior (which is a common assumption in other PU methods). We also provide a simple theoretical analysis that motivates our methods.

4/1/2024