Learning with Complementary Labels Revisited: The Selected-Completely-at-Random Setting Is More Practical

Read original: arXiv:2311.15502 - Published 5/14/2024 by Wei Wang, Takashi Ishida, Yu-Jie Zhang, Gang Niu, Masashi Sugiyama

Learning with Complementary Labels Revisited: The Selected-Completely-at-Random Setting Is More Practical

Overview

This paper proposes a new approach for learning with complementary labels, which are a type of partial labeling where the true label is not observed, but rather the complement of the true label is known.
The authors present a consistent negative-unlabeled (NU) learning method that can handle complementary labels and provide theoretical guarantees on its performance.
The method is designed to be robust and effective, with applications in areas like medical image analysis and object recognition.

Plain English Explanation

The paper tackles the problem of learning with complementary labels, which is a situation where the true label of a data point is not directly observed. Instead, we only know the "complement" of the true label - for example, we might know that an image is not of a cat, but we don't know what the actual object in the image is.

The authors propose a new negative-unlabeled (NU) learning method that can effectively handle this type of partial labeling. The key idea is to treat the complementary labels as "negative" examples, and then use a specialized learning algorithm to extract useful information from this data.

This NU learning approach has several advantages. First, it comes with theoretical guarantees, meaning we can prove that it will perform well under certain conditions. Second, it is designed to be robust and effective, so it should work well in practical applications like medical image analysis or object recognition.

The paper provides a detailed technical explanation of the NU learning method, as well as experiments demonstrating its effectiveness. Overall, this research represents an important advance in the field of machine learning with partial or complementary labels, with potential applications in a variety of domains.

Technical Explanation

The paper first introduces the problem of multi-class classification with complementary labels. In this setting, the true label of each data point is not observed; instead, we only know the "complement" of the true label, i.e., the set of all labels that the data point is not.

The authors then present their negative-unlabeled (NU) learning approach for handling complementary labels. The key idea is to treat the complementary labels as "negative" examples, and then use a specialized learning algorithm to extract useful information from this data. Specifically, the NU learning method involves:

Estimating the class-conditional probabilities of the negative (complementary) labels.
Using these estimates to construct a consistent risk estimator for the true classification problem.
Optimizing this risk estimator to learn an accurate classifier.

The paper provides a theoretical analysis of the NU learning method, proving that it is consistent (i.e., the learned classifier converges to the optimal one as the amount of data increases) under certain conditions.

The authors also conduct experiments on both synthetic and real-world datasets, including medical image analysis and object recognition tasks. They show that the NU learning method outperforms alternative approaches for learning with complementary labels, such as Positive-Unlabeled Contrastive Learning, Learning to Complement Multiple Humans, and Characterization of Semi-Supervised Adversarially Robust PAC-Learnability.

Critical Analysis

The paper presents a well-designed and theoretically grounded approach for learning with complementary labels. The NU learning method is a clever way to extract useful information from the partial label data, and the theoretical guarantees provide confidence in its performance.

However, the paper does not address some potential limitations or areas for further research. For example, the method assumes that the complementary labels are independent of the true labels, which may not always be the case in real-world scenarios. Additionally, the paper does not explore the sample complexity of the NU learning approach, i.e., how much data is required to achieve good performance.

It would also be interesting to see how the NU learning method compares to other approaches for learning with partial labels, such as Soft Label PU Learning or Contrastive Credibility Propagation for Reliable Semi-Supervised Learning. A more comprehensive empirical evaluation could provide additional insights into the strengths and weaknesses of the proposed technique.

Conclusion

This paper presents a novel negative-unlabeled (NU) learning approach for handling complementary labels, a type of partial labeling where the true label is not observed, but its complement is known. The authors demonstrate the theoretical consistency of their method and show its effectiveness on various real-world tasks.

While the paper has some limitations and areas for further research, it represents an important contribution to the field of machine learning with partial or complementary labels. The NU learning technique could have significant practical applications in domains like medical image analysis, object recognition, and beyond, where partial or noisy labels are common.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Learning with Complementary Labels Revisited: The Selected-Completely-at-Random Setting Is More Practical

Wei Wang, Takashi Ishida, Yu-Jie Zhang, Gang Niu, Masashi Sugiyama

Complementary-label learning is a weakly supervised learning problem in which each training example is associated with one or multiple complementary labels indicating the classes to which it does not belong. Existing consistent approaches have relied on the uniform distribution assumption to model the generation of complementary labels, or on an ordinary-label training set to estimate the transition matrix in non-uniform cases. However, either condition may not be satisfied in real-world scenarios. In this paper, we propose a novel consistent approach that does not rely on these conditions. Inspired by the positive-unlabeled (PU) learning literature, we propose an unbiased risk estimator based on the Selected-Completely-at-Random assumption for complementary-label learning. We then introduce a risk-correction approach to address overfitting problems. Furthermore, we find that complementary-label learning can be expressed as a set of negative-unlabeled binary classification problems when using the one-versus-rest strategy. Extensive experimental results on both synthetic and real-world benchmark datasets validate the superiority of our proposed approach over state-of-the-art methods.

5/14/2024

🔗

CLImage: Human-Annotated Datasets for Complementary-Label Learning

Hsiu-Hsuan Wang, Tan-Ha Mai, Nai-Xuan Ye, Wei-I Lin, Hsuan-Tien Lin

Complementary-label learning (CLL) is a weakly-supervised learning paradigm that aims to train a multi-class classifier using only complementary labels, which indicate classes to which an instance does not belong. Despite numerous algorithmic proposals for CLL, their practical applicability remains unverified for two reasons. Firstly, these algorithms often rely on assumptions about the generation of complementary labels, and it is not clear how far the assumptions are from reality. Secondly, their evaluation has been limited to synthetic datasets. To gain insights into the real-world performance of CLL algorithms, we developed a protocol to collect complementary labels from human annotators. Our efforts resulted in the creation of four datasets: CLCIFAR10, CLCIFAR20, CLMicroImageNet10, and CLMicroImageNet20, derived from well-known classification datasets CIFAR10, CIFAR100, and TinyImageNet200. These datasets represent the very first real-world CLL datasets. Through extensive benchmark experiments, we discovered a notable decrease in performance when transitioning from synthetic datasets to real-world datasets. We investigated the key factors contributing to the decrease with a thorough dataset-level ablation study. Our analyses highlight annotation noise as the most influential factor in the real-world datasets. In addition, we discover that the biased-nature of human-annotated complementary labels and the difficulty to validate with only complementary labels are two outstanding barriers to practical CLL. These findings suggest that the community focus more research efforts on developing CLL algorithms and validation schemes that are robust to noisy and biased complementary-label distributions.

6/26/2024

⛏️

Positive Unlabeled Contrastive Learning

Anish Acharya, Sujay Sanghavi, Li Jing, Bhargav Bhushanam, Dhruv Choudhary, Michael Rabbat, Inderjit Dhillon

Self-supervised pretraining on unlabeled data followed by supervised fine-tuning on labeled data is a popular paradigm for learning from limited labeled examples. We extend this paradigm to the classical positive unlabeled (PU) setting, where the task is to learn a binary classifier given only a few labeled positive samples, and (often) a large amount of unlabeled samples (which could be positive or negative). We first propose a simple extension of standard infoNCE family of contrastive losses, to the PU setting; and show that this learns superior representations, as compared to existing unsupervised and supervised approaches. We then develop a simple methodology to pseudo-label the unlabeled samples using a new PU-specific clustering scheme; these pseudo-labels can then be used to train the final (positive vs. negative) classifier. Our method handily outperforms state-of-the-art PU methods over several standard PU benchmark datasets, while not requiring a-priori knowledge of any class prior (which is a common assumption in other PU methods). We also provide a simple theoretical analysis that motivates our methods.

4/1/2024

Self Adaptive Threshold Pseudo-labeling and Unreliable Sample Contrastive Loss for Semi-supervised Image Classification

Xuerong Zhang, Li Huang, Jing Lv, Ming Yang

Semi-supervised learning is attracting blooming attention, due to its success in combining unlabeled data. However, pseudo-labeling-based semi-supervised approaches suffer from two problems in image classification: (1) Existing methods might fail to adopt suitable thresholds since they either use a pre-defined/fixed threshold or an ad-hoc threshold adjusting scheme, resulting in inferior performance and slow convergence. (2) Discarding unlabeled data with confidence below the thresholds results in the loss of discriminating information. To solve these issues, we develop an effective method to make sufficient use of unlabeled data. Specifically, we design a self adaptive threshold pseudo-labeling strategy, which thresholds for each class can be dynamically adjusted to increase the number of reliable samples. Meanwhile, in order to effectively utilise unlabeled data with confidence below the thresholds, we propose an unreliable sample contrastive loss to mine the discriminative information in low-confidence samples by learning the similarities and differences between sample features. We evaluate our method on several classification benchmarks under partially labeled settings and demonstrate its superiority over the other approaches.

7/8/2024