Boosting Single Positive Multi-label Classification with Generalized Robust Loss

Read original: arXiv:2405.03501 - Published 5/7/2024 by Yanxi Chen, Chunxiao Li, Xinyang Dai, Jinhuan Li, Weiyu Sun, Yiming Wang, Renyuan Zhang, Tinghe Zhang, Bo Wang

Overview

In this paper, the researchers investigate a specific type of multi-label learning called Single Positive Multi-label Learning (SPML).
In SPML, each image is annotated with only one positive label, because comprehensive multi-semantic annotations are hard to obtain in full.
Existing SPML methods focus on designing losses using techniques like hard pseudo-labeling and robust losses, but these often lead to unacceptable false negatives.

Plain English Explanation

In the field of machine learning, there is a problem called multi-label learning where an image or object can be associated with multiple labels or tags. For example, an image of a dog might be labeled as "dog," "animal," and "pet." However, obtaining a comprehensive set of all the possible labels for each image can be very difficult.

The researchers in this paper investigate a more specific type of multi-label learning called Single Positive Multi-label Learning (SPML). In SPML, each training image is annotated with only a single positive label and every other label is left unannotated, so the model has to learn the remaining relevant labels without direct supervision. Existing SPML methods have tried to address this by using techniques like hard pseudo-labeling and robust losses, but these often result in too many false negatives - cases where the model fails to identify a label that should have been there.
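To make the setting concrete, here is a minimal sketch of how a fully labeled example could be reduced to a single-positive one. The function name `to_single_positive`, the class names, and the choice to pick the kept positive uniformly at random are assumptions for illustration, not details taken from the paper.

```python
import numpy as np

def to_single_positive(full_labels, rng):
    """Keep one randomly chosen positive per row; mark every other entry as unannotated (0)."""
    full_labels = np.asarray(full_labels)
    observed = np.zeros_like(full_labels)
    for i, row in enumerate(full_labels):
        positives = np.flatnonzero(row == 1)
        if positives.size:  # rows without any positive stay fully unannotated
            observed[i, rng.choice(positives)] = 1
    return observed

# Hypothetical classes: dog, animal, pet, car
full = np.array([[1, 1, 1, 0],
                 [0, 0, 0, 1]])
observed = to_single_positive(full, np.random.default_rng(0))
```

In the first row, two of the three true labels end up marked 0 even though they are correct. A loss that treats every 0 as a true negative would penalize the model for predicting them, which is the false-negative problem described above.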

Technical Explanation

To address this issue, the researchers first propose a generalized loss framework based on expected risk minimization to provide "soft" pseudo labels, which are more flexible than the hard pseudo-labels used in previous work. They show that the losses used in earlier SPML methods can be seamlessly converted into this new framework.
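The idea of a soft pseudo label can be sketched as an expected loss over the unknown true value of each unannotated label. The code below is an illustrative reading of that idea, not the paper's exact formulation: the function `soft_pseudo_label_loss`, the use of binary cross-entropy terms, and the argument `k` (an assumed probability that an unannotated label is actually positive) are all assumptions.

```python
import numpy as np

EPS = 1e-7

def soft_pseudo_label_loss(p, observed, k):
    """Per-label loss under soft pseudo labels.

    p        : predicted probabilities, shape (n, c)
    observed : 1 where the single positive is annotated, 0 where unannotated
    k        : assumed probability that an unannotated label is actually positive
    """
    p = np.clip(p, EPS, 1 - EPS)
    pos_loss = -np.log(p)        # loss if the label is truly positive
    neg_loss = -np.log(1 - p)    # loss if the label is truly negative
    # Expected loss over the unknown true value of each unannotated label.
    unannotated = k * pos_loss + (1 - k) * neg_loss
    return np.where(observed == 1, pos_loss, unannotated)

p = np.array([[0.9, 0.6, 0.1]])
observed = np.array([[1, 0, 0]])

# k = 0 everywhere: every unannotated label is treated as negative.
assume_negative = soft_pseudo_label_loss(p, observed, k=np.zeros_like(p))
# k in {0, 1}: hard pseudo-labeling by thresholding the prediction.
hard_pseudo = soft_pseudo_label_loss(p, observed, k=(p > 0.5).astype(float))
# k in [0, 1]: a soft pseudo label (hypothetical choice: the prediction itself).
soft = soft_pseudo_label_loss(p, observed, k=p)
```

Under these assumptions, the "assume negative" baseline and hard pseudo-labeling fall out as special cases of `k`, which is one way to picture how earlier losses could be expressed in a single framework.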

The researchers then design a novel robust loss within their framework that can better balance the trade-off between false positives and false negatives. This new loss can also handle the imbalance between positive and negative samples, which is common in SPML scenarios.
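The summary does not give the form of the new loss, so the sketch below uses a swapped-in, well-known robust loss, the generalized cross-entropy `(1 - p**q) / q`, to show what "robust" means in practice. The helper `weighted_mean_loss` and its `unannotated_weight` parameter are hypothetical and only illustrate one simple way to counter the imbalance between the single annotated positive and the many unannotated labels.

```python
import numpy as np

def generalized_ce(p, q):
    """Generalized cross-entropy (1 - p**q) / q for 0 < q <= 1."""
    p = np.asarray(p, dtype=float)
    return (1.0 - p ** q) / q

def weighted_mean_loss(loss, observed, unannotated_weight):
    """Weighted average of per-label losses, down-weighting unannotated entries."""
    weights = np.where(observed == 1, 1.0, unannotated_weight)
    return float((weights * loss).sum() / weights.sum())

p = np.array([0.9, 0.5, 0.01])
ce = -np.log(p)                        # standard cross-entropy, unbounded as p -> 0
mae_like = generalized_ce(p, q=1.0)    # equals 1 - p, bounded by 1
near_ce = generalized_ce(p, q=1e-6)    # approaches -log(p) as q -> 0

# One annotated positive and one unannotated label, the latter given half weight.
balanced = weighted_mean_loss(np.array([1.0, 2.0]), np.array([1, 0]), unannotated_weight=0.5)
```

The parameter `q` interpolates between cross-entropy, which punishes confident mistakes heavily, and a bounded loss that is less sensitive to wrongly labeled entries. This is the kind of trade-off between false positives and false negatives that a robust SPML loss has to manage.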

Through extensive experiments, the researchers demonstrate that their approach can significantly improve SPML performance and outperform the vast majority of state-of-the-art methods on all four benchmark datasets.

Critical Analysis

The paper presents a well-designed solution to the SPML problem, addressing the limitations of existing methods. However, as with any research, there are some potential caveats and areas for further exploration:

The researchers focus on improving SPML performance, but do not discuss the broader applicability of their approach to other multi-label learning scenarios, such as Positive-Unlabeled Contrastive Learning.
The experiments are conducted on standard benchmarks, but it would be valuable to see how the method performs on real-world, noisy datasets with more complex label distributions.
The paper does not provide a deep analysis of the failure cases or limitations of the proposed approach, which could guide future research in this area.

Overall, the researchers have made a significant contribution to the field of SPML, but there are still opportunities for further exploration and refinement of the techniques.

Conclusion

This paper presents an innovative approach to addressing the challenges of Single Positive Multi-label Learning (SPML), where each image is associated with only one positive label. The researchers propose a generalized loss framework and a novel robust loss that can better balance the trade-off between false positives and false negatives, while also handling imbalance between positive and negative samples.

The experimental results demonstrate that the researchers' method can significantly improve SPML performance and outperform the vast majority of state-of-the-art methods on all four benchmarks. This work has the potential to advance the field of multi-label learning and enable more accurate and robust classification models, especially in scenarios where comprehensive multi-semantic annotations are difficult to obtain.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Boosting Single Positive Multi-label Classification with Generalized Robust Loss

Yanxi Chen, Chunxiao Li, Xinyang Dai, Jinhuan Li, Weiyu Sun, Yiming Wang, Renyuan Zhang, Tinghe Zhang, Bo Wang

Multi-label learning (MLL) requires comprehensive multi-semantic annotations that are hard to fully obtain, thus often resulting in missing-label scenarios. In this paper, we investigate Single Positive Multi-label Learning (SPML), where each image is associated with merely one positive label. Existing SPML methods only focus on designing losses using mechanisms such as hard pseudo-labeling and robust losses, mostly leading to unacceptable false negatives. To address this issue, we first propose a generalized loss framework based on expected risk minimization to provide soft pseudo labels, and point out that the former losses can be seamlessly converted into our framework. In particular, we design a novel robust loss based on our framework, which enjoys flexible coordination between false positives and false negatives, and can additionally deal with the imbalance between positive and negative samples. Extensive experiments show that our approach can significantly improve SPML performance and outperform the vast majority of state-of-the-art methods on all four benchmarks.

5/7/2024

Can Class-Priors Help Single-Positive Multi-Label Learning?

Biao Liu, Ning Xu, Jie Wang, Xin Geng

Single-positive multi-label learning (SPMLL) is a typical weakly supervised multi-label learning problem, where each training example is annotated with only one positive label. Existing SPMLL methods typically assign pseudo-labels to unannotated labels with the assumption that prior probabilities of all classes are identical. However, the class-prior of each category may differ significantly in real-world scenarios, so this unrealistic assumption makes the predictive model perform worse than expected in real-world applications. To alleviate this issue, a novel framework, Class-pRiors Induced Single-Positive multi-label learning, is proposed. Specifically, a class-priors estimator is introduced, which could estimate the class-priors that are theoretically guaranteed to converge to the ground-truth class-priors. In addition, based on the estimated class-priors, an unbiased risk estimator for classification is derived, and the corresponding risk minimizer could be guaranteed to approximately converge to the optimal risk minimizer on fully supervised data. Experimental results on ten MLL benchmark datasets demonstrate the effectiveness and superiority of our method over existing SPMLL approaches.

5/28/2024

Positive Label Is All You Need for Multi-Label Classification

Zhixiang Yuan, Kaixin Zhang, Tao Huang

Multi-label classification (MLC) faces challenges from label noise in training data due to annotating diverse semantic labels for each image. Current methods mainly target identifying and correcting label mistakes using trained MLC models, but still struggle with persistent noisy labels during training, resulting in imprecise recognition and reduced performance. Our paper addresses label noise in MLC by introducing a positive and unlabeled multi-label classification (PU-MLC) method. To counteract noisy labels, we directly discard negative labels, focusing on the abundance of negative labels and the origin of most noisy labels. PU-MLC employs positive-unlabeled learning, training the model with only positive labels and unlabeled data. The method incorporates adaptive re-balance factors and temperature coefficients in the loss function to address label distribution imbalance and prevent over-smoothing of probabilities during training. Additionally, we introduce a local-global convolution module to capture both local and global dependencies in the image without requiring backbone retraining. PU-MLC proves effective on MLC and MLC with partial labels (MLC-PL) tasks, demonstrating significant improvements on MS-COCO and PASCAL VOC datasets with fewer annotations. Code is available at: https://github.com/TAKELAMAG/PU-MLC.

4/17/2024

Multi-Label Learning with Stronger Consistency Guarantees

Anqi Mao, Mehryar Mohri, Yutao Zhong

We present a detailed study of surrogate losses and algorithms for multi-label learning, supported by $H$-consistency bounds. We first show that, for the simplest form of multi-label loss (the popular Hamming loss), the well-known consistent binary relevance surrogate suffers from a sub-optimal dependency on the number of labels in terms of $H$-consistency bounds, when using smooth losses such as logistic losses. Furthermore, this loss function fails to account for label correlations. To address these drawbacks, we introduce a novel surrogate loss, multi-label logistic loss, that accounts for label correlations and benefits from label-independent $H$-consistency bounds. We then broaden our analysis to cover a more extensive family of multi-label losses, including all common ones and a new extension defined based on linear-fractional functions with respect to the confusion matrix. We also extend our multi-label logistic losses to more comprehensive multi-label comp-sum losses, adapting comp-sum losses from standard classification to the multi-label learning. We prove that this family of surrogate losses benefits from $H$-consistency bounds, and thus Bayes-consistency, across any general multi-label loss. Our work thus proposes a unified surrogate loss framework benefiting from strong consistency guarantees for any multi-label loss, significantly expanding upon previous work which only established Bayes-consistency and for specific loss functions. Additionally, we adapt constrained losses from standard classification to multi-label constrained losses in a similar way, which also benefit from $H$-consistency bounds and thus Bayes-consistency for any multi-label loss. We further describe efficient gradient computation algorithms for minimizing the multi-label logistic loss.

7/19/2024