Local Manifold Learning for No-Reference Image Quality Assessment

Read original: arXiv:2406.19247 - Published 6/28/2024 by Timin Gao, Wensheng Pan, Yan Zhang, Sicheng Zhao, Shengchuan Zhang, Xiawu Zheng, Ke Li, Liujuan Cao, Rongrong Ji

Overview

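The paper proposes a no-reference image quality assessment (NR-IQA) framework that integrates local manifold learning with contrastive learning.
The method samples multiple crops from each image, identifies the most visually salient crop, and uses it to group crops from the same image as positives, while treating crops from other images and non-salient crops from the same image as negatives.
The authors report better performance than state-of-the-art methods on 7 standard datasets, including PLCC values of 0.942 on TID2013 (compared to 0.908) and 0.914 on LIVEC (compared to 0.894).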

Plain English Explanation

The paper presents a new way to assess the quality of images without having access to a reference or high-quality version of the image. This is known as no-reference image quality assessment (NR-IQA).

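The key idea builds on contrastive learning, which trains a model to pull together the features of images with similar quality and push apart the features of images with different quality. The authors argue that existing contrastive methods often neglect the local structure, or "manifold," of the feature space, so hard examples end up too similar to tell apart. Their method takes several crops from each image, finds the most visually salient one, and uses it to decide which crops should be grouped together and which should be kept distinct, including non-salient crops from the same image.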

The authors report that this approach performs better than other state-of-the-art NR-IQA techniques on 7 standard datasets. It offers a way to assess image quality automatically when no reference image is available for comparison.

Technical Explanation

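The paper proposes a framework that integrates local manifold learning with contrastive learning for no-reference image quality assessment (NR-IQA). Contrastive learning minimizes the distance between quality-similar (positive) examples and maximizes the distance between quality-dissimilar (negative) examples. The authors observe that current contrastive methods often neglect the local manifold structure, which can leave hard examples highly similar in feature space and difficult to differentiate.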

The method begins by sampling multiple crops from a given image and identifying the most visually salient crop. This crop is then used to cluster other crops from the same image as the positive class, while crops from different images are treated as negative classes to increase inter-class distance.
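The crop sampling and salient-crop selection described in the paper's abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names `sample_crops`, `saliency_proxy`, and `pick_anchor` are hypothetical, and the saliency score (mean gradient magnitude) is an assumption standing in for whatever saliency measure the paper actually uses.

```python
import numpy as np

def sample_crops(image, crop_size, num_crops, rng):
    """Sample square crops at uniformly random positions from an H x W (x C) image."""
    h, w = image.shape[:2]
    tops = rng.integers(0, h - crop_size + 1, size=num_crops)
    lefts = rng.integers(0, w - crop_size + 1, size=num_crops)
    return [image[t:t + crop_size, l:l + crop_size] for t, l in zip(tops, lefts)]

def saliency_proxy(crop):
    """Hypothetical saliency score: mean gradient magnitude of the grayscale crop.

    Assumption for illustration only; the paper's saliency measure may differ.
    """
    gray = crop.mean(axis=-1) if crop.ndim == 3 else crop
    gy, gx = np.gradient(gray.astype(np.float64))
    return float(np.hypot(gx, gy).mean())

def pick_anchor(crops):
    """Return the index of the highest-scoring crop and the scores of all crops."""
    scores = np.array([saliency_proxy(c) for c in crops])
    return int(np.argmax(scores)), scores
```

Given an image array and a generator such as `np.random.default_rng(0)`, `pick_anchor(sample_crops(image, 16, 6, rng))` returns the index of the crop that would act as the reference for grouping the remaining crops.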

The distinctive step is that non-salient crops from the same image are treated as intra-class negatives. This is intended to preserve their distinctiveness, and with it the local manifold structure, rather than forcing all crops of an image toward the same representation.
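To make the role of the different negative groups concrete, here is a small sketch of a generic InfoNCE-style contrastive loss in which intra-class negatives (non-salient crops from the same image) and inter-class negatives (crops from other images) both appear in the denominator. The abstract does not state the loss used in the paper, so the loss form, the `temperature` value, and the function names are assumptions for illustration.

```python
import numpy as np

def l2_normalize(x, eps=1e-12):
    """Scale feature vectors to unit length so dot products are cosine similarities."""
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)

def contrastive_loss(anchor, positives, intra_negatives, inter_negatives,
                     temperature=0.1):
    """Generic InfoNCE-style loss (an assumed stand-in, not the paper's loss).

    anchor:          (D,)   feature of the most salient crop
    positives:       (P, D) features of same-image crops grouped with the anchor
    intra_negatives: (M, D) features of non-salient crops from the same image
    inter_negatives: (K, D) features of crops from other images
    """
    a = l2_normalize(np.asarray(anchor, dtype=np.float64))
    pos = l2_normalize(np.asarray(positives, dtype=np.float64))
    neg = l2_normalize(np.concatenate([intra_negatives, inter_negatives], axis=0)
                       .astype(np.float64))

    pos_logits = pos @ a / temperature  # shape (P,)
    neg_logits = neg @ a / temperature  # shape (M + K,)

    # One row per positive: [positive logit, all negative logits].
    logits = np.concatenate(
        [pos_logits[:, None], np.tile(neg_logits, (len(pos_logits), 1))], axis=1
    )
    # Numerically stable log-sum-exp over each row.
    row_max = logits.max(axis=1)
    log_denominator = row_max + np.log(np.exp(logits - row_max[:, None]).sum(axis=1))
    return float(np.mean(log_denominator - pos_logits))
```

In this sketch the loss drops when positives sit close to the anchor and both kinds of negatives sit far from it, which is the behavior the contrastive setup relies on.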

Finally, the method employs a mutual learning framework, which further enhances the model's ability to adaptively learn and identify visually salient regions. The trained model can then predict the perceived quality of a new image without requiring a reference.

The approach is evaluated on 7 standard datasets and reported to outperform state-of-the-art methods, achieving PLCC values of 0.942 on TID2013 (compared to 0.908) and 0.914 on LIVEC (compared to 0.894).
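The paper's results are quoted as PLCC, the Pearson linear correlation coefficient between predicted quality scores and subjective scores. A minimal sketch of that metric is shown below; the function name `plcc` is chosen here for illustration, and the sketch omits the nonlinear score mapping that IQA evaluations sometimes apply before computing the correlation.

```python
import numpy as np

def plcc(predicted, subjective):
    """Pearson linear correlation coefficient between two score vectors."""
    x = np.asarray(predicted, dtype=np.float64)
    y = np.asarray(subjective, dtype=np.float64)
    xc = x - x.mean()  # center predictions
    yc = y - y.mean()  # center subjective scores
    return float((xc @ yc) / np.sqrt((xc @ xc) * (yc @ yc)))
```

Values close to 1 indicate that predictions vary linearly with human judgments, so a PLCC of 0.942 corresponds to a strong linear agreement.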

Critical Analysis

The abstract reports results on 7 standard datasets, which suggests a broad evaluation. One likely limitation is the method's dependence on identifying the most visually salient crop: the positive and negative assignments are built on that choice, so errors in saliency selection could propagate into the learned features.

Additionally, it would be useful to see how the learned feature space is organized, for example whether hard examples with different quality scores are in fact better separated than with standard contrastive learning. Analysis in this direction could help clarify the model's strengths and weaknesses.

It would also be valuable to investigate the performance of the method on more diverse and challenging image datasets, as the evaluation is primarily conducted on standard benchmark datasets.

Overall, the research contributes a promising NR-IQA technique that is reported to outperform existing methods. The idea of preserving local manifold structure during contrastive learning could potentially be extended to other image analysis tasks beyond quality assessment.

Conclusion

The paper introduces a framework that combines local manifold learning with contrastive learning for no-reference image quality assessment (NR-IQA). By using the most visually salient crop to organize positives and negatives, and by treating non-salient crops from the same image as intra-class negatives, the method aims to keep hard examples distinguishable in feature space while assessing image quality without a reference.

The authors report that the technique outperforms state-of-the-art NR-IQA methods on 7 standard datasets. The research offers a new perspective on preserving local manifold structure in contrastive learning for image quality assessment, which could have broader implications for other visual analysis problems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Local Manifold Learning for No-Reference Image Quality Assessment

Timin Gao, Wensheng Pan, Yan Zhang, Sicheng Zhao, Shengchuan Zhang, Xiawu Zheng, Ke Li, Liujuan Cao, Rongrong Ji

Contrastive learning has considerably advanced the field of Image Quality Assessment (IQA), emerging as a widely adopted technique. The core mechanism of contrastive learning involves minimizing the distance between quality-similar (positive) examples while maximizing the distance between quality-dissimilar (negative) examples. Despite its successes, current contrastive learning methods often neglect the importance of preserving the local manifold structure. This oversight can result in a high degree of similarity among hard examples within the feature space, thereby impeding effective differentiation and assessment. To address this issue, we propose an innovative framework that integrates local manifold learning with contrastive learning for No-Reference Image Quality Assessment (NR-IQA). Our method begins by sampling multiple crops from a given image, identifying the most visually salient crop. This crop is then used to cluster other crops from the same image as the positive class, while crops from different images are treated as negative classes to increase inter-class distance. Uniquely, our approach also considers non-saliency crops from the same image as intra-class negative classes to preserve their distinctiveness. Additionally, we employ a mutual learning framework, which further enhances the model's ability to adaptively learn and identify visual saliency regions. Our approach demonstrates a better performance compared to state-of-the-art methods in 7 standard datasets, achieving PLCC values of 0.942 (compared to 0.908 in TID2013) and 0.914 (compared to 0.894 in LIVEC).

6/28/2024

MSLIQA: Enhancing Learning Representations for Image Quality Assessment through Multi-Scale Learning

Nasim Jamshidi Avanaki, Abhijay Ghildyal, Nabajeet Barman, Saman Zadtootaghaj

No-Reference Image Quality Assessment (NR-IQA) remains a challenging task due to the diversity of distortions and the lack of large annotated datasets. Many studies have attempted to tackle these challenges by developing more accurate NR-IQA models, often employing complex and computationally expensive networks, or by bridging the domain gap between various distortions to enhance performance on test datasets. In our work, we improve the performance of a generic lightweight NR-IQA model by introducing a novel augmentation strategy that boosts its performance by almost 28%. This augmentation strategy enables the network to better discriminate between different distortions in various parts of the image by zooming in and out. Additionally, the inclusion of test-time augmentation further enhances performance, making our lightweight network's results comparable to the current state-of-the-art models, simply through the use of augmentations.

9/9/2024

Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity

Xudong Li, Timin Gao, Runze Hu, Yan Zhang, Shengchuan Zhang, Xiawu Zheng, Jingyuan Zheng, Yunhang Shen, Ke Li, Yutao Liu, Pingyang Dai, Rongrong Ji

The current state-of-the-art No-Reference Image Quality Assessment (NR-IQA) methods typically rely on feature extraction from upstream semantic backbone networks, assuming that all extracted features are relevant. However, we make a key observation that not all features are beneficial, and some may even be harmful, necessitating careful selection. Empirically, we find that many image pairs with small feature spatial distances can have vastly different quality scores, indicating that the extracted features may contain a significant amount of quality-irrelevant noise. To address this issue, we propose a Quality-Aware Feature Matching IQA Metric (QFM-IQM) that employs an adversarial perspective to remove harmful semantic noise features from the upstream task. Specifically, QFM-IQM enhances the semantic noise distinguish capabilities by matching image pairs with similar quality scores but varying semantic features as adversarial semantic noise and adaptively adjusting the upstream task's features by reducing sensitivity to adversarial noise perturbation. Furthermore, we utilize a distillation framework to expand the dataset and improve the model's generalization ability. Our approach achieves superior performance to the state-of-the-art NR-IQA methods on eight standard IQA datasets.

5/28/2024

Contrastive Learning for Image Complexity Representation

Shipeng Liu, Liang Zhao, Dengfeng Chen, Zhanping Song

Quantifying and evaluating image complexity can be instrumental in enhancing the performance of various computer vision tasks. Supervised learning can effectively learn image complexity features from well-annotated datasets. However, creating such datasets requires expensive manual annotation costs. The models may learn human subjective biases from it. In this work, we introduce the MoCo v2 framework. We utilize contrastive learning to represent image complexity, named CLIC (Contrastive Learning for Image Complexity). We find that there are complexity differences between different local regions of an image, and propose Random Crop and Mix (RCM), which can produce positive samples consisting of multi-scale local crops. RCM can also expand the train set and increase data diversity without introducing additional data. We conduct extensive experiments with CLIC, comparing it with both unsupervised and supervised methods. The results demonstrate that the performance of CLIC is comparable to that of state-of-the-art supervised methods. In addition, we establish the pipelines that can apply CLIC to computer vision tasks to effectively improve their performance.

8/7/2024