Deep Feature Statistics Mapping for Generalized Screen Content Image Quality Assessment

Read original: arXiv:2209.05321 - Published 4/23/2024 by Baoliang Chen, Hanwei Zhu, Lingyu Zhu, Shiqi Wang, Sam Kwong

🤿

Overview

This paper investigates the statistical properties of screen content images (SCIs), which are typically computer-generated, and how they differ from natural images.
The researchers propose a new approach, Deep Feature Statistics-based SCI Quality Assessment (DFSS-IQA), to effectively assess the quality of SCIs.
The key idea is that even though SCIs do not follow the same statistical patterns as natural images, they still exhibit certain regularities that can be learned and leveraged for quality assessment.
The proposed method is shown to outperform existing no-reference image quality assessment (NR-IQA) models, particularly in cross-dataset evaluations.

Plain English Explanation

Natural images, like those captured by cameras, have certain statistical patterns that are well-understood. These patterns, known as natural scene statistics, play an important role in assessing the quality of images. However, screen content images (SCIs), which are typically computer-generated, do not follow these same statistical patterns.

The researchers in this paper set out to understand the statistical properties of SCIs and use this knowledge to develop a new way to evaluate their quality. They hypothesized that even though SCIs are not physically captured, they still have certain regularities that could be learned and used for quality assessment.

The researchers developed a deep learning-based model called DFSS-IQA that can effectively capture the statistical characteristics of SCIs. This model was shown to outperform existing quality assessment approaches, especially when tested on different datasets (known as "cross-dataset" evaluation).

The key insight is that even though SCIs may not follow the same natural patterns as regular photos, they still have their own underlying statistical structure that can be uncovered and leveraged for accurate quality evaluation. This research represents an important step in understanding and working with the unique properties of computer-generated visual content.

Technical Explanation

The researchers started by acknowledging that natural scene statistics play a crucial role in no-reference image quality assessment (NR-IQA), but that these statistics do not apply to screen content images (SCIs), which are typically computer-generated.

To address this, the researchers proposed a new approach called Deep Feature Statistics-based SCI Quality Assessment (DFSS-IQA). The underlying idea is that even though SCIs are not physically acquired, they still exhibit certain statistical regularities that can be learned and leveraged for quality assessment.

The DFSS-IQA model uses a deep learning architecture to capture the statistical characteristics of SCIs. Specifically, the model extracts deep features from SCI samples and models their statistical distributions. These learned feature statistics are then used to assess the quality of new SCI inputs.

The researchers conducted extensive experiments to evaluate the performance of DFSS-IQA. They compared it to existing NR-IQA models on various SCI datasets, including AIGIQA-20K, and found that DFSS-IQA outperformed the other methods, especially in cross-dataset evaluations.

The promising results demonstrate that the statistical properties of SCIs, though different from natural images, can be effectively learned and used for accurate quality assessment. This work represents an important step towards understanding and handling the unique characteristics of computer-generated visual content.

Critical Analysis

The researchers acknowledge that their work is the first attempt to learn the statistics of SCIs, and as such, there is room for further research and refinement.

One potential limitation is that the proposed DFSS-IQA model may not capture all the nuances and complexities of SCI statistics. As the researchers note, there may be additional statistical properties or higher-order relationships that could be explored to further improve the quality assessment capabilities.

Additionally, the experiments were conducted on a limited number of SCI datasets, and it would be valuable to see the model's performance evaluated on a broader range of SCI data, including those with different characteristics or from different sources.

Another area for potential investigation is the interpretability of the learned feature statistics. Understanding the specific statistical patterns that the model identifies as indicative of SCI quality could provide valuable insights and inform future advancements in this area.

Overall, this research represents an important step forward in understanding and working with the unique properties of computer-generated visual content. The proposed DFSS-IQA model demonstrates promising results, and the insights gained from this work could inspire further research and development in the field of no-reference image quality assessment for screen content images.

Conclusion

This paper presents a novel approach, DFSS-IQA, for assessing the quality of screen content images (SCIs), which are typically computer-generated and do not follow the same statistical patterns as natural images.

The key contribution of this work is the recognition that even though SCIs are not physically acquired, they still exhibit certain statistical regularities that can be learned and leveraged for effective quality assessment. The researchers developed a deep learning-based model to capture these SCI-specific statistical characteristics and demonstrated its superior performance compared to existing no-reference image quality assessment (NR-IQA) methods.

The successful implementation and evaluation of DFSS-IQA, particularly in cross-dataset settings, highlight the importance of understanding and adapting to the unique properties of computer-generated visual content. This research represents an important step forward in the field of image quality assessment and has the potential to enable more accurate and reliable evaluation of screen content images.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🤿

Deep Feature Statistics Mapping for Generalized Screen Content Image Quality Assessment

Baoliang Chen, Hanwei Zhu, Lingyu Zhu, Shiqi Wang, Sam Kwong

The statistical regularities of natural images, referred to as natural scene statistics, play an important role in no-reference image quality assessment. However, it has been widely acknowledged that screen content images (SCIs), which are typically computer generated, do not hold such statistics. Here we make the first attempt to learn the statistics of SCIs, based upon which the quality of SCIs can be effectively determined. The underlying mechanism of the proposed approach is based upon the mild assumption that the SCIs, which are not physically acquired, still obey certain statistics that could be understood in a learning fashion. We empirically show that the statistics deviation could be effectively leveraged in quality assessment, and the proposed method is superior when evaluated in different settings. Extensive experimental results demonstrate the Deep Feature Statistics based SCI Quality Assessment (DFSS-IQA) model delivers promising performance compared with existing NR-IQA models and shows a high generalization capability in the cross-dataset settings. The implementation of our method is publicly available at https://github.com/Baoliang93/DFSS-IQA.

4/23/2024

Opinion-Unaware Blind Image Quality Assessment using Multi-Scale Deep Feature Statistics

Zhangkai Ni, Yue Liu, Keyan Ding, Wenhan Yang, Hanli Wang, Shiqi Wang

Deep learning-based methods have significantly influenced the blind image quality assessment (BIQA) field, however, these methods often require training using large amounts of human rating data. In contrast, traditional knowledge-based methods are cost-effective for training but face challenges in effectively extracting features aligned with human visual perception. To bridge these gaps, we propose integrating deep features from pre-trained visual models with a statistical analysis model into a Multi-scale Deep Feature Statistics (MDFS) model for achieving opinion-unaware BIQA (OU-BIQA), thereby eliminating the reliance on human rating data and significantly improving training efficiency. Specifically, we extract patch-wise multi-scale features from pre-trained vision models, which are subsequently fitted into a multivariate Gaussian (MVG) model. The final quality score is determined by quantifying the distance between the MVG model derived from the test image and the benchmark MVG model derived from the high-quality image set. A comprehensive series of experiments conducted on various datasets show that our proposed model exhibits superior consistency with human visual perception compared to state-of-the-art BIQA models. Furthermore, it shows improved generalizability across diverse target-specific BIQA tasks. Our code is available at: https://github.com/eezkni/MDFS

5/30/2024

S-IQA Image Quality Assessment With Compressive Sampling

Ronghua Liao, Chen Hui, Lang Yuan, Haiqi Zhu, Feng Jiang

No-Reference Image Quality Assessment (NR-IQA) aims at estimating image quality in accordance with subjective human perception. However, most methods focus on exploring increasingly complex networks to improve the final performance,accompanied by limitations on input images. Especially when applied to high-resolution (HR) images, these methods offen have to adjust the size of original image to meet model input.To further alleviate the aforementioned issue, we propose two networks for NR-IQA with Compressive Sampling (dubbed CL-IQA and CS-IQA). They consist of four components: (1) The Compressed Sampling Module (CSM) to sample the image (2)The Adaptive Embedding Module (AEM). The measurements are embedded by AEM to extract high-level features. (3) The Vision Transformer and Scale Swin TranBlocksformer Moudle(SSTM) to extract deep features. (4) The Dual Branch (DB) to get final quality score. Experiments show that our proposed methods outperform other methods on various datasets with less data usage.

9/12/2024

Sliced Maximal Information Coefficient: A Training-Free Approach for Image Quality Assessment Enhancement

Kang Xiao, Xu Wang, Yulin He, Baoliang Chen, Xuelin Shen

Full-reference image quality assessment (FR-IQA) models generally operate by measuring the visual differences between a degraded image and its reference. However, existing FR-IQA models including both the classical ones (eg, PSNR and SSIM) and deep-learning based measures (eg, LPIPS and DISTS) still exhibit limitations in capturing the full perception characteristics of the human visual system (HVS). In this paper, instead of designing a new FR-IQA measure, we aim to explore a generalized human visual attention estimation strategy to mimic the process of human quality rating and enhance existing IQA models. In particular, we model human attention generation by measuring the statistical dependency between the degraded image and the reference image. The dependency is captured in a training-free manner by our proposed sliced maximal information coefficient and exhibits surprising generalization in different IQA measures. Experimental results verify the performance of existing IQA models can be consistently improved when our attention module is incorporated. The source code is available at https://github.com/KANGX99/SMIC.

8/20/2024