Adapting Pretrained Networks for Image Quality Assessment on High Dynamic Range Displays

Read original: arXiv:2405.00670 - Published 5/2/2024 by Andrei Chubarau, Hyunjin Yoo, Tara Akhavan, James Clark

Adapting Pretrained Networks for Image Quality Assessment on High Dynamic Range Displays

Overview

This paper explores how to adapt pre-trained computer vision models for the task of image quality assessment on high dynamic range (HDR) displays.
The researchers investigate techniques to fine-tune and retrain existing models to better handle the unique challenges of HDR imaging, such as a wider luminance range and increased contrast.
The goal is to develop more accurate and reliable methods for evaluating the visual quality of HDR images and videos, which is crucial for emerging HDR display technologies.

Plain English Explanation

Computers can now generate and display images with a much wider range of brightness levels than traditional screens. These "high dynamic range" (HDR) images can show very bright and very dark areas in the same picture, providing a more realistic and immersive viewing experience.

However, evaluating the quality of HDR images is tricky. Existing image quality assessment models, which were designed for standard dynamic range (SDR) content, don't work well with HDR. The researchers in this paper explore ways to adapt or "fine-tune" those pre-trained models to better handle the unique characteristics of HDR.

By modifying the inner workings of the models and retraining them on HDR data, the researchers aim to create quality assessment tools that are more accurate and reliable for emerging HDR display technologies. This is an important step towards ensuring HDR content looks its best on the latest high-end TVs, monitors, and other devices.

Technical Explanation

The paper proposes two main approaches for adapting pre-trained image quality assessment (IQA) models for HDR:

Fine-tuning: The researchers take a pre-trained IQA model, such as one designed for SDR content, and fine-tune it on HDR data. This involves retraining the final layers of the model to specialize in HDR quality prediction, while keeping the lower-level feature extraction layers frozen.
Retraining: Instead of fine-tuning, the researchers completely retrain the IQA model from scratch, this time using only HDR training data. This allows the model to learn HDR-specific features and characteristics from the ground up.

The paper evaluates these techniques on several publicly available HDR image and video quality datasets. The results show that both fine-tuning and retraining can lead to significant improvements over using the original pre-trained IQA models on HDR content.

Interestingly, the researchers find that fine-tuning tends to work better when the pre-trained model was originally designed for natural images, while retraining works better for models designed for synthetic graphics. This suggests the underlying architecture of the pre-trained model plays an important role in how effectively it can be adapted to HDR.

Critical Analysis

The paper presents a thoughtful and thorough investigation into adapting pre-trained models for HDR image quality assessment. The researchers explore multiple adaptation techniques and analyze their performance across a range of HDR datasets.

One potential limitation is that the experiments are primarily focused on static images, rather than dynamic HDR video content. The researchers acknowledge this and suggest future work could extend the techniques to video quality assessment as well.

Additionally, the paper does not delve into the specific neural network architectures or training hyperparameters used in the experiments. More implementation details would allow for better reproducibility and further exploration of the adaptation approaches.

Overall, this research makes a valuable contribution towards developing more robust and accurate methods for evaluating the visual quality of HDR media. As HDR displays become more widespread, tools like these will be crucial for ensuring content creators and consumers can get the most out of the new technology.

Conclusion

This paper explores effective techniques for adapting pre-trained computer vision models to the task of image quality assessment on high dynamic range (HDR) displays. By fine-tuning or retraining existing models on HDR data, the researchers demonstrate significant improvements in the ability to accurately evaluate the visual quality of HDR images.

As HDR technology continues to advance and become more prevalent in consumer electronics, tools for assessing HDR content quality will be increasingly important. The approaches outlined in this paper represent an important step towards creating more reliable and versatile HDR image and video quality assessment models. This research could have far-reaching implications for ensuring the best possible viewing experience on the latest high-end HDR displays.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Adapting Pretrained Networks for Image Quality Assessment on High Dynamic Range Displays

Andrei Chubarau, Hyunjin Yoo, Tara Akhavan, James Clark

Conventional image quality metrics (IQMs), such as PSNR and SSIM, are designed for perceptually uniform gamma-encoded pixel values and cannot be directly applied to perceptually non-uniform linear high-dynamic-range (HDR) colors. Similarly, most of the available datasets consist of standard-dynamic-range (SDR) images collected in standard and possibly uncontrolled viewing conditions. Popular pre-trained neural networks are likewise intended for SDR inputs, restricting their direct application to HDR content. On the other hand, training HDR models from scratch is challenging due to limited available HDR data. In this work, we explore more effective approaches for training deep learning-based models for image quality assessment (IQA) on HDR data. We leverage networks pre-trained on SDR data (source domain) and re-target these models to HDR (target domain) with additional fine-tuning and domain adaptation. We validate our methods on the available HDR IQA datasets, demonstrating that models trained with our combined recipe outperform previous baselines, converge much quicker, and reliably generalize to HDR inputs.

5/2/2024

🛠️

Perceptual Assessment and Optimization of High Dynamic Range Image Rendering

Peibei Cao, Rafal K. Mantiuk, Kede Ma

High dynamic range (HDR) rendering has the ability to faithfully reproduce the wide luminance ranges in natural scenes, but how to accurately assess the rendering quality is relatively underexplored. Existing quality models are mostly designed for low dynamic range (LDR) images, and do not align well with human perception of HDR image quality. To fill this gap, we propose a family of HDR quality metrics, in which the key step is employing a simple inverse display model to decompose an HDR image into a stack of LDR images with varying exposures. Subsequently, these decomposed images are assessed through well-established LDR quality metrics. Our HDR quality models present three distinct benefits. First, they directly inherit the recent advancements of LDR quality metrics. Second, they do not rely on human perceptual data of HDR image quality for re-calibration. Third, they facilitate the alignment and prioritization of specific luminance ranges for more accurate and detailed quality assessment. Experimental results show that our HDR quality metrics consistently outperform existing models in terms of quality assessment on four HDR image quality datasets and perceptual optimization of HDR novel view synthesis.

6/18/2024

🏋️

FastHDRNet: A new efficient method for SDR-to-HDR Translation

Siyuan Tian, Hao Wang, Yiren Rong, Junhao Wang, Renjie Dai, Zhengxiao He

Modern displays nowadays possess the capability to render video content with a high dynamic range (HDR) and an extensive color gamut .However, the majority of available resources are still in standard dynamic range (SDR). Therefore, we need to identify an effective methodology for this objective.The existing deep neural networks (DNN) based SDR to HDR conversion methods outperforms conventional methods, but they are either too large to implement or generate some terrible artifacts. We propose a neural network for SDR to HDR conversion, termed FastHDRNet. This network includes two parts, Adaptive Universal Color Transformation (AUCT) and Local Enhancement (LE). The architecture is designed as a lightweight network that utilizes global statistics and local information with super high efficiency. After the experiment, we find that our proposed method achieves state-of-the-art performance in both quantitative comparisons and visual quality with a lightweight structure and a enhanced infer speed.

5/14/2024

Efficient HDR Reconstruction from Real-World Raw Images

Qirui Yang, Yihao Liu, Qihua Chen, Huanjing Yue, Kun Li, Jingyu Yang

The widespread usage of high-definition screens on edge devices stimulates a strong demand for efficient high dynamic range (HDR) algorithms. However, many existing HDR methods either deliver unsatisfactory results or consume too much computational and memory resources, hindering their application to high-resolution images (usually with more than 12 megapixels) in practice. In addition, existing HDR dataset collection methods often are labor-intensive. In this work, in a new aspect, we discover an excellent opportunity for HDR reconstructing directly from raw images and investigating novel neural network structures that benefit the deployment of mobile devices. Our key insights are threefold: (1) we develop a lightweight-efficient HDR model, RepUNet, using the structural re-parameterization technique to achieve fast and robust HDR; (2) we design a new computational raw HDR data formation pipeline and construct a real-world raw HDR dataset, RealRaw-HDR; (3) we propose a plug-and-play motion alignment loss to mitigate motion ghosting under limited bandwidth conditions. Our model contains less than 830K parameters and takes less than 3 ms to process an image of 4K resolution using one RTX 3090 GPU. While being highly efficient, our model also outperforms the state-of-the-art HDR methods in terms of PSNR, SSIM, and a color difference metric.

6/6/2024