YouTube SFV+HDR Quality Dataset

Read original: arXiv:2406.05305 - Published 6/24/2024 by Yilin Wang, Joong Gon Yim, Neil Birkbeck, Balu Adsumilli

Overview

The popularity of Short-form Videos (SFV) has grown dramatically in recent years, with billions of viewers.
High Dynamic Range (HDR) has also become more popular on video sharing platforms.
This paper explores whether SFV+HDR quality assessment is significantly different from traditional User Generated Content (UGC) quality assessment, and whether existing objective quality metrics designed for UGC still work well for SFV+HDR.

Plain English Explanation

The researchers created the first large-scale dataset of Short-form Videos (SFVs) with High Dynamic Range (HDR), which includes subjective quality scores. This dataset covers 10 popular content categories and uses a sampling framework to ensure it is representative.

The researchers then analyzed the subjective quality scores for short SDR (Standard Dynamic Range) and HDR videos. They also evaluated how well the state-of-the-art UGC (User Generated Content) quality metrics perform on this new SFV+HDR dataset, and discussed potential improvements.

The key questions the researchers aimed to answer are:

1) Is the quality assessment for SFV+HDR significantly different from traditional UGC quality assessment?
2) Do the objective quality metrics designed for traditional UGC still work well for SFV+HDR?

Technical Explanation

The researchers created a new large-scale dataset of SFV+HDR videos, with reliable subjective quality scores, to investigate these questions. They used a sampling framework to maximize the representativeness of the dataset, covering 10 popular content categories.
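This summary does not describe how the sampling framework works, so the sketch below is only an illustration of the general idea of category-balanced selection, not the authors' method. It assumes a simple equal-allocation stratified sample; the function name `stratified_sample`, the video IDs, and the category labels are all hypothetical.

```python
import random
from collections import defaultdict


def stratified_sample(videos, per_category, seed=0):
    """Pick up to `per_category` video IDs from each content category.

    `videos` is an iterable of (video_id, category) pairs. This is an
    assumed, simplified stand-in for a representativeness-aware sampler.
    """
    rng = random.Random(seed)  # fixed seed so the selection is repeatable

    # Group the candidate pool by category.
    by_category = defaultdict(list)
    for video_id, category in videos:
        by_category[category].append(video_id)

    # Draw the same number of videos from every category (or all of them
    # if a category has fewer candidates than requested).
    sample = {}
    for category, ids in sorted(by_category.items()):
        k = min(per_category, len(ids))
        sample[category] = rng.sample(ids, k)
    return sample


# Hypothetical pool: 200 candidate videos spread over 10 made-up categories.
pool = [(f"vid{i:03d}", f"category_{i % 10}") for i in range(200)]
sample = stratified_sample(pool, per_category=5)
```

Equal allocation keeps rare categories from being crowded out by popular ones. A real framework would likely also balance other attributes, such as resolution or dynamic range, which this sketch ignores.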

The researchers then conducted a comprehensive analysis of the subjective quality scores for the short SDR and HDR videos in the dataset. They also evaluated the performance of state-of-the-art UGC quality metrics on this new SFV+HDR dataset, and discussed potential areas for improvement.
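The summary does not say which measures the authors used to score the metrics. A common convention in video quality research is to correlate a metric's predictions with the subjective scores using the Spearman rank-order correlation (SROCC) and the Pearson linear correlation (PLCC). The sketch below shows that convention with the standard library only; the score values are made up for illustration and do not come from the dataset.

```python
import math


def pearson(x, y):
    """Pearson linear correlation coefficient (PLCC) of two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (std_x * std_y)


def average_ranks(values):
    """1-based ranks, with tied values sharing the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j to cover the run of values tied with position i.
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank-order correlation (SROCC): Pearson applied to the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Made-up example: subjective scores and one metric's predictions for 5 videos.
mos = [4.2, 3.1, 2.5, 4.8, 3.9]
predicted = [3.9, 3.0, 2.9, 4.5, 4.1]

srocc = spearman(mos, predicted)
plcc = pearson(mos, predicted)
print(f"SROCC={srocc:.3f}  PLCC={plcc:.3f}")
```

SROCC only checks whether the metric orders videos the same way viewers do, while PLCC also rewards a linear relationship between predicted and subjective scores. Reporting the two separately for SDR and HDR clips is one way to see whether a metric degrades on HDR content.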

Critical Analysis

The researchers describe their dataset as the first large-scale dataset covering SFV+HDR content. This is an important step forward for video quality research, because the growing popularity of these formats calls for a better understanding of how to assess their quality.

However, the researchers do not discuss the potential limitations of their dataset or sampling framework. It would be helpful to know whether there are biases or gaps in the content coverage, and how they could affect the generalizability of the findings.

Additionally, the paper does not provide a detailed analysis of why the existing UGC quality metrics may or may not perform well on the SFV+HDR dataset. Further insights into the specific factors that contribute to quality differences between traditional UGC and SFV+HDR would strengthen the paper's conclusions.

Conclusion

This research highlights the need for a deeper understanding of video quality assessment for emerging video formats, such as Short-form Videos (SFVs) with High Dynamic Range (HDR). The new dataset and analysis presented in this paper provide a valuable foundation for future work in this area.

By exploring the differences between SFV+HDR and traditional UGC quality assessment, as well as the performance of existing quality metrics, the researchers have identified an important research direction. Further work on robust and reliable quality assessment methods for these new video formats will be crucial as they gain popularity.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

YouTube SFV+HDR Quality Dataset

Yilin Wang, Joong Gon Yim, Neil Birkbeck, Balu Adsumilli

The popularity of Short form videos (SFV) has grown dramatically in the past few years, and has become a phenomenal video category with billions of viewers. Meanwhile, High Dynamic Range (HDR) as an advanced feature also becomes more and more popular on video sharing platforms. As a hot topic with huge impact, SFV and HDR bring new questions to video quality research: 1) is SFV+HDR quality assessment significantly different from traditional User Generated Content (UGC) quality assessment? 2) do objective quality metrics designed for traditional UGC still work well for SFV+HDR? To answer the above questions, we created the first large scale SFV+HDR dataset with reliable subjective quality scores, covering 10 popular content categories. Further, we also introduce a general sampling framework to maximize the representativeness of the dataset. We provided a comprehensive analysis of subjective quality scores for Short form SDR and HDR videos, and discuss the reliability of state-of-the-art UGC quality metrics and potential improvements.

6/24/2024

Cut-FUNQUE: An Objective Quality Model for Compressed Tone-Mapped High Dynamic Range Videos

Abhinau K. Venkataramanan, Cosmin Stejerean, Ioannis Katsavounidis, Hassene Tmar, Alan C. Bovik

High Dynamic Range (HDR) videos have enjoyed a surge in popularity in recent years due to their ability to represent a wider range of contrast and color than Standard Dynamic Range (SDR) videos. Although HDR video capture has seen increasing popularity because of recent flagship mobile phones such as Apple iPhones, Google Pixels, and Samsung Galaxy phones, a broad swath of consumers still utilize legacy SDR displays that are unable to display HDR videos. As a result, HDR videos must be processed, i.e., tone-mapped, before streaming to a large section of SDR-capable video consumers. However, server-side tone-mapping involves automating decisions regarding the choices of tone-mapping operators (TMOs) and their parameters to yield high-fidelity outputs. Moreover, these choices must be balanced against the effects of lossy compression, which is ubiquitous in streaming scenarios. In this work, we develop a novel, efficient model of objective video quality named Cut-FUNQUE that is able to accurately predict the visual quality of tone-mapped and compressed HDR videos. Finally, we evaluate Cut-FUNQUE on a large-scale crowdsourced database of such videos and show that it achieves state-of-the-art accuracy.

4/23/2024

AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results

Marcos V. Conde, Saman Zadtootaghaj, Nabajeet Barman, Radu Timofte, Chenlong He, Qi Zheng, Ruoxi Zhu, Zhengzhong Tu, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, Zicheng Zhang, Haoning Wu, Yingjie Zhou, Chunyi Li, Xiaohong Liu, Weisi Lin, Guangtao Zhai, Wei Sun, Yuqin Cao, Yanwei Jiang, Jun Jia, Zhichao Zhang, Zijian Chen, Weixia Zhang, Xiongkuo Min, Steve Goring, Zihao Qi, Chen Feng

This paper reviews the AIS 2024 Video Quality Assessment (VQA) Challenge, focused on User-Generated Content (UGC). The aim of this challenge is to gather deep learning-based methods capable of estimating the perceptual quality of UGC videos. The user-generated videos from the YouTube UGC Dataset include diverse content (sports, games, lyrics, anime, etc.), quality and resolutions. The proposed methods must process 30 FHD frames under 1 second. In the challenge, a total of 102 participants registered, and 15 submitted code and models. The performance of the top-5 submissions is reviewed and provided here as a survey of diverse deep models for efficient video quality assessment of user-generated content.

4/26/2024

Adapting Pretrained Networks for Image Quality Assessment on High Dynamic Range Displays

Andrei Chubarau, Hyunjin Yoo, Tara Akhavan, James Clark

Conventional image quality metrics (IQMs), such as PSNR and SSIM, are designed for perceptually uniform gamma-encoded pixel values and cannot be directly applied to perceptually non-uniform linear high-dynamic-range (HDR) colors. Similarly, most of the available datasets consist of standard-dynamic-range (SDR) images collected in standard and possibly uncontrolled viewing conditions. Popular pre-trained neural networks are likewise intended for SDR inputs, restricting their direct application to HDR content. On the other hand, training HDR models from scratch is challenging due to limited available HDR data. In this work, we explore more effective approaches for training deep learning-based models for image quality assessment (IQA) on HDR data. We leverage networks pre-trained on SDR data (source domain) and re-target these models to HDR (target domain) with additional fine-tuning and domain adaptation. We validate our methods on the available HDR IQA datasets, demonstrating that models trained with our combined recipe outperform previous baselines, converge much quicker, and reliably generalize to HDR inputs.

5/2/2024