SR+Codec: a Benchmark of Super-Resolution for Video Compression Bitrate Reduction

Read original: arXiv:2305.04844 - Published 8/21/2024 by Evgeney Bogatyrev, Ivan Molodetskikh, Dmitriy Vatolin

➖

Overview

There has been significant interest in Super-Resolution (SR), which aims to generate a high-resolution image from a low-resolution input.
Deep learning-based SR methods have shown impressive results, but may not perform as well on strongly compressed videos.
Researchers developed a super-resolution benchmark to analyze SR's capacity to upscale compressed videos.
The benchmark employed video codecs based on five widely-used compression standards: H.264, H.265, H.266, AV1, and AVS3.
19 popular SR models were assessed using the benchmark, and their ability to restore details and susceptibility to compression artifacts were evaluated.
A crowd-sourced side-by-side comparison of the SR model outputs was conducted to get an accurate perceptual ranking.
The benchmark is publicly available for further research.

Plain English Explanation

Super-resolution (SR) is a technique that takes a low-quality image or video and tries to make it look sharper and more detailed. This has become a popular area of research in recent years, with deep learning-based methods showing promising results.

However, the researchers noticed that these SR methods might not work as well when the input is a highly compressed video, which is common for online streaming or digital broadcasts. To better understand this, they created a special "benchmark" - a set of test videos and tools to evaluate how different SR models perform on compressed video.

The benchmark used video compression standards like H.264, H.265, and AV1 to create low-quality versions of videos. Then, they tested 19 popular SR models to see how well they could "upscale" these compressed videos and restore the missing details. They also had people visually compare the outputs to get a sense of which models were most effective.

The key findings were that some SR models, when combined with compression, could actually reduce the video bitrate (the amount of data needed) without losing too much quality. The researchers also looked at how well different video quality metrics matched up with what people actually perceived as good quality.

Overall, this benchmark provides a valuable tool for researchers and developers working on super-resolution for compressed video, which is an important real-world application.

Technical Explanation

The researchers developed a super-resolution benchmark for video compression to analyze the capacity of super-resolution (SR) methods to upscale compressed videos. They employed video codecs based on five widely-used compression standards: H.264, H.265, H.266, AV1, and AVS3.

A total of 19 popular SR models were assessed using the benchmark, and their ability to restore details as well as their susceptibility to compression artifacts were evaluated. To obtain an accurate perceptual ranking of the SR models, the researchers conducted a crowd-sourced side-by-side comparison of the model outputs.

The results showed that some SR models, when combined with compression, can reduce the video bitrate without significant loss of quality. The researchers also compared a range of image and video quality metrics with subjective scores to evaluate their accuracy on super-resolved compressed videos.

Critical Analysis

The researchers acknowledge that their benchmark is limited to a specific set of compression standards and SR models, and that further research is needed to explore the performance of SR on a wider range of compressed video content and codecs.

Additionally, the paper does not delve into the specific architectural details or training approaches of the 19 SR models tested, which could provide valuable insights into the strengths and weaknesses of different model designs and learning strategies.

It would also be interesting to see how the performance of these SR models scales with the degree of compression, as well as the impact of different compression parameters on the upscaling quality.

Further research could explore the integration of SR with adaptive bitrate streaming techniques, which could potentially lead to more efficient video delivery systems that maintain high visual quality while reducing bandwidth requirements.

Conclusion

This super-resolution benchmark for video compression provides a valuable tool for researchers and developers working on improving the quality of upscaled video content, particularly in the context of online streaming and digital broadcasts.

The findings suggest that certain SR models, when combined with compression, can reduce the video bitrate without significant loss of quality, which could have significant implications for the efficiency and accessibility of high-quality video content delivery.

The publicly available benchmark dataset and evaluation framework can serve as a starting point for further advancements in this important area of video processing research.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

➖

SR+Codec: a Benchmark of Super-Resolution for Video Compression Bitrate Reduction

Evgeney Bogatyrev, Ivan Molodetskikh, Dmitriy Vatolin

In recent years, there has been significant interest in Super-Resolution (SR), which focuses on generating a high-resolution image from a low-resolution input. Deep learning-based methods for super-resolution have been particularly popular and have shown impressive results on various benchmarks. However, research indicates that these methods may not perform as well on strongly compressed videos. We developed a super-resolution benchmark to analyze SR's capacity to upscale compressed videos. Our dataset employed video codecs based on five widely-used compression standards: H.264, H.265, H.266, AV1, and AVS3. We assessed 19 popular SR models using our benchmark and evaluated their ability to restore details and their susceptibility to compression artifacts. To get an accurate perceptual ranking of SR models, we conducted a crowd-sourced side-by-side comparison of their outputs. We found that some SR models, combined with compression, allow us to reduce the video bitrate without significant loss of quality. We also compared a range of image and video quality metrics with subjective scores to evaluate their accuracy on super-resolved compressed videos. The benchmark is publicly available at https://videoprocessing.ai/benchmarks/super-resolution-for-video-compression.html

8/21/2024

AIM 2024 Challenge on Efficient Video Super-Resolution for AV1 Compressed Content

Marcos V Conde, Zhijun Lei, Wen Li, Christos Bampis, Ioannis Katsavounidis, Radu Timofte

Video super-resolution (VSR) is a critical task for enhancing low-bitrate and low-resolution videos, particularly in streaming applications. While numerous solutions have been developed, they often suffer from high computational demands, resulting in low frame rates (FPS) and poor power efficiency, especially on mobile platforms. In this work, we compile different methods to address these challenges, the solutions are end-to-end real-time video super-resolution frameworks optimized for both high performance and low runtime. We also introduce a new test set of high-quality 4K videos to further validate the approaches. The proposed solutions tackle video up-scaling for two applications: 540p to 4K (x4) as a general case, and 360p to 1080p (x3) more tailored towards mobile devices. In both tracks, the solutions have a reduced number of parameters and operations (MACs), allow high FPS, and improve VMAF and PSNR over interpolation baselines. This report gauges some of the most efficient video super-resolution methods to date.

9/27/2024

📉

Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey

Marcos V. Conde, Zhijun Lei, Wen Li, Cosmin Stejerean, Ioannis Katsavounidis, Radu Timofte, Kihwan Yoon, Ganzorig Gankhuyag, Jiangtao Lv, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Zhiyuan Li, Hao Wei, Chenyang Ge, Dongyang Zhang, Tianle Liu, Huaian Chen, Yi Jin, Menghan Zhou, Yiqiang Yan, Si Gao, Biao Wu, Shaoli Liu, Chengjian Zheng, Diankai Zhang, Ning Wang, Xintao Qiu, Yuanbo Zhou, Kongxian Wu, Xinwei Dai, Hui Tang, Wei Deng, Qingquan Gao, Tong Tong, Jae-Hyeon Lee, Ui-Jin Choi, Min Yan, Xin Liu, Qian Wang, Xiaoqian Ye, Zhan Du, Tiansen Zhang, Long Peng, Jiaming Guo, Xin Di, Bohao Liao, Zhibo Du, Peize Xia, Renjing Pei, Yang Wang, Yang Cao, Zhengjun Zha, Bingnan Han, Hongyuan Yu, Zhuoyuan Wu, Cheng Wan, Yuqing Liu, Haodong Yu, Jizhe Li, Zhijuan Huang, Yuan Huang, Yajun Zou, Xianyu Guan, Qi Jia, Heng Zhang, Xuanwu Yin, Kunlong Zuo, Hyeon-Cheol Moon, Tae-hyun Jeong, Yoonmo Yang, Jae-Gon Kim, Jinwoo Jeong, Sunjei Kim

This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF codec, instead of JPEG. All the proposed methods improve PSNR fidelity over Lanczos interpolation, and process images under 10ms. Out of the 160 participants, 25 teams submitted their code and models. The solutions present novel designs tailored for memory-efficiency and runtime on edge devices. This survey describes the best solutions for real-time SR of compressed high-resolution images.

4/26/2024

See More Details: Efficient Image Super-Resolution by Experts Mining

Eduard Zamfir, Zongwei Wu, Nancy Mehta, Yulun Zhang, Radu Timofte

Reconstructing high-resolution (HR) images from low-resolution (LR) inputs poses a significant challenge in image super-resolution (SR). While recent approaches have demonstrated the efficacy of intricate operations customized for various objectives, the straightforward stacking of these disparate operations can result in a substantial computational burden, hampering their practical utility. In response, we introduce SeemoRe, an efficient SR model employing expert mining. Our approach strategically incorporates experts at different levels, adopting a collaborative methodology. At the macro scale, our experts address rank-wise and spatial-wise informative features, providing a holistic understanding. Subsequently, the model delves into the subtleties of rank choice by leveraging a mixture of low-rank experts. By tapping into experts specialized in distinct key factors crucial for accurate SR, our model excels in uncovering intricate intra-feature details. This collaborative approach is reminiscent of the concept of see more, allowing our model to achieve an optimal performance with minimal computational costs in efficient settings. The source will be publicly made available at https://github.com/eduardzamfir/seemoredetails

6/7/2024