Domain-adaptive Video Deblurring via Test-time Blurring

Read original: arXiv:2407.09059 - Published 7/15/2024 by Jin-Ting He, Fu-Jen Tsai, Jia-Hao Wu, Yan-Tsung Peng, Chung-Chi Tsai, Chia-Wen Lin, Yen-Yu Lin

Overview

This paper proposes a domain adaptation scheme for video deblurring that fine-tunes a deblurring model at test time, using a blurring model to synthesize training data for the target domain.
The key idea is to find relatively sharp regions in the blurry test video, treat them as pseudo-sharp images, and re-blur them under blur conditions that match the target domain. The resulting pseudo-sharp and blurred pairs are then used to fine-tune the deblurring model, which narrows the domain gap between training and test data.
The authors report that this "test-time blurring" approach improves state-of-the-art video deblurring methods by up to 7.54 dB on various real-world datasets.

Plain English Explanation

Video deblurring is the task of taking a blurry video and trying to "un-blur" it to make the frames clearer and sharper. This can be a challenging problem, as the blurring can be caused by many different factors, like camera shake, object motion, or environmental conditions.

The key insight in this paper is that the blur a model was trained on often does not match the blur it meets in the real world, and that no sharp ground truth is available at test time to correct for this. To address this, the researchers propose "test-time blurring": they locate the parts of the blurry test video that are relatively sharp, treat them as stand-ins for clean images, and intentionally blur them with a separate blurring model. This creates synthetic blurry and sharp pairs that are more representative of the video the deblurring model is actually being asked to restore.

By fine-tuning the deblurring model on these generated pairs, it becomes better able to handle the blur in the video at hand. The authors show that this improves existing state-of-the-art deblurring models on real-world video.

Technical Explanation

The paper starts by noting the challenge of domain shift in video deblurring - the training data used to develop deblurring models may not match the real-world blur encountered at test-time. Previous domain adaptation approaches have tried to address this, such as denoising as adaptation and diffusion-driven test-time adaptation.

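The authors propose a method called "test-time blurring" that generates training pairs from the test video itself, since paired blurred and sharp frames are unavailable during inference. First, a Relative Sharpness Detection Module identifies relatively sharp regions in the blurry input and treats them as pseudo-sharp images. Next, a blurring model re-blurs these pseudo-sharp images, guided by domain-specific blur conditions from a Domain-adaptive Blur Condition Generation Module so that the synthesized blur follows the target data distribution. Finally, the pseudo-sharp and blurred pairs are used to fine-tune the deblurring model for the target domain.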
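Following the abstract's description (detect relatively sharp regions, re-blur them, fine-tune on the resulting pairs), the data-generation step can be sketched roughly as below. This is only an illustration under strong assumptions, not the authors' implementation: the Laplacian-variance score stands in for the Relative Sharpness Detection Module, a fixed horizontal box blur stands in for the learned blurring model, and the constant `blur_length` stands in for the domain-specific blur conditions. All function names and parameters are hypothetical.

```python
import numpy as np

def laplacian_variance(patch):
    """Sharpness score (assumed proxy): variance of a 4-neighbour Laplacian."""
    lap = (-4.0 * patch[1:-1, 1:-1]
           + patch[:-2, 1:-1] + patch[2:, 1:-1]
           + patch[1:-1, :-2] + patch[1:-1, 2:])
    return float(lap.var())

def pseudo_sharp_patches(frames, patch=32, keep_ratio=0.1):
    """Keep the sharpest patches across all frames (hypothetical selection rule)."""
    scored = []
    for frame in frames:
        h, w = frame.shape
        for y in range(0, h - patch + 1, patch):
            for x in range(0, w - patch + 1, patch):
                p = frame[y:y + patch, x:x + patch]
                scored.append((laplacian_variance(p), p))
    scored.sort(key=lambda item: item[0], reverse=True)
    n_keep = max(1, int(len(scored) * keep_ratio))
    return [p for _, p in scored[:n_keep]]

def horizontal_box_blur(patch, length=7):
    """Stand-in blurring model: horizontal box blur; `length` must be odd."""
    if length % 2 == 0:
        raise ValueError("length must be odd so the output keeps its shape")
    kernel = np.ones(length) / length
    half = length // 2
    padded = np.pad(patch, ((0, 0), (half, half)), mode="edge")
    return np.stack([np.convolve(row, kernel, mode="valid") for row in padded])

def make_training_pairs(frames, blur_length=7):
    """Return (blurred, pseudo_sharp) pairs for fine-tuning a deblurring model."""
    sharp = pseudo_sharp_patches(frames)
    return [(horizontal_box_blur(p, blur_length), p) for p in sharp]

# Toy usage on random grayscale "frames" with values in [0, 1].
rng = np.random.default_rng(0)
frames = [rng.random((64, 64)) for _ in range(3)]
pairs = make_training_pairs(frames)
```

Each pair would then serve as an (input, target) example when fine-tuning the deblurring model. In the paper's scheme the blur applied at this step is chosen to match the target domain rather than fixed in advance, which is the part this sketch does not attempt to reproduce.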

Experiments on various real-world video deblurring datasets show that this test-time blurring approach improves state-of-the-art video deblurring methods, with reported gains of up to 7.54 dB. Because the scheme fine-tunes an existing model rather than defining a new one, it can be applied on top of existing video deblurring methods. Related work in this space includes DABIT and ID-BLAU.
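Deblurring quality is commonly reported as peak signal-to-noise ratio (PSNR) in decibels, and the abstract's "up to 7.54dB" gain is presumably such a figure. The following minimal sketch shows how PSNR is computed for images scaled to [0, 1]. The function name and the `peak` parameter are illustrative choices, not taken from the paper.

```python
import numpy as np

def psnr(reference, estimate, peak=1.0):
    """Peak signal-to-noise ratio in dB between two images of equal shape."""
    ref = np.asarray(reference, dtype=np.float64)
    est = np.asarray(estimate, dtype=np.float64)
    mse = np.mean((ref - est) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return float(10.0 * np.log10(peak ** 2 / mse))

# Toy usage: a constant error of 0.1 gives an MSE of 0.01.
reference = np.zeros((8, 8))
estimate = np.full((8, 8), 0.1)
score = psnr(reference, estimate)
```

Because PSNR is logarithmic in the mean squared error, a gain of 7.54 dB corresponds to an error roughly 5.7 times lower (10^0.754).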

Critical Analysis

While the test-time blurring approach is effective, it adds computational overhead at inference time: training pairs must be generated with the blurring model, and the deblurring model must be fine-tuned for the target domain. The paper does not deeply explore the trade-offs between this cost and the improved deblurring quality.

Additionally, the pseudo-sharp images are only the relatively sharp regions of blurry frames, so they may retain some residual blur, and the blurring model may not fully capture the diversity of real-world blur types. Exploring ways to make the blurring more representative, such as DyBlurF, could further improve the domain adaptation capabilities.

Overall, the test-time blurring technique is a creative and promising approach to video deblurring that helps bridge the gap between training and test data. Continuing to refine the blurring model and exploring the computational trade-offs could lead to even stronger real-world performance.

Conclusion

This paper presents a domain adaptation scheme for video deblurring that fine-tunes a deblurring model at test time. By extracting relatively sharp regions from the blurry test video, re-blurring them under domain-specific blur conditions, and fine-tuning on the resulting pairs, the deblurring model can better handle the domain shift between training and real-world data. Experiments show that this test-time blurring approach improves state-of-the-art video deblurring methods by up to 7.54 dB, making it a valuable contribution to the field of video restoration.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Domain-adaptive Video Deblurring via Test-time Blurring

Jin-Ting He, Fu-Jen Tsai, Jia-Hao Wu, Yan-Tsung Peng, Chung-Chi Tsai, Chia-Wen Lin, Yen-Yu Lin

Dynamic scene video deblurring aims to remove undesirable blurry artifacts captured during the exposure process. Although previous video deblurring methods have achieved impressive results, they suffer from significant performance drops due to the domain gap between training and testing videos, especially for those captured in real-world scenarios. To address this issue, we propose a domain adaptation scheme based on a blurring model to achieve test-time fine-tuning for deblurring models in unseen domains. Since blurred and sharp pairs are unavailable for fine-tuning during inference, our scheme can generate domain-adaptive training pairs to calibrate a deblurring model for the target domain. First, a Relative Sharpness Detection Module is proposed to identify relatively sharp regions from the blurry input images and regard them as pseudo-sharp images. Next, we utilize a blurring model to produce blurred images based on the pseudo-sharp images extracted during testing. To synthesize blurred images in compliance with the target data distribution, we propose a Domain-adaptive Blur Condition Generation Module to create domain-specific blur conditions for the blurring model. Finally, the generated pseudo-sharp and blurred pairs are used to fine-tune a deblurring model for better performance. Extensive experimental results demonstrate that our approach can significantly improve state-of-the-art video deblurring methods, providing performance gains of up to 7.54dB on various real-world video deblurring datasets. The source code is available at https://github.com/Jin-Ting-He/DADeblur.

7/15/2024

VDPI: Video Deblurring with Pseudo-inverse Modeling

Zhihao Huang, Santiago Lopez-Tapia, Aggelos K. Katsaggelos

Video deblurring is a challenging task that aims to recover sharp sequences from blurry and noisy observations. The image-formation model plays a crucial role in traditional model-based methods, constraining the possible solutions. However, this is only the case for some deep learning-based methods. Despite deep-learning models achieving better results, traditional model-based methods remain widely popular due to their flexibility. An increasing number of scholars combine the two to achieve better deblurring performance. This paper proposes introducing knowledge of the image-formation model into a deep learning network by using the pseudo-inverse of the blur. We use a deep network to fit the blurring and estimate the pseudo-inverse. Then, we use this estimation, combined with a variational deep-learning network, to deblur the video sequence. Notably, our experimental results demonstrate that such modifications can significantly improve the performance of deep learning models for video deblurring. Furthermore, our experiments on different datasets achieved notable performance improvements, proving that our proposed method can generalize to different scenarios and cameras.

9/4/2024

DAVIDE: Depth-Aware Video Deblurring

German F. Torres, Jussi Kalliola, Soumya Tripathy, Erman Acar, Joni-Kristian Kamarainen

Video deblurring aims at recovering sharp details from a sequence of blurry frames. Despite the proliferation of depth sensors in mobile phones and the potential of depth information to guide deblurring, depth-aware deblurring has received only limited attention. In this work, we introduce the 'Depth-Aware VIdeo DEblurring' (DAVIDE) dataset to study the impact of depth information in video deblurring. The dataset comprises synchronized blurred, sharp, and depth videos. We investigate how the depth information should be injected into the existing deep RGB video deblurring models, and propose a strong baseline for depth-aware video deblurring. Our findings reveal the significance of depth information in video deblurring and provide insights into the use cases where depth cues are beneficial. In addition, our results demonstrate that while the depth improves deblurring performance, this effect diminishes when models are provided with a longer temporal context. Project page: https://germanftv.github.io/DAVIDE.github.io/ .

9/4/2024

Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model

Chen Rao, Guangyuan Li, Zehua Lan, Jiakai Sun, Junsheng Luan, Wei Xing, Lei Zhao, Huaizhong Lin, Jianfeng Dong, Dalong Zhang

Current video deblurring methods have limitations in recovering high-frequency information since the regression losses are conservative with high-frequency details. Since Diffusion Models (DMs) have strong capabilities in generating high-frequency details, we consider introducing DMs into the video deblurring task. However, we found that directly applying DMs to the video deblurring task has the following problems: (1) DMs require many iteration steps to generate videos from Gaussian noise, which consumes many computational resources. (2) DMs are easily misled by the blurry artifacts in the video, resulting in irrational content and distortion of the deblurred video. To address the above issues, we propose a novel video deblurring framework VD-Diff that integrates the diffusion model into the Wavelet-Aware Dynamic Transformer (WADT). Specifically, we perform the diffusion model in a highly compact latent space to generate prior features containing high-frequency information that conforms to the ground truth distribution. We design the WADT to preserve and recover the low-frequency information in the video while utilizing the high-frequency information generated by the diffusion model. Extensive experiments show that our proposed VD-Diff outperforms SOTA methods on GoPro, DVD, BSD, and Real-World Video datasets.

8/27/2024