SelfDRSC++: Self-Supervised Learning for Dual Reversed Rolling Shutter Correction

Read original: arXiv:2408.11411 - Published 8/22/2024 by Wei Shang, Dongwei Ren, Wanying Zhang, Qilong Wang, Pengfei Zhu, Wangmeng Zuo

Overview

Self-supervised learning approach for dual reversed rolling shutter correction
Leverages the unique dual reversed imaging mechanism to enable self-supervision
Addresses challenges of rolling shutter artifacts in image capture

Plain English Explanation

The paper presents a self-supervised learning method called SelfDRSC++ for correcting rolling shutter distortions in images. Rolling shutter is the readout mechanism used by most consumer cameras: the sensor scans the scene row by row, so different rows are captured at slightly different times, which distorts moving scenes.

The key innovation in this work is the use of a "dual reversed imaging mechanism": the camera captures two images simultaneously, but with the rolling shutter scanning directions reversed between the two. This allows the system to learn how to correct the distortions in a self-supervised manner, without the high-framerate global shutter (GS) ground-truth images that supervised methods require.
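To make the dual reversed mechanism concrete, here is a minimal NumPy sketch that simulates a vertical bar moving to the right while it is scanned top-to-bottom and bottom-to-top. The helper names (`row_times`, `capture_rs`), the normalized time range [0, 1], and the toy scene are assumptions made for illustration; none of this is taken from the paper's code.

```python
import numpy as np

def row_times(height, reverse=False):
    """Normalized capture time in [0, 1] for each row (hypothetical helper)."""
    t = np.linspace(0.0, 1.0, height)
    return t[::-1].copy() if reverse else t

def capture_rs(height, width, x0, speed, reverse=False):
    """Render a one-pixel-wide vertical bar that moves `speed` pixels to the
    right during one frame, as seen by a rolling shutter sensor."""
    img = np.zeros((height, width), dtype=np.uint8)
    t = row_times(height, reverse)
    # Each row sees the bar where it is at that row's capture time.
    cols = np.rint(x0 + speed * t).astype(int)
    rows = np.arange(height)
    valid = (cols >= 0) & (cols < width)
    img[rows[valid], cols[valid]] = 1
    return img

# Same scene, opposite scan directions.
t2b = capture_rs(5, 8, x0=1, speed=4)                # top-to-bottom scan
b2t = capture_rs(5, 8, x0=1, speed=4, reverse=True)  # bottom-to-top scan
print(np.argmax(t2b, axis=1))  # bar column per row, skewed one way
print(np.argmax(b2t, axis=1))  # skewed the opposite way
```

The two images are skewed in opposite directions and agree only on the middle row, which both scans capture at the same time. That mirrored distortion is the extra signal the pair provides over a single rolling shutter image.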

The method uses these paired images to train a neural network to accurately predict the distortion flow, which can then be used to rectify the rolling shutter artifacts. By leveraging this unique imaging setup, the approach is able to achieve strong rolling shutter correction results without the need for expensive or cumbersome data collection.

Technical Explanation

The SelfDRSC++ system consists of two key components: a distortion flow prediction network and a distortion correction module.

The distortion flow prediction network takes the two reversed rolling shutter images as input and learns to predict the optical flow that represents the distortion. This is achieved through a self-supervised training process that exploits the unique properties of the dual reversed imaging setup.
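The paper's abstract describes the self-supervision as cycle consistency: the dual reversed RS images are reconstructed from the predicted GS images, row by row, using RS time maps, and compared with the inputs. The sketch below illustrates that idea under simplifying assumptions: plain linear blending between GS frames stands in for the learned frame interpolation, the loss is a simple L1 difference, and the function names are hypothetical.

```python
import numpy as np

def reconstruct_rs(gs_frames, gs_times, row_time):
    """Rebuild an RS image by sampling, for each row, the GS video at that
    row's capture time.

    gs_frames: (N, H, W) predicted GS frames at ascending times gs_times (N,).
    row_time:  (H,) capture time of each RS row (the "time map").
    Linear blending is an assumed stand-in for learned interpolation.
    """
    gs_times = np.asarray(gs_times, dtype=float)
    n, h, w = gs_frames.shape
    out = np.empty((h, w), dtype=float)
    for r in range(h):
        t = np.clip(row_time[r], gs_times[0], gs_times[-1])
        # Index of the GS frame at or just before time t.
        i = min(int(np.searchsorted(gs_times, t, side="right")) - 1, n - 2)
        a = (t - gs_times[i]) / (gs_times[i + 1] - gs_times[i])
        out[r] = (1 - a) * gs_frames[i, r] + a * gs_frames[i + 1, r]
    return out

def cycle_loss(rs_t2b, rs_b2t, gs_frames, gs_times):
    """L1 cycle-consistency between input and reconstructed RS pairs (assumed loss)."""
    t = np.linspace(0.0, 1.0, rs_t2b.shape[0])
    rec_t2b = reconstruct_rs(gs_frames, gs_times, t)        # top-to-bottom scan
    rec_b2t = reconstruct_rs(gs_frames, gs_times, t[::-1])  # bottom-to-top scan
    return np.abs(rec_t2b - rs_t2b).mean() + np.abs(rec_b2t - rs_b2t).mean()

# Toy check: two GS frames (all zeros at t=0, all ones at t=1).
gs = np.stack([np.zeros((3, 2)), np.ones((3, 2))])
times = np.array([0.0, 1.0])
t = np.linspace(0.0, 1.0, 3)
rs_a = reconstruct_rs(gs, times, t)
rs_b = reconstruct_rs(gs, times, t[::-1])
loss = cycle_loss(rs_a, rs_b, gs, times)
```

In a real training loop the GS frames would come from the correction network, and minimizing this loss pushes its predictions to explain both input images at once.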

The distortion correction module then uses the predicted distortion flow to warp the input images and remove the rolling shutter artifacts. This produces the final corrected output.
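As an illustration of warping with a predicted flow, the following NumPy sketch applies backward warping with bilinear sampling to undo a synthetic row-dependent skew. The summary does not say which warping convention the paper uses, so the backward convention, the `backward_warp` name, and the hand-made flow field are all assumptions for this example.

```python
import numpy as np

def backward_warp(image, flow):
    """Sample `image` at (x + flow[..., 0], y + flow[..., 1]) using bilinear
    interpolation, clamping sample positions to the image border."""
    h, w = image.shape
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    sx = np.clip(xs + flow[..., 0], 0, w - 1)
    sy = np.clip(ys + flow[..., 1], 0, h - 1)
    x0 = np.floor(sx).astype(int)
    y0 = np.floor(sy).astype(int)
    x1 = np.minimum(x0 + 1, w - 1)
    y1 = np.minimum(y0 + 1, h - 1)
    wx = sx - x0
    wy = sy - y0
    top = (1 - wx) * image[y0, x0] + wx * image[y0, x1]
    bottom = (1 - wx) * image[y1, x0] + wx * image[y1, x1]
    return (1 - wy) * top + wy * bottom

# A bar skewed by one pixel per row, as a rolling shutter might record it.
h, w = 4, 8
rs = np.zeros((h, w))
rs[np.arange(h), 2 + np.arange(h)] = 1.0

# Hand-made flow: each output pixel looks `row` pixels to the right.
flow = np.zeros((h, w, 2))
flow[..., 0] = np.arange(h)[:, None]

corrected = backward_warp(rs, flow)
print(corrected[:, 2])  # the bar is vertical again at column 2
```

Here the flow is known by construction; in the method described above it would be predicted by the network from the two reversed rolling shutter images.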

The authors demonstrate the effectiveness of their approach through extensive experiments on both synthetic and real-world datasets, showing significant improvements in rolling shutter correction compared to prior methods.

Critical Analysis

The SelfDRSC++ method presents a clever and innovative solution to the problem of rolling shutter correction. By leveraging the dual reversed imaging mechanism, the system is able to learn effective distortion correction without requiring any ground truth data, which is a significant advantage over supervised approaches.

However, the authors acknowledge that the dual camera setup may not always be feasible or practical in real-world applications. Additionally, the method may struggle with more complex or irregular rolling shutter patterns that are not well captured by the predicted distortion flow.

Further research could explore ways to extend the self-supervised approach to more generalized rolling shutter correction, perhaps by incorporating additional cues or constraints beyond the dual reversed imaging setup. Investigating the performance on a wider range of real-world scenarios would also help validate the broader applicability of the technique.

Conclusion

The SelfDRSC++ paper presents a novel self-supervised learning method for correcting rolling shutter distortions in images. By leveraging a unique dual reversed imaging mechanism, the system is able to learn effective distortion correction without requiring any ground truth data.

The proposed approach demonstrates strong results on both synthetic and real-world datasets, highlighting its potential for practical applications in areas like computational photography and image processing. While the dual camera setup may limit its immediate deployment, the self-supervised learning strategy suggests promising directions for further research and development in this important field.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

SelfDRSC++: Self-Supervised Learning for Dual Reversed Rolling Shutter Correction

Wei Shang, Dongwei Ren, Wanying Zhang, Qilong Wang, Pengfei Zhu, Wangmeng Zuo

Modern consumer cameras commonly employ the rolling shutter (RS) imaging mechanism, via which images are captured by scanning scenes row-by-row, resulting in RS distortion for dynamic scenes. To correct RS distortion, existing methods adopt a fully supervised learning manner that requires high framerate global shutter (GS) images as ground-truth for supervision. In this paper, we propose an enhanced Self-supervised learning framework for Dual reversed RS distortion Correction (SelfDRSC++). Firstly, we introduce a lightweight DRSC network that incorporates a bidirectional correlation matching block to refine the joint optimization of optical flows and corrected RS features, thereby improving correction performance while reducing network parameters. Subsequently, to effectively train the DRSC network, we propose a self-supervised learning strategy that ensures cycle consistency between input and reconstructed dual reversed RS images. The RS reconstruction in SelfDRSC++ can be interestingly formulated as a specialized instance of video frame interpolation, where each row in reconstructed RS images is interpolated from predicted GS images by utilizing RS distortion time maps. By achieving superior performance while simplifying the training process, SelfDRSC++ enables feasible one-stage self-supervised training. Additionally, besides start and end RS scanning time, SelfDRSC++ allows supervision of GS images at arbitrary intermediate scanning times, thus enabling the learned DRSC network to generate high framerate GS videos. The code and trained models are available at https://github.com/shangwei5/SelfDRSC_plusplus.

8/22/2024

Rolling Shutter Correction with Intermediate Distortion Flow Estimation

Mingdeng Cao, Sidi Yang, Yujiu Yang, Yinqiang Zheng

This paper proposes to correct the rolling shutter (RS) distorted images by estimating the distortion flow from the global shutter (GS) to RS directly. Existing methods usually perform correction using the undistortion flow from the RS to GS. They initially predict the flow from consecutive RS frames, subsequently rescaling it as the displacement fields from the RS frame to the underlying GS image using time-dependent scaling factors. Following this, RS-aware forward warping is employed to convert the RS image into its GS counterpart. Nevertheless, this strategy is prone to two shortcomings. First, the undistortion flow estimation is rendered inaccurate by merely linear scaling the flow, due to the complex non-linear motion nature. Second, RS-aware forward warping often results in unavoidable artifacts. To address these limitations, we introduce a new framework that directly estimates the distortion flow and rectifies the RS image with the backward warping operation. More specifically, we first propose a global correlation-based flow attention mechanism to estimate the initial distortion flow and GS feature jointly, which are then refined by the following coarse-to-fine decoder layers. Additionally, a multi-distortion flow prediction strategy is integrated to mitigate the issue of inaccurate flow estimation further. Experimental results validate the effectiveness of the proposed method, which outperforms state-of-the-art approaches on various benchmarks while maintaining high efficiency. The project is available at https://github.com/ljzycmd/DFRSC.

4/10/2024

Single Image Rolling Shutter Removal with Diffusion Models

Zhanglei Yang, Haipeng Li, Mingbo Hong, Bing Zeng, Shuaicheng Liu

We present RS-Diffusion, the first Diffusion Models-based method for single-frame Rolling Shutter (RS) correction. RS artifacts compromise visual quality of frames due to the row wise exposure of CMOS sensors. Most previous methods have focused on multi-frame approaches, using temporal information from consecutive frames for the motion rectification. However, few approaches address the more challenging but important single frame RS correction. In this work, we present an "image-to-motion" framework via diffusion techniques, with a designed patch-attention module. In addition, we present the RS-Real dataset, comprised of captured RS frames alongside their corresponding Global Shutter (GS) ground-truth pairs. The GS frames are corrected from the RS ones, guided by the corresponding Inertial Measurement Unit (IMU) gyroscope data acquired during capture. Experiments show that our RS-Diffusion surpasses previous single RS correction methods. Our method and proposed RS-Real dataset lay a solid foundation for advancing the field of RS correction.

7/4/2024

Self-Supervised Learning for Real-World Super-Resolution from Dual and Multiple Zoomed Observations

Zhilu Zhang, Ruohao Wang, Hongzhi Zhang, Wangmeng Zuo

In this paper, we consider two challenging issues in reference-based super-resolution (RefSR) for smartphones, (i) how to choose a proper reference image, and (ii) how to learn RefSR in a self-supervised manner. Particularly, we propose a novel self-supervised learning approach for real-world RefSR from observations at dual and multiple camera zooms. Firstly, considering the popularity of multiple cameras in modern smartphones, the more zoomed (telephoto) image can be naturally leveraged as the reference to guide the super-resolution (SR) of the lesser zoomed (ultra-wide) image, which gives us a chance to learn a deep network that performs SR from the dual zoomed observations (DZSR). Secondly, for self-supervised learning of DZSR, we take the telephoto image instead of an additional high-resolution image as the supervision information, and select a center patch from it as the reference to super-resolve the corresponding ultra-wide image patch. To mitigate the effect of the misalignment between ultra-wide low-resolution (LR) patch and telephoto ground-truth (GT) image during training, we first adopt patch-based optical flow alignment and then design an auxiliary-LR to guide the deforming of the warped LR features. To generate visually pleasing results, we present local overlapped sliced Wasserstein loss to better represent the perceptual difference between GT and output in the feature space. During testing, DZSR can be directly deployed to super-resolve the whole ultra-wide image with the reference of the telephoto image. In addition, we further take multiple zoomed observations to explore self-supervised RefSR, and present a progressive fusion scheme for the effective utilization of reference images. Experiments show that our methods achieve better quantitative and qualitative performance against state-of-the-arts. Codes are available at https://github.com/cszhilu1998/SelfDZSR_PlusPlus.

5/6/2024