RCNet: Deep Recurrent Collaborative Network for Multi-View Low-Light Image Enhancement

Read original: arXiv:2409.04363 - Published 9/9/2024 by Hao Luo, Baoliang Chen, Lingyu Zhu, Peilin Chen, Shiqi Wang

RCNet: Deep Recurrent Collaborative Network for Multi-View Low-Light Image Enhancement

Overview

This paper introduces RCNet, a deep recurrent collaborative network for enhancing multi-view low-light images.
RCNet combines intra-view enhancement and inter-view alignment & fusion to improve low-light image quality across multiple views.
The network leverages recurrent connections to enable iterative refinement of the enhancement process.

Plain English Explanation

The RCNet model is designed to improve the quality of low-light images captured from multiple viewpoints or angles. Low-light conditions can make it difficult to take clear, high-quality photos, as the images often appear dark and lacking in detail.

RCNet addresses this problem by using a [object Object] approach that combines two key steps:

Intra-view Enhancement: RCNet first enhances each individual low-light image, improving brightness, contrast, and detail.
Inter-view Alignment & Fusion: RCNet then aligns and combines the enhanced images from multiple viewpoints to create a single, high-quality output.

The network uses [object Object] to allow for iterative refinement of the enhancement process, gradually improving the results over multiple steps.

By leveraging information from multiple views, RCNet is able to produce more accurate and detailed low-light image enhancements compared to approaches that only consider a single view.

Technical Explanation

The RCNet architecture consists of two main components:

Intra-view Enhancement Module: This module uses a [object Object] to enhance each input low-light image individually, improving brightness, contrast, and preserving important details.
Inter-view Alignment & Fusion Module: This module aligns the enhanced images from multiple viewpoints and combines them to create a final, high-quality output. The alignment step ensures that features in the images are properly registered, while the fusion step leverages the complementary information across views.

The network is trained in an [object Object] fashion, allowing the intra-view enhancement and inter-view alignment & fusion components to work together effectively.

Experimental results on a multi-view low-light image dataset demonstrate that RCNet outperforms state-of-the-art single-view and multi-view enhancement methods, producing visually appealing and detailed low-light image outputs.

Critical Analysis

The paper provides a thorough evaluation of the RCNet model, including comparisons to several existing low-light enhancement approaches. However, the authors acknowledge some limitations:

The performance of RCNet may be influenced by the quality and diversity of the training data, which can be challenging to acquire for real-world multi-view low-light scenarios.
The iterative refinement process, while effective, can be computationally expensive, which may limit the deployment of RCNet in real-time applications.

Further research could explore ways to improve the efficiency of the recurrent refinement process, or investigate techniques for generating more robust synthetic multi-view low-light datasets to supplement the training data.

Conclusion

The RCNet model presents a promising approach for enhancing low-light images captured from multiple viewpoints. By combining intra-view enhancement and inter-view alignment & fusion, the network is able to leverage complementary information across views to produce high-quality, visually appealing low-light image outputs.

The recurrent nature of the network allows for iterative refinement, leading to further improvements in the enhancement results. While the model has some limitations, the overall framework demonstrates the potential of collaborative multi-view approaches for addressing challenging low-light imaging challenges.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

RCNet: Deep Recurrent Collaborative Network for Multi-View Low-Light Image Enhancement

Hao Luo, Baoliang Chen, Lingyu Zhu, Peilin Chen, Shiqi Wang

Scene observation from multiple perspectives would bring a more comprehensive visual experience. However, in the context of acquiring multiple views in the dark, the highly correlated views are seriously alienated, making it challenging to improve scene understanding with auxiliary views. Recent single image-based enhancement methods may not be able to provide consistently desirable restoration performance for all views due to the ignorance of potential feature correspondence among different views. To alleviate this issue, we make the first attempt to investigate multi-view low-light image enhancement. First, we construct a new dataset called Multi-View Low-light Triplets (MVLT), including 1,860 pairs of triple images with large illumination ranges and wide noise distribution. Each triplet is equipped with three different viewpoints towards the same scene. Second, we propose a deep multi-view enhancement framework based on the Recurrent Collaborative Network (RCNet). Specifically, in order to benefit from similar texture correspondence across different views, we design the recurrent feature enhancement, alignment and fusion (ReEAF) module, in which intra-view feature enhancement (Intra-view EN) followed by inter-view feature alignment and fusion (Inter-view AF) is performed to model the intra-view and inter-view feature propagation sequentially via multi-view collaboration. In addition, two different modules from enhancement to alignment (E2A) and from alignment to enhancement (A2E) are developed to enable the interactions between Intra-view EN and Inter-view AF, which explicitly utilize attentive feature weighting and sampling for enhancement and alignment, respectively. Experimental results demonstrate that our RCNet significantly outperforms other state-of-the-art methods. All of our dataset, code, and model will be available at https://github.com/hluo29/RCNet.

9/9/2024

✨

MRIo3DS-Net: A Mutually Reinforcing Images to 3D Surface RNN-like framework for model-adaptation indoor 3D reconstruction

Chang Li, Jiao Guo, Yufei Zhao, Yongjun Zhang

This paper is the first to propose an end-to-end framework of mutually reinforcing images to 3D surface recurrent neural network-like for model-adaptation indoor 3D reconstruction,where multi-view dense matching and point cloud surface optimization are mutually reinforced by a RNN-like structure rather than being treated as a separate issue.The characteristics are as follows:In the multi-view dense matching module, the model-adaptation strategy is used to fine-tune and optimize a Transformer-based multi-view dense matching DNN,so that it has the higher image feature for matching and detail expression capabilities;In the point cloud surface optimization module,the 3D surface reconstruction network based on 3D implicit field is optimized by using model-adaptation strategy,which solves the problem of point cloud surface optimization without knowing normal vector of 3D surface.To improve and finely reconstruct 3D surfaces from point cloud,smooth loss is proposed and added to this module;The MRIo3DS-Net is a RNN-like framework,which utilizes the finely optimized 3D surface obtained by PCSOM to recursively reinforce the differentiable warping for optimizing MVDMM.This refinement leads to achieving better dense matching results, and better dense matching results leads to achieving better 3D surface results recursively and mutually.Hence, model-adaptation strategy can better collaborate the differences between the two network modules,so that they complement each other to achieve the better effect;To accelerate the transfer learning and training convergence from source domain to target domain,a multi-task loss function based on Bayesian uncertainty is used to adaptively adjust the weights between the two networks loss functions of MVDMM and PCSOM;In this multi-task cascade network framework,any modules can be replaced by any state-of-the-art networks to achieve better 3D reconstruction results.

7/17/2024

CRNet: A Detail-Preserving Network for Unified Image Restoration and Enhancement Task

Kangzhen Yang, Tao Hu, Kexin Dai, Genggeng Chen, Yu Cao, Wei Dong, Peng Wu, Yanning Zhang, Qingsen Yan

In real-world scenarios, images captured often suffer from blurring, noise, and other forms of image degradation, and due to sensor limitations, people usually can only obtain low dynamic range images. To achieve high-quality images, researchers have attempted various image restoration and enhancement operations on photographs, including denoising, deblurring, and high dynamic range imaging. However, merely performing a single type of image enhancement still cannot yield satisfactory images. In this paper, to deal with the challenge above, we propose the Composite Refinement Network (CRNet) to address this issue using multiple exposure images. By fully integrating information-rich multiple exposure inputs, CRNet can perform unified image restoration and enhancement. To improve the quality of image details, CRNet explicitly separates and strengthens high and low-frequency information through pooling layers, using specially designed Multi-Branch Blocks for effective fusion of these frequencies. To increase the receptive field and fully integrate input features, CRNet employs the High-Frequency Enhancement Module, which includes large kernel convolutions and an inverted bottleneck ConvFFN. Our model secured third place in the first track of the Bracketing Image Restoration and Enhancement Challenge, surpassing previous SOTA models in both testing metrics and visual quality.

4/23/2024

Relational Representation Learning Network for Cross-Spectral Image Patch Matching

Chuang Yu, Yunpeng Liu, Jinmiao Zhao, Dou Quan, Zelin Shi

Recently, feature relation learning has drawn widespread attention in cross-spectral image patch matching. However, existing related research focuses on extracting diverse relations between image patch features and ignores sufficient intrinsic feature representations of individual image patches. Therefore, we propose an innovative relational representation learning idea that simultaneously focuses on sufficiently mining the intrinsic features of individual image patches and the relations between image patch features. Based on this, we construct a Relational Representation Learning Network (RRL-Net). Specifically, we innovatively construct an autoencoder to fully characterize the individual intrinsic features, and introduce a feature interaction learning (FIL) module to extract deep-level feature relations. To further fully mine individual intrinsic features, a lightweight multi-dimensional global-to-local attention (MGLA) module is constructed to enhance the global feature extraction of individual image patches and capture local dependencies within global features. By combining the MGLA module, we further explore the feature extraction network and construct an attention-based lightweight feature extraction (ALFE) network. In addition, we propose a multi-loss post-pruning (MLPP) optimization strategy, which greatly promotes network optimization while avoiding increases in parameters and inference time. Extensive experiments demonstrate that our RRL-Net achieves state-of-the-art (SOTA) performance on multiple public datasets. Our code will be made public later.

8/7/2024