BVI-Lowlight: Fully Registered Benchmark Dataset for Low-Light Video Enhancement

Read original: arXiv:2402.01970 - Published 5/28/2024 by Nantheera Anantrasirichai, Ruirui Lin, Alexandra Malyugina, David Bull


Overview

This research paper introduces a new benchmark dataset called BVI-Lowlight for evaluating low-light video enhancement algorithms.
The dataset contains fully registered video pairs: 40 scenes captured in various motion scenarios under two distinct low-light conditions, each with corresponding ground-truth frames captured in normal light.
The authors aim to provide a comprehensive and challenging benchmark to drive progress in low-light video enhancement, a critical area for many real-world applications.

Plain English Explanation

The paper presents a new dataset called BVI-Lowlight that can be used to test and compare algorithms for enhancing the quality of video captured in low-light conditions. Low-light video is challenging because details and colors can be hard to see, but it's important for many applications like security cameras, self-driving cars, and night photography.

The BVI-Lowlight dataset includes high-quality video footage taken in different low-light environments, along with "reference" frames that show what the scenes would look like in normal lighting. By comparing the enhanced low-light videos to the references, researchers can evaluate how well their algorithms are able to restore details and colors. This provides a standardized way to measure progress in this field.

The authors hope that this comprehensive dataset will drive further advancements in low-light video enhancement, leading to improvements in many real-world technologies that rely on being able to see clearly in the dark.

Technical Explanation

The paper introduces BVI-Lowlight, a benchmark dataset for training and evaluating low-light video enhancement algorithms. The low-light footage incorporates genuine noise and temporal artifacts. The ground truth was captured in normal light using a programmable motorized dolly, then refined with image-based post-processing so that frames at different light levels are aligned pixel-wise.

The authors designed the dataset to address the scarcity of training data for low-light video enhancement. Where earlier datasets focused on single-frame enhancement or lacked diversity in lighting conditions, BVI-Lowlight provides fully registered video pairs for supervised training and for evaluating end-to-end video enhancement algorithms.

The dataset covers 40 scenes with various motion scenarios, each recorded under two distinct low-light conditions. The low-light frames are spatially and temporally aligned with the corresponding normal-light reference frames, which enables direct, pixel-wise quantitative evaluation of enhancement performance.
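Because the frames are registered, an enhanced frame can be compared with its reference pixel by pixel. The following is a minimal sketch of such a comparison using PSNR, a common full-reference metric; it is not taken from the paper, and the function names (`psnr`, `video_psnr`) and the synthetic arrays standing in for video clips are assumptions made for illustration only.

```python
import numpy as np

def psnr(reference: np.ndarray, enhanced: np.ndarray, max_value: float = 1.0) -> float:
    """Peak signal-to-noise ratio in dB between two frames of identical shape."""
    reference = reference.astype(np.float64)
    enhanced = enhanced.astype(np.float64)
    if reference.shape != enhanced.shape:
        raise ValueError("frames must have identical shapes")
    mse = np.mean((reference - enhanced) ** 2)
    if mse == 0:
        return float("inf")  # identical frames
    return float(10.0 * np.log10(max_value ** 2 / mse))

def video_psnr(reference_frames, enhanced_frames, max_value: float = 1.0) -> float:
    """Mean per-frame PSNR over a registered video pair (frame i matches frame i)."""
    scores = [psnr(r, e, max_value) for r, e in zip(reference_frames, enhanced_frames)]
    return float(np.mean(scores))

# Synthetic stand-ins for a clip (frames, height, width, channels); NOT dataset samples.
rng = np.random.default_rng(0)
reference_video = rng.random((4, 32, 32, 3))

# A hypothetical "enhanced" output: the reference plus mild residual noise.
noisy_video = np.clip(reference_video + rng.normal(0.0, 0.05, reference_video.shape), 0.0, 1.0)

# The same content shifted two pixels horizontally, simulating an unregistered pair.
shifted_video = np.roll(reference_video, shift=2, axis=2)

aligned_score = video_psnr(reference_video, noisy_video)
misaligned_score = video_psnr(reference_video, shifted_video)
print(f"aligned: {aligned_score:.2f} dB, misaligned: {misaligned_score:.2f} dB")
```

In this toy setup the shifted clip scores far lower than the noisy one even though its content is otherwise identical to the reference, which illustrates why pixel-wise alignment matters for full-reference metrics.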

To demonstrate the utility of the dataset, the authors analyze it in the context of supervised learning and evaluate low-light video enhancement methods on it. The results demonstrate the significance of fully registered video pairs for developing such methods and the need for comprehensive evaluation.
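An evaluation of this kind comes down to running each method over the low-light clips and scoring its output against the registered references. The sketch below shows that loop under stated assumptions: the three "enhancers" (`gain_enhancer`, `gamma_enhancer`, `identity_enhancer`) are trivial stand-ins, not the methods evaluated in the paper, and the clip is synthetic.

```python
import numpy as np

def mean_psnr(reference: np.ndarray, output: np.ndarray) -> float:
    """Mean per-frame PSNR in dB for clips shaped (frames, height, width, channels) in [0, 1]."""
    mse = np.mean((reference - output) ** 2, axis=(1, 2, 3))
    mse = np.maximum(mse, 1e-12)  # avoid division by zero for identical frames
    return float(np.mean(10.0 * np.log10(1.0 / mse)))

# Hypothetical stand-in enhancers, used only to have something to rank.
def gain_enhancer(video: np.ndarray, gain: float = 4.0) -> np.ndarray:
    return np.clip(video * gain, 0.0, 1.0)

def gamma_enhancer(video: np.ndarray, gamma: float = 0.5) -> np.ndarray:
    return np.clip(video, 0.0, 1.0) ** gamma

def identity_enhancer(video: np.ndarray) -> np.ndarray:
    return video

def run_benchmark(methods, low_light, reference):
    """Score every method against the reference and return (name, score) pairs, best first."""
    scores = {name: mean_psnr(reference, fn(low_light)) for name, fn in methods.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)

# Synthetic pair: the "low-light" clip is a dimmed, noisy copy of the reference.
rng = np.random.default_rng(0)
reference = rng.random((4, 16, 16, 3))
low_light = np.clip(reference * 0.25 + rng.normal(0.0, 0.01, reference.shape), 0.0, 1.0)

ranking = run_benchmark(
    {
        "gain_x4": gain_enhancer,
        "gamma_0.5": gamma_enhancer,
        "no_enhancement": identity_enhancer,
    },
    low_light,
    reference,
)
for name, score in ranking:
    print(f"{name}: {score:.2f} dB")
```

The linear gain ranks first here only because the synthetic degradation is itself a linear dimming; real low-light footage with genuine noise and temporal artifacts is far less forgiving, which is the motivation for benchmarking on captured data.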

Critical Analysis

The BVI-Lowlight dataset represents a valuable contribution to the field of low-light video enhancement. By providing a standardized, high-quality benchmark, the authors have addressed an important gap in existing datasets and enabled more meaningful comparisons between algorithms.

The dataset has limits, however. It covers 40 scenes under two low-light conditions, and although those scenes include various motion scenarios, the capture relies on a programmable motorized dolly, so the range of motion and lighting may not fully capture the breadth of real-world low-light scenarios.

Further research could extend the dataset to more dynamic and challenging environments and connect it to related tasks such as event-assisted low-light video object segmentation, nighttime motion detection, and multi-object tracking in the dark. Evaluating emerging spatio-temporally aligned models on BVI-Lowlight is another direction that could advance the state of the art in low-light video enhancement.

Conclusion

The BVI-Lowlight dataset is a useful step forward in benchmarking low-light video enhancement algorithms. By providing fully registered video pairs with genuine noise and temporal artifacts, the authors have created a practical tool for both training and evaluation. The dataset could accelerate research on diffusion-based and other low-light video enhancement techniques, with possible benefits for real-world applications ranging from surveillance to autonomous vehicles.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!


Related Papers

BVI-Lowlight: Fully Registered Benchmark Dataset for Low-Light Video Enhancement

Nantheera Anantrasirichai, Ruirui Lin, Alexandra Malyugina, David Bull

Low-light videos often exhibit spatiotemporal incoherent noise, leading to poor visibility and compromised performance across various computer vision applications. One significant challenge in enhancing such content using modern technologies is the scarcity of training data. This paper introduces a novel low-light video dataset, consisting of 40 scenes captured in various motion scenarios under two distinct low-lighting conditions, incorporating genuine noise and temporal artifacts. We provide fully registered ground truth data captured in normal light using a programmable motorized dolly, and subsequently, refine them via image-based post-processing to ensure the pixel-wise alignment of frames in different light levels. This paper also presents an exhaustive analysis of the low-light dataset, and demonstrates the extensive and representative nature of our dataset in the context of supervised learning. Our experimental results demonstrate the significance of fully registered video pairs in the development of low-light video enhancement methods and the need for comprehensive evaluation. Our dataset is available at DOI:10.21227/mzny-8c77.

5/28/2024

BVI-RLV: A Fully Registered Dataset and Benchmarks for Low-Light Video Enhancement

Ruirui Lin, Nantheera Anantrasirichai, Guoxi Huang, Joanne Lin, Qi Sun, Alexandra Malyugina, David R Bull

Low-light videos often exhibit spatiotemporal incoherent noise, compromising visibility and performance in computer vision applications. One significant challenge in enhancing such content using deep learning is the scarcity of training data. This paper introduces a novel low-light video dataset, consisting of 40 scenes with various motion scenarios under two distinct low-lighting conditions, incorporating genuine noise and temporal artifacts. We provide fully registered ground truth data captured in normal light using a programmable motorized dolly and refine it via an image-based approach for pixel-wise frame alignment across different light levels. We provide benchmarks based on four different technologies: convolutional neural networks, transformers, diffusion models, and state space models (mamba). Our experimental results demonstrate the significance of fully registered video pairs for low-light video enhancement (LLVE) and the comprehensive evaluation shows that the models trained with our dataset outperform those trained with the existing datasets. Our dataset and links to benchmarks are publicly available at https://doi.org/10.21227/mzny-8c77.

7/30/2024

EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More

Kanghao Chen, Guoqiang Liang, Hangyu Li, Yunfan Lu, Lin Wang

Event cameras offer significant advantages for low-light video enhancement, primarily due to their high dynamic range. Current research, however, is severely limited by the absence of large-scale, real-world, and spatio-temporally aligned event-video datasets. To address this, we introduce a large-scale dataset with over 30,000 pairs of frames and events captured under varying illumination. This dataset was curated using a robotic arm that traces a consistent non-linear trajectory, achieving spatial alignment precision under 0.03mm and temporal alignment with errors under 0.01s for 90% of the dataset. Based on the dataset, we propose EvLight++, a novel event-guided low-light video enhancement approach designed for robust performance in real-world scenarios. Firstly, we design a multi-scale holistic fusion branch to integrate structural and textural information from both images and events. To counteract variations in regional illumination and noise, we introduce Signal-to-Noise Ratio (SNR)-guided regional feature selection, enhancing features from high SNR regions and augmenting those from low SNR regions by extracting structural information from events. To incorporate temporal information and ensure temporal coherence, we further introduce a recurrent module and temporal loss in the whole pipeline. Extensive experiments on our and the synthetic SDSD dataset demonstrate that EvLight++ significantly outperforms both single image- and video-based methods by 1.37 dB and 3.71 dB, respectively. To further explore its potential in downstream tasks like semantic segmentation and monocular depth estimation, we extend our datasets by adding pseudo segmentation and depth labels via meticulous annotation efforts with foundation models. Experiments under diverse low-light scenes show that the enhanced results achieve a 15.97% improvement in mIoU for semantic segmentation.

8/30/2024

Low-Light Object Tracking: A Benchmark

Pengzhi Zhong, Xiaoyu Guo, Defeng Huang, Xiaojun Peng, Yian Li, Qijun Zhao, Shuiwang Li

In recent years, the field of visual tracking has made significant progress with the application of large-scale training datasets. These datasets have supported the development of sophisticated algorithms, enhancing the accuracy and stability of visual object tracking. However, most research has primarily focused on favorable illumination circumstances, neglecting the challenges of tracking in low-light environments. In low-light scenes, lighting may change dramatically, targets may lack distinct texture features, and in some scenarios, targets may not be directly observable. These factors can lead to a severe decline in tracking performance. To address this issue, we introduce LLOT, a benchmark specifically designed for Low-Light Object Tracking. LLOT comprises 269 challenging sequences with a total of over 132K frames, each carefully annotated with bounding boxes. This specially designed dataset aims to promote innovation and advancement in object tracking techniques for low-light conditions, addressing challenges not adequately covered by existing benchmarks. To assess the performance of existing methods on LLOT, we conducted extensive tests on 39 state-of-the-art tracking algorithms. The results highlight a considerable gap in low-light tracking performance. In response, we propose H-DCPT, a novel tracker that incorporates historical and darkness clue prompts to set a stronger baseline. H-DCPT outperformed all 39 evaluated methods in our experiments, demonstrating significant improvements. We hope that our benchmark and H-DCPT will stimulate the development of novel and accurate methods for tracking objects in low-light conditions. The LLOT and code are available at https://github.com/OpenCodeGithub/H-DCPT.

8/22/2024