WB LUTs: Contrastive Learning for White Balancing Lookup Tables

Read original: arXiv:2404.10133 - Published 4/17/2024 by Sai Kumar Reddy Manne, Michael Wan

Overview

The paper proposes WB LUTs, a 3D lookup table (LUT) based model for white balance correction that is trained with a contrastive learning framework.
White balancing is an important image processing task that adjusts the color temperature of an image to correct for the effects of different lighting conditions.
The authors introduce a contrastive learning framework with a hard sample mining strategy to learn effective WB LUTs directly from data; the abstract reports that this improves the WB correction quality of baseline 3D LUTs by 25.5%.
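
As a point of reference for what "white balancing" means in practice, the sketch below implements the classical gray-world heuristic in NumPy. This is not the method from the paper; it is a well-known baseline shown only to illustrate per-channel color correction. The function name and the assumption that pixel values lie in [0, 1] are illustrative choices.

```python
import numpy as np

def gray_world_white_balance(image):
    """Scale each channel so that all channel means match the overall mean.

    Assumes `image` is an (H, W, 3) RGB array with values in [0, 1].
    """
    image = np.asarray(image, dtype=np.float64)
    channel_means = image.reshape(-1, 3).mean(axis=0)
    # One gain per channel; the small floor avoids division by zero.
    gains = channel_means.mean() / np.maximum(channel_means, 1e-8)
    return np.clip(image * gains, 0.0, 1.0)

# A flat gray scene with a warm cast: red is boosted, blue is suppressed.
warm = np.ones((4, 4, 3)) * np.array([0.6, 0.5, 0.4])
balanced = gray_world_white_balance(warm)
```

Gray-world applies a single gain per channel, so it cannot express the color-dependent adjustments that a 3D LUT can; it is included here only as a simple baseline.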

Plain English Explanation

The paper is about a new way to automatically adjust the colors in digital images to make them look more natural and accurate. When you take a photo, the lighting conditions can affect the colors - for example, images taken in warm, yellow-tinted lighting can look unnatural. To fix this, a process called "white balancing" is used to adjust the colors in the image.

Traditionally, white balancing has relied on complex mathematical models or manual color calibration, which can be time-consuming and difficult. The authors of this paper propose a new approach based on "contrastive learning". This involves training a neural network to learn how to adjust colors directly from a large dataset of images, without requiring manual tuning or complicated models.

The key insight is that by training the neural network to distinguish between "good" and "bad" white balance, it can learn to automatically apply the right color adjustments to new images. This makes the white balancing process much simpler and more efficient, which could be useful for a wide range of applications, from photography to video editing.

Technical Explanation

The paper introduces a contrastive learning framework for learning white balancing lookup tables (WB LUTs) directly from data. According to the abstract, WB LUTs is a 3D LUT based correction model that operates on rendered sRGB images and can generate high resolution outputs in real time. A 3D LUT maps each input RGB color to an output RGB color, so applying it is a per-pixel lookup.
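
To make the lookup-table idea concrete, here is a minimal NumPy sketch of applying a 3D LUT to an image with trilinear interpolation. It is an illustration under stated assumptions, not the authors' implementation: the function names `apply_3d_lut` and `identity_lut`, the (N, N, N, 3) layout indexed as lut[r, g, b], and the [0, 1] value range are all hypothetical choices.

```python
import numpy as np

def identity_lut(n):
    """Build an (n, n, n, 3) LUT that maps every color to itself."""
    axis = np.linspace(0.0, 1.0, n)
    r, g, b = np.meshgrid(axis, axis, axis, indexing="ij")
    return np.stack([r, g, b], axis=-1)

def apply_3d_lut(image, lut):
    """Apply a 3D LUT to an (H, W, 3) image in [0, 1] via trilinear interpolation."""
    n = lut.shape[0]  # grid size per axis, assumed >= 2
    pos = np.clip(image, 0.0, 1.0) * (n - 1)           # continuous grid coordinates
    lo = np.minimum(np.floor(pos).astype(int), n - 2)  # lower corner of the enclosing cell
    frac = pos - lo                                    # offset inside the cell, in [0, 1]
    out = np.zeros(image.shape, dtype=np.float64)
    # Blend the 8 corners of the cell, weighted by proximity along each axis.
    for dr in (0, 1):
        wr = frac[..., 0] if dr else 1.0 - frac[..., 0]
        for dg in (0, 1):
            wg = frac[..., 1] if dg else 1.0 - frac[..., 1]
            for db in (0, 1):
                wb = frac[..., 2] if db else 1.0 - frac[..., 2]
                corner = lut[lo[..., 0] + dr, lo[..., 1] + dg, lo[..., 2] + db]
                out += (wr * wg * wb)[..., None] * corner
    return out

rng = np.random.default_rng(0)
image = rng.random((8, 8, 3))
unchanged = apply_3d_lut(image, identity_lut(9))

# A toy "cooling" LUT: damp red, boost blue (hypothetical gains).
gains = np.array([0.8, 1.0, 1.1])
cooler = apply_3d_lut(image, identity_lut(9) * gains)
```

Each pixel is handled independently with a fixed number of lookups, so the cost of this step grows with pixel count rather than with network depth. That is one plausible reason a LUT-based model suits high resolution inputs.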

Traditionally, white balancing has relied on manual color calibration or complex mathematical models. The authors propose a data-driven approach that learns WB LUTs in an end-to-end manner, without requiring manual tuning or complex modeling. The key idea is to train a neural network to distinguish between "good" and "bad" white balance, using a contrastive learning objective.

Specifically, the authors construct a dataset of images with known white balance parameters, and train a neural network to predict the correct WB LUT for each input image. The network is trained to maximize the similarity between the predicted LUT and the ground truth LUT, while minimizing the similarity between the predicted LUT and "negative" LUTs sampled from the dataset. Per the abstract, the framework also includes a novel hard sample mining strategy.
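
The pull-together, push-apart objective described above can be sketched with a generic InfoNCE-style loss. This summary does not specify the paper's exact loss or its hard sample mining rule, so everything here is an assumption for illustration: the function names, the use of cosine similarity, the temperature value, and the idea that each sample is represented by a flat feature vector.

```python
import numpy as np

def cosine_similarity(a, b):
    """Cosine similarity along the last axis, with broadcasting."""
    a = a / (np.linalg.norm(a, axis=-1, keepdims=True) + 1e-8)
    b = b / (np.linalg.norm(b, axis=-1, keepdims=True) + 1e-8)
    return np.sum(a * b, axis=-1)

def info_nce_loss(anchor, positive, negatives, temperature=0.1):
    """InfoNCE-style loss for a single anchor.

    anchor, positive: (D,) vectors; negatives: (K, D) array.
    The loss is low when the anchor is closer to the positive than to any negative.
    """
    pos_logit = cosine_similarity(anchor, positive) / temperature
    neg_logits = cosine_similarity(anchor[None, :], negatives) / temperature
    logits = np.concatenate([[pos_logit], neg_logits])
    logits = logits - logits.max()  # subtract the max for numerical stability
    return -(logits[0] - np.log(np.exp(logits).sum()))

# Hypothetical feature vectors standing in for predicted and reference outputs.
anchor = np.array([1.0, 0.0, 0.0])
positive = np.array([0.9, 0.1, 0.0])
negatives = np.array([[0.0, 1.0, 0.0], [0.0, 0.0, 1.0]])

aligned_loss = info_nce_loss(anchor, positive, negatives)
# Swap roles so the anchor sits next to a negative instead of its positive.
misaligned_loss = info_nce_loss(anchor, negatives[0], np.stack([positive, negatives[1]]))
```

In this toy setup the aligned case yields a much smaller loss than the misaligned one, which is the behavior a contrastive objective rewards. A hard sample mining step would typically favor negatives that are most similar to the anchor, but how the paper selects them is not described in this summary.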

In experiments on two benchmark datasets, the authors report that WB LUTs perform competitively against state-of-the-art models while being 300 times faster and using 12.7 times less memory. The contrastive learning framework is reported to improve the WB correction quality of baseline 3D LUTs by 25.5%. Because the model produces high resolution outputs in real time, it avoids the post-processing step that bottlenecks methods which predict corrections at a lower resolution.

Critical Analysis

The paper presents a novel and promising approach for learning white balancing lookup tables using contrastive learning. The key strength of the method is its ability to learn effective WB LUTs directly from data, without relying on manual color calibration or complex mathematical models.

One potential limitation of the approach is that it requires a dataset of images with known white balance parameters, which may not always be available. The authors address this by using synthetic data generation, but this could introduce its own set of biases and limitations.

Additionally, the paper does not explore the generalization capabilities of the learned WB LUTs to diverse lighting conditions or camera sensors. Further research is needed to assess the robustness and adaptability of the proposed framework in real-world scenarios.

Another area for future work could be to investigate the incorporation of additional contextual information, such as scene content or metadata, to further improve the white balancing performance.

Overall, the paper presents a promising step forward in the field of computational photography and offers a novel approach to the long-standing challenge of automatic white balancing.

Conclusion

The WB LUTs paper introduces a contrastive learning framework for learning effective white balancing lookup tables directly from data. This approach offers a more efficient and data-driven alternative to traditional white balancing methods, which often rely on manual calibration or complex mathematical modeling.

The key contribution of the paper is the introduction of a novel contrastive learning objective that allows the neural network to learn how to predict the correct white balance adjustments for a given input image. This has the potential to significantly streamline the white balancing process and make it more accessible for a wide range of applications, from photography to video editing.

While the paper presents promising results, there are still areas for further research, such as exploring the generalization capabilities of the learned WB LUTs and investigating the incorporation of additional contextual information.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

WB LUTs: Contrastive Learning for White Balancing Lookup Tables

Sai Kumar Reddy Manne, Michael Wan

Automatic white balancing (AWB), one of the first steps in an integrated signal processing (ISP) pipeline, aims to correct the color cast induced by the scene illuminant. An incorrect white balance (WB) setting or AWB failure can lead to an undesired blue or red tint in the rendered sRGB image. To address this, recent methods pose the post-capture WB correction problem as an image-to-image translation task and train deep neural networks to learn the necessary color adjustments at a lower resolution. These low resolution outputs are post-processed to generate high resolution WB corrected images, forming a bottleneck in the end-to-end run time. In this paper we present a 3D Lookup Table (LUT) based WB correction model called WB LUTs that can generate high resolution outputs in real time. We introduce a contrastive learning framework with a novel hard sample mining strategy, which improves the WB correction quality of baseline 3D LUTs by 25.5%. Experimental results demonstrate that the proposed WB LUTs perform competitively against state-of-the-art models on two benchmark datasets while being 300 times faster using 12.7 times less memory. Our model and code are available at https://github.com/skrmanne/3DLUT_sRGB_WB.

4/17/2024

Taming Lookup Tables for Efficient Image Retouching

Sidi Yang, Binxiao Huang, Mingdeng Cao, Yatai Ji, Hanzhong Guo, Ngai Wong, Yujiu Yang

The widespread use of high-definition screens in edge devices, such as end-user cameras, smartphones, and televisions, is spurring a significant demand for image enhancement. Existing enhancement models often optimize for high performance while falling short of reducing hardware inference time and power consumption, especially on edge devices with constrained computing and storage resources. To this end, we propose Image Color Enhancement Lookup Table (ICELUT) that adopts LUTs for extremely efficient edge inference, without any convolutional neural network (CNN). During training, we leverage pointwise (1x1) convolution to extract color information, alongside a split fully connected layer to incorporate global information. Both components are then seamlessly converted into LUTs for hardware-agnostic deployment. ICELUT achieves near-state-of-the-art performance and remarkably low power consumption. We observe that the pointwise network structure exhibits robust scalability, upkeeping the performance even with a heavily downsampled 32x32 input image. These enable ICELUT, the first-ever purely LUT-based image enhancer, to reach an unprecedented speed of 0.4ms on GPU and 7ms on CPU, at least one order faster than any CNN solution. Codes are available at https://github.com/Stephen0808/ICELUT.

7/16/2024

Optimizing 4D Lookup Table for Low-light Video Enhancement via Wavelet Priori

Jinhong He, Minglong Xue, Wenhai Wang, Mingliang Zhou

Low-light video enhancement is highly demanding in maintaining spatiotemporal color consistency. Therefore, improving the accuracy of color mapping and keeping the latency low is challenging. Based on this, we propose incorporating Wavelet-priori for 4D Lookup Table (WaveLUT), which effectively enhances the color coherence between video frames and the accuracy of color mapping while maintaining low latency. Specifically, we use the wavelet low-frequency domain to construct an optimized lookup prior and achieve an adaptive enhancement effect through a designed Wavelet-prior 4D lookup table. To effectively compensate the a priori loss in the low light region, we further explore a dynamic fusion strategy that adaptively determines the spatial weights based on the correlation between the wavelet lighting prior and the target intensity structure. In addition, during the training phase, we devise a text-driven appearance reconstruction method that dynamically balances brightness and content through multimodal semantics-driven Fourier spectra. Extensive experiments on a wide range of benchmark datasets show that this method effectively enhances the previous method's ability to perceive the color space and achieves metric-favorable and perceptually oriented real-time enhancement while maintaining high efficiency.

9/16/2024

A Learnable Color Correction Matrix for RAW Reconstruction

Anqi Liu, Shiyi Mu, Shugong Xu

Autonomous driving algorithms usually employ sRGB images as model input due to their compatibility with the human visual system. However, visually pleasing sRGB images are possibly sub-optimal for downstream tasks when compared to RAW images. The availability of RAW images is constrained by the difficulties in collecting real-world driving data and the associated challenges of annotation. To address this limitation and support research in RAW-domain driving perception, we design a novel and ultra-lightweight RAW reconstruction method. The proposed model introduces a learnable color correction matrix (CCM), which uses only a single convolutional layer to approximate the complex inverse image signal processor (ISP). Experimental results demonstrate that simulated RAW (simRAW) images generated by our method provide performance improvements equivalent to those produced by more complex inverse ISP methods when pretraining RAW-domain object detectors, which highlights the effectiveness and practicality of our approach.

9/5/2024