JDEC: JPEG Decoding via Enhanced Continuous Cosine Coefficients

Read original: arXiv:2404.05558 - Published 4/9/2024 by Woo Kyoung Han, Sunghoon Im, Jaedeok Kim, Kyong Hwan Jin

🏷️

Overview

Proposes a practical approach to improving JPEG image decoding using a local implicit neural representation with continuous cosine formulation
Addresses quality degradation caused by the JPEG algorithm's significant quantization of discrete cosine transform (DCT) spectra
Designed a continuous cosine spectrum estimator to restore the distorted spectrum
Leverages local DCT formulations to exploit dequantization and upsampling simultaneously
Enables direct decoding of compressed images across different quality factors using a single pre-trained model

Plain English Explanation

The paper presents a new way to improve the quality of JPEG-compressed images. JPEG is a widely used image compression algorithm, but it comes with a trade-off - the higher the compression, the lower the image quality. This is because JPEG significantly reduces the amount of information stored in the image by quantizing the discrete cosine transform (DCT) spectra, which are mathematical representations of the image data.

The researchers have designed a neural network that can "estimate" the original, high-quality DCT spectra from the heavily quantized data in JPEG-compressed images. By leveraging the local structure of the DCT, their model can simultaneously dequantize and upsample the image, restoring the lost detail and sharpness. This allows their model to directly decode JPEG-compressed images at different quality levels using a single pre-trained network, without needing a traditional JPEG decoder.

The key innovation is the use of a continuous cosine formulation to represent the DCT spectra, which helps the model better capture the underlying structure of the image data. This, combined with the local processing approach, allows the model to effectively undo the quality degradation caused by JPEG compression.

Technical Explanation

The proposed model utilizes a local implicit neural representation with a continuous cosine formulation to estimate the original DCT spectra from heavily quantized JPEG-compressed data. By leveraging the local structure of the DCT, the model can exploit dequantization and upsampling simultaneously.

The key components of the model include:

Continuous Cosine Spectrum Estimator: This module estimates the original, high-quality DCT spectra from the distorted, quantized spectra in the JPEG-compressed image.
Local DCT Formulation: The model processes the image in local patches, allowing it to better capture the spatial structure of the DCT and perform dequantization and upsampling more effectively.

The researchers trained the model end-to-end using a single pre-trained network, which enables direct decoding of JPEG-compressed images at different quality factors without relying on a traditional JPEG decoder. This approach achieves state-of-the-art performance in flexible color image JPEG artifact removal tasks.

Critical Analysis

The paper presents a novel and practical approach to improving the quality of JPEG-compressed images. The use of a continuous cosine formulation and local processing is a clever way to exploit the underlying structure of the DCT spectra and effectively undo the quality degradation caused by JPEG compression.

However, the paper does not discuss the computational complexity or inference speed of the proposed model, which could be a important consideration for real-world applications. Additionally, while the model is shown to outperform existing methods on JPEG artifact removal tasks, it would be interesting to see how it performs on other image processing tasks, such as secure lossy image compression or low-light image enhancement.

The authors also mention that their model can directly decode JPEG-compressed images at different quality factors using a single pre-trained model, but it would be valuable to see a more detailed analysis of the model's flexibility and robustness across a wider range of compression levels and image types.

Conclusion

The proposed approach for JPEG image decoding represents a significant advancement in the field of image compression and artifact removal. By leveraging a continuous cosine formulation and local processing, the model is able to effectively restore the quality of JPEG-compressed images, outperforming existing methods.

This research has the potential to improve the user experience in a variety of applications that rely on JPEG-compressed images, such as content-aware image resizing or low-light image enhancement. The ability to directly decode JPEG-compressed images at different quality factors using a single pre-trained model also suggests that the proposed approach could be highly scalable and practical for real-world deployment.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🏷️

JDEC: JPEG Decoding via Enhanced Continuous Cosine Coefficients

Woo Kyoung Han, Sunghoon Im, Jaedeok Kim, Kyong Hwan Jin

We propose a practical approach to JPEG image decoding, utilizing a local implicit neural representation with continuous cosine formulation. The JPEG algorithm significantly quantizes discrete cosine transform (DCT) spectra to achieve a high compression rate, inevitably resulting in quality degradation while encoding an image. We have designed a continuous cosine spectrum estimator to address the quality degradation issue that restores the distorted spectrum. By leveraging local DCT formulations, our network has the privilege to exploit dequantization and upsampling simultaneously. Our proposed model enables decoding compressed images directly across different quality factors using a single pre-trained model without relying on a conventional JPEG decoder. As a result, our proposed network achieves state-of-the-art performance in flexible color image JPEG artifact removal tasks. Our source code is available at https://github.com/WooKyoungHan/JDEC.

4/9/2024

🌐

JPEG Quantized Coefficient Recovery via DCT Domain Spatial-Frequential Transformer

Mingyu Ouyang, Zhenzhong Chen

JPEG compression adopts the quantization of Discrete Cosine Transform (DCT) coefficients for effective bit-rate reduction, whilst the quantization could lead to a significant loss of important image details. Recovering compressed JPEG images in the frequency domain has recently garnered increasing interest, complementing the multitude of restoration techniques established in the pixel domain. However, existing DCT domain methods typically suffer from limited effectiveness in handling a wide range of compression quality factors or fall short in recovering sparse quantized coefficients and the components across different colorspaces. To address these challenges, we propose a DCT domain spatial-frequential Transformer, namely DCTransformer, for JPEG quantized coefficient recovery. Specifically, a dual-branch architecture is designed to capture both spatial and frequential correlations within the collocated DCT coefficients. Moreover, we incorporate the operation of quantization matrix embedding, which effectively allows our single model to handle a wide range of quality factors, and a luminance-chrominance alignment head that produces a unified feature map to align different-sized luminance and chrominance components. Our proposed DCTransformer outperforms the current state-of-the-art JPEG artifact removal techniques, as demonstrated by our extensive experiments.

5/6/2024

Approximate DCT and Quantization Techniques for Energy-Constrained Image Sensors

Ming-Che Li, Archisman Ghosh, Shreyas Sen

Recent expansions in multimedia devices gather enormous amounts of real-time images for processing and inference. The images are first compressed using compression schemes, like JPEG, to reduce storage costs and power for transmitting the captured data. Due to inherent error resilience and imperceptibility in images, JPEG can be approximated to reduce the required computation power and area. This work demonstrates the first end-to-end approximation computing-based optimization of JPEG hardware using i) an approximate division realized using bit-shift operators to reduce the complexity of the quantization block, ii) loop perforation, and iii) precision scaling on top of a multiplier-less fast DCT architecture to achieve an extremely energy-efficient JPEG compression unit which will be a perfect fit for power/bandwidth-limited scenario. Furthermore, a gradient descent-based heuristic composed of two conventional approximation strategies, i.e., Precision Scaling and Loop Perforation, is implemented for tuning the degree of approximation to trade off energy consumption with the quality degradation of the decoded image. The entire RTL design is coded in Verilog HDL, synthesized, mapped to TSMC 65nm CMOS technology, and simulated using Cadence Spectre Simulator under 25$^{circ}$textbf{C}, TT corner. The approximate division approach achieved around $textbf{28%}$ reduction in the active design area. The heuristic-based approximation technique combined with accelerator optimization achieves a significant energy reduction of $textbf{36%}$ for a minimal image quality degradation of $textbf{2%}$ SAD. Simulation results also show that the proposed architecture consumes 15uW at the DCT and quantization stages to compress a colored 480p image at 6fps.

6/25/2024

Edge-based Denoising Image Compression

Ryugo Morita, Hitoshi Nishimura, Ko Watanabe, Andreas Dengel, Jinjia Zhou

In recent years, deep learning-based image compression, particularly through generative models, has emerged as a pivotal area of research. Despite significant advancements, challenges such as diminished sharpness and quality in reconstructed images, learning inefficiencies due to mode collapse, and data loss during transmission persist. To address these issues, we propose a novel compression model that incorporates a denoising step with diffusion models, significantly enhancing image reconstruction fidelity by sub-information(e.g., edge and depth) from leveraging latent space. Empirical experiments demonstrate that our model achieves superior or comparable results in terms of image quality and compression efficiency when measured against the existing models. Notably, our model excels in scenarios of partial image loss or excessive noise by introducing an edge estimation network to preserve the integrity of reconstructed images, offering a robust solution to the current limitations of image compression.

9/18/2024