Fourier-enhanced Implicit Neural Fusion Network for Multispectral and Hyperspectral Image Fusion

Read original: arXiv:2404.15174 - Published 4/24/2024 by Yu-Jie Liang, Zihan Cao, Liang-Jian Deng, Xiao Wu

Overview

Recent advances in implicit neural representations (INRs) have opened up new approaches to Multispectral and Hyperspectral Image Fusion (MHIF).
However, INRs tend to lose high-frequency information and lack global perceptual capability.
This paper introduces the Fourier-enhanced Implicit Neural Fusion Network (FeINFN), which aims to address both issues in MHIF tasks.

Plain English Explanation

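Hyperspectral and multispectral imaging record how a scene reflects light across many wavelength bands, which reveals information about the composition and properties of materials and objects. Such images are used in remote sensing, medical imaging, and environmental monitoring. Hyperspectral sensors typically trade spatial resolution for spectral detail, so fusion combines a low-resolution hyperspectral image with a higher-resolution multispectral image of the same scene to obtain a result that is sharp in both respects.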

Recently, implicit neural representations (INRs) have shown promise for this fusion task. An INR uses a neural network to represent an image as a continuous function of pixel coordinates, but the approach has limitations. It can struggle to capture high-frequency details such as fine edges and textures, and it may have trouble taking the broader context of an image into account.
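As a rough illustration of the idea (not the paper's architecture), an INR can be sketched as a small network that maps pixel coordinates to values, so the same network can be sampled on a grid of any resolution. The names make_grid and TinyINR, the sine activation, the number of bands, and the random, untrained weights are all assumptions made only for this sketch.

```python
import numpy as np


def make_grid(height, width):
    """Return pixel-centre coordinates in [-1, 1] with shape (height*width, 2)."""
    ys = (np.arange(height) + 0.5) / height * 2.0 - 1.0
    xs = (np.arange(width) + 0.5) / width * 2.0 - 1.0
    yy, xx = np.meshgrid(ys, xs, indexing="ij")
    return np.stack([yy.ravel(), xx.ravel()], axis=-1)


class TinyINR:
    """Hypothetical two-layer coordinate network: (y, x) -> per-band values."""

    def __init__(self, hidden=32, bands=4, seed=0):
        rng = np.random.default_rng(seed)
        # Untrained random weights; a real INR would be fitted to image data.
        self.w1 = rng.normal(0.0, 1.0, size=(2, hidden))
        self.b1 = np.zeros(hidden)
        self.w2 = rng.normal(0.0, 1.0 / np.sqrt(hidden), size=(hidden, bands))
        self.b2 = np.zeros(bands)

    def __call__(self, coords):
        # Sine activation is an assumption chosen for this sketch.
        features = np.sin(coords @ self.w1 + self.b1)
        return features @ self.w2 + self.b2


inr = TinyINR()
# The same continuous function is sampled on two different grids.
coarse = inr(make_grid(8, 8)).reshape(8, 8, 4)
fine = inr(make_grid(32, 32)).reshape(32, 32, 4)
print(coarse.shape, fine.shape)
```

Because the network takes coordinates rather than a fixed pixel array, the output resolution is chosen at query time, which is what makes INRs attractive for upsampling tasks such as fusion.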

To address these issues, the researchers developed a new method called the Fourier-enhanced Implicit Neural Fusion Network (FeINFN). The key ideas behind FeINFN are:

Exploiting the observation that the Fourier amplitudes of the high-resolution latent code and the low-resolution hyperspectral image are similar, while their phases differ
Using a fusion function that works in both the spatial and frequency domains to help the INR capture high-frequency details
Employing a new decoder that strengthens the interaction between different features in the INR representation

Technical Explanation

The paper introduces the Fourier-enhanced Implicit Neural Fusion Network (FeINFN), which aims to address the limitations of INR-based approaches for Multispectral and Hyperspectral Image Fusion (MHIF) tasks.

The researchers observed that the Fourier amplitudes of the high-resolution hyperspectral image (HR-HSI) latent code and the low-resolution hyperspectral image (LR-HSI) are remarkably similar, but their phase patterns differ. To leverage this insight, FeINFN incorporates a spatial and frequency implicit fusion function (Spa-Fre IFF) that helps INR capture high-frequency information and expand the receptive field.
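The amplitude and phase referred to here are the magnitude and angle of the complex Fourier spectrum. The sketch below uses a random array as a stand-in for one image band (an assumption, not data from the paper), splits it into the two parts, and rebuilds it. It also scales a single frequency coefficient and counts how many pixels change, which is one way to see why a frequency-domain branch can widen the receptive field. The function names are hypothetical.

```python
import numpy as np


def amplitude_phase(band):
    """Split a 2-D array into Fourier amplitude and phase."""
    spectrum = np.fft.fft2(band)
    return np.abs(spectrum), np.angle(spectrum)


def from_amplitude_phase(amplitude, phase):
    """Rebuild the spatial signal from amplitude and phase."""
    spectrum = amplitude * np.exp(1j * phase)
    return np.real(np.fft.ifft2(spectrum))


rng = np.random.default_rng(0)
band = rng.random((16, 16))  # stand-in for one spectral band

amplitude, phase = amplitude_phase(band)
rebuilt = from_amplitude_phase(amplitude, phase)
print("max reconstruction error:", np.abs(rebuilt - band).max())

# Scale one frequency coefficient and count how many pixels change.
edited = amplitude.copy()
edited[0, 1] *= 2.0
edited[0, -1] *= 2.0  # conjugate partner, keeps the spectrum Hermitian
changed = from_amplitude_phase(edited, phase)
num_changed = int(np.sum(~np.isclose(changed, band)))
print("pixels changed:", num_changed, "of", band.size)
```

A single edit in the frequency domain spreads across the whole image, whereas a small spatial convolution only touches a local neighbourhood.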

Additionally, FeINFN employs a new decoder called the Spatial-Frequency Interactive Decoder (SFID), which uses a complex Gabor wavelet activation function. The researchers theoretically prove that this Gabor wavelet activation possesses a time-frequency tightness property, which helps the decoder learn the optimal bandwidths and enhance the interaction of INR features.
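For reference, a complex Gabor wavelet is a complex sinusoid multiplied by a Gaussian window. The sketch below uses a commonly seen form, exp(j·omega0·x) · exp(−|s0·x|²); whether SFID uses exactly this parameterization is an assumption, and the names omega0 and s0 and their default values are illustrative only.

```python
import numpy as np


def complex_gabor(x, omega0=10.0, s0=10.0):
    """Complex Gabor wavelet: exp(1j*omega0*x) * exp(-|s0*x|^2).

    omega0 sets the oscillation frequency and s0 the width of the
    Gaussian window; both are illustrative defaults, not values
    taken from the paper.
    """
    x = np.asarray(x)
    return np.exp(1j * omega0 * x) * np.exp(-np.abs(s0 * x) ** 2)


x = np.linspace(-0.5, 0.5, 5)
y = complex_gabor(x)
# The magnitude is the Gaussian envelope; the angle carries the oscillation.
print(np.abs(y))
print(np.angle(y))
```

The Gaussian factor keeps the response localized around zero input while the sinusoid selects a frequency band, so the activation is localized in both domains at once.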

Experiments on two benchmark MHIF datasets indicate that FeINFN achieves state-of-the-art performance, both visually and quantitatively. Ablation studies further support the contributions of the proposed Spa-Fre IFF and SFID components.

Critical Analysis

The paper presents a well-designed and thoughtful approach to addressing the limitations of INR-based methods for MHIF tasks. The researchers' observation that the Fourier amplitudes of the HR-HSI latent code and the LR-HSI are similar while their phases differ is particularly interesting and forms the basis for their fusion function.

However, the paper does not discuss the computational complexity or inference time of FeINFN, which could be an important consideration for real-world applications. Additionally, the researchers mention that INR-based methods are prone to losing high-frequency information, but they do not provide a deep analysis of why this occurs or how FeINFN specifically overcomes this challenge.

Further research could explore the generalization of FeINFN to other image fusion tasks beyond MHIF, such as visible-infrared fusion or multimodal fusion for semantic segmentation. Investigating the performance of FeINFN on more diverse datasets would also help validate its robustness and broader applicability.

Conclusion

This paper presents the Fourier-enhanced Implicit Neural Fusion Network (FeINFN), a novel approach for Multispectral and Hyperspectral Image Fusion (MHIF) tasks. Building on the similarity of Fourier amplitudes between the high-resolution latent code and the low-resolution hyperspectral image, FeINFN incorporates specialized fusion and decoding mechanisms to address the limitations of existing INR-based methods.

The reported results indicate that FeINFN achieves state-of-the-art performance on two benchmark MHIF datasets, both visually and quantitatively. This work contributes insights and techniques that could advance the field of image fusion and inspire further research in this area.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!
