RSDehamba: Lightweight Vision Mamba for Remote Sensing Satellite Image Dehazing

Read original: arXiv:2405.10030 - Published 5/17/2024 by Huiling Zhou, Xianhao Wu, Hongming Chen, Xiang Chen, Xin He

Overview

Presents a lightweight deep learning model called RSDehamba for dehazing remote sensing satellite images
Leverages a selective state space model (SSM) and a Mamba U-Net architecture for efficient haze removal
Demonstrates superior performance compared to existing methods on benchmark datasets

Plain English Explanation

The paper introduces a new deep learning model called RSDehamba that is designed to remove haze and improve the clarity of remote sensing satellite images. Haze and atmospheric interference can often degrade the quality of satellite imagery, making it harder to see and analyze the underlying features of the landscape.

RSDehamba uses a state space model and a specialized Mamba U-Net architecture to efficiently identify and remove haze from satellite images. This allows the model to preserve important details while improving the overall clarity and visibility of the imagery.

The researchers report that RSDehamba outperforms other dehazing methods on standard benchmark datasets. This suggests it could be a valuable tool for remote sensing applications that depend on clear imagery, such as mapping.

Technical Explanation

The key innovation in RSDehamba is its combination of a selective state space model (SSM) with a Mamba U-Net architecture, that is, a U-Net whose blocks are built on the SSM. The SSM captures long-range dependencies across the image at a cost that scales linearly with input size, so the network can relate distant hazy regions without the quadratic overhead of Transformer attention. This gives the U-Net, a popular encoder-decoder convolutional network, a global view of the scene when removing haze.
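To make the state space idea concrete, here is a minimal sketch of a discrete linear state space recurrence with a diagonal state matrix, in plain Python. It is not the authors' implementation: the function name `ssm_scan` and the parameters `a`, `b`, and `c` are illustrative assumptions, and selective SSMs such as Mamba additionally make these parameters depend on the input. What the sketch shows is that a single pass over the sequence is enough, so the cost grows linearly with sequence length.

```python
from typing import List, Sequence


def ssm_scan(
    x: Sequence[float],
    a: Sequence[float],
    b: Sequence[float],
    c: Sequence[float],
) -> List[float]:
    """Run a discrete linear SSM with a diagonal state matrix.

    State update (elementwise over the state dimension):
        h_t = a * h_{t-1} + b * x_t
    Output:
        y_t = sum(c * h_t)

    All names here are hypothetical; this is a fixed-parameter SSM,
    not the input-dependent (selective) variant used by Mamba.
    """
    if not (len(a) == len(b) == len(c)):
        raise ValueError("a, b and c must have the same length")

    h = [0.0] * len(a)  # hidden state starts at zero
    y: List[float] = []
    for x_t in x:  # one pass over the sequence: O(len(x) * len(a))
        h = [a_i * h_i + b_i * x_t for a_i, h_i, b_i in zip(a, h, b)]
        y.append(sum(c_i * h_i for c_i, h_i in zip(c, h)))
    return y


if __name__ == "__main__":
    # An impulse input decays geometrically through a one-dimensional state.
    print(ssm_scan([1.0, 0.0, 0.0], a=[0.5], b=[1.0], c=[1.0]))
```

Each output depends on every earlier input through the hidden state, which is how a recurrence of this kind can carry information across long distances without comparing every pair of positions.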

The Mamba U-Net architecture is designed to be lightweight and computationally efficient. According to the paper's abstract, its core component is the Vision Dehamba Block (VDB), which uses the linear complexity of the SSM to encode global context. A Direction-aware Scan Module (DSM) aggregates features across different scan directions so that the network can adapt to the spatially varying distribution of haze.
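The paper's abstract describes a Direction-aware Scan Module (DSM) that aggregates features over different directional domains. The sketch below illustrates the general idea of multi-directional scanning only, under stated assumptions: a 2D feature map is unrolled into four 1D sequences (row-major and column-major, each forward and backward), a sequence function is applied to each, and the results are averaged. The helper names, the choice of four directions, and the plain average are all hypothetical; the abstract says the real module aggregates dynamically, which a fixed average does not capture.

```python
from typing import Callable, List

Grid = List[List[float]]
SeqFn = Callable[[List[float]], List[float]]


def flatten_rows(img: Grid) -> List[float]:
    """Unroll a 2D grid into a 1D sequence in row-major order."""
    return [v for row in img for v in row]


def unflatten_rows(seq: List[float], n_rows: int, n_cols: int) -> Grid:
    """Inverse of flatten_rows for a grid with the given shape."""
    return [seq[r * n_cols:(r + 1) * n_cols] for r in range(n_rows)]


def transpose(img: Grid) -> Grid:
    """Swap rows and columns."""
    return [list(col) for col in zip(*img)]


def directional_scan(img: Grid, scan: SeqFn) -> Grid:
    """Apply `scan` along four directions and average the results.

    `scan` stands in for a sequence model such as an SSM (assumption).
    """
    h, w = len(img), len(img[0])
    seq = flatten_rows(img)               # row-major order
    seq_t = flatten_rows(transpose(img))  # column-major order

    outputs = [
        # left-to-right, top-to-bottom
        unflatten_rows(scan(seq), h, w),
        # same path walked backward, then restored to original order
        unflatten_rows(scan(seq[::-1])[::-1], h, w),
        # top-to-bottom, left-to-right
        transpose(unflatten_rows(scan(seq_t), w, h)),
        # column path walked backward, then restored
        transpose(unflatten_rows(scan(seq_t[::-1])[::-1], w, h)),
    ]
    # Fixed average as a placeholder for a learned, dynamic aggregation.
    return [
        [sum(o[r][c] for o in outputs) / len(outputs) for c in range(w)]
        for r in range(h)
    ]


def running_sum(seq: List[float]) -> List[float]:
    """Toy causal sequence function: cumulative sum."""
    total, out = 0.0, []
    for v in seq:
        total += v
        out.append(total)
    return out


if __name__ == "__main__":
    print(directional_scan([[1.0, 2.0], [3.0, 4.0]], running_sum))
```

A causal sequence function only sees what came before it along one path, so combining several scan directions lets every pixel receive context from all sides of the image.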

The researchers evaluate RSDehamba on standard remote sensing image dehazing benchmarks and report improvements over existing state-of-the-art methods. They attribute this performance to the combination of the state space model and the efficient Mamba U-Net design.
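This summary does not say which metrics the benchmark comparison uses. Peak signal-to-noise ratio (PSNR), which one of the related abstracts on this page reports, is a common choice for image restoration, so the sketch below shows how it is typically computed between a clear reference image and a dehazed output. The function name `psnr`, the nested-list image format, and the default `max_value` of 1.0 are assumptions for illustration.

```python
import math
from typing import List

Image = List[List[float]]


def psnr(reference: Image, restored: Image, max_value: float = 1.0) -> float:
    """Peak signal-to-noise ratio in dB between two same-sized images.

    PSNR = 10 * log10(max_value^2 / MSE); higher means closer to the reference.
    """
    ref = [v for row in reference for v in row]
    out = [v for row in restored for v in row]
    if len(ref) != len(out) or not ref:
        raise ValueError("images must be non-empty and the same size")

    mse = sum((r - o) ** 2 for r, o in zip(ref, out)) / len(ref)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    clear = [[0.0, 0.0], [0.0, 0.0]]
    dehazed = [[0.1, 0.1], [0.1, 0.1]]
    print(f"{psnr(clear, dehazed):.2f} dB")
```

Because PSNR is logarithmic, small differences in dB can correspond to visible differences in restoration quality, which is why papers in this area often report gains of a fraction of a decibel.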

Critical Analysis

The paper provides a comprehensive evaluation of RSDehamba's performance, including comparisons to several state-of-the-art dehazing techniques. However, the authors acknowledge that the model's effectiveness may be limited in certain challenging scenarios, such as when dealing with very dense or heterogeneous haze.

Additionally, while the Mamba U-Net architecture is designed to be lightweight, the computational and memory requirements of the overall system are not fully explored. Further research may be needed to assess the model's suitability for deployment on resource-constrained satellite platforms.

It would also be valuable to investigate the generalization capabilities of RSDehamba, as the paper primarily focuses on evaluating the model on standard benchmark datasets. Assessing its performance on a wider range of real-world remote sensing scenarios could provide deeper insights into its practical applicability.

Conclusion

The RSDehamba model presents a promising approach to haze removal in remote sensing satellite imagery. By combining a state space model with a specialized Mamba U-Net architecture, the researchers have developed a lightweight and efficient dehazing network that they report outperforms existing methods. While the model has some limitations, its strong performance on benchmark datasets suggests it could be a valuable tool for improving the quality and usability of satellite imagery in a wide range of remote sensing applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

RSDehamba: Lightweight Vision Mamba for Remote Sensing Satellite Image Dehazing

Huiling Zhou, Xianhao Wu, Hongming Chen, Xiang Chen, Xin He

Remote sensing image dehazing (RSID) aims to remove nonuniform and physically irregular haze factors for high-quality image restoration. The emergence of CNNs and Transformers has taken extraordinary strides in the RSID arena. However, these methods often struggle to demonstrate the balance of adequate long-range dependency modeling and maintaining computational efficiency. To this end, we propose the first lightweight network on the mamba-based model called RSDehamba in the field of RSID. Greatly inspired by the recent rise of Selective State Space Model (SSM) for its superior performance in modeling linear complexity and remote dependencies, our designed RSDehamba integrates the SSM framework into the U-Net architecture. Specifically, we propose the Vision Dehamba Block (VDB) as the core component of the overall network, which utilizes the linear complexity of SSM to achieve the capability of global context encoding. Simultaneously, the Direction-aware Scan Module (DSM) is designed to dynamically aggregate feature exchanges over different directional domains to effectively enhance the flexibility of sensing the spatially varying distribution of haze. In this way, our RSDehamba fully demonstrates the superiority of spatial distance capture dependencies and channel information exchange for better extraction of haze features. Extensive experimental results on widely used benchmarks validate the surpassing performance of our RSDehamba against existing state-of-the-art methods.

5/17/2024

HDMba: Hyperspectral Remote Sensing Imagery Dehazing with State Space Model

Hang Fu, Genyun Sun, Yinhe Li, Jinchang Ren, Aizhu Zhang, Cheng Jing, Pedram Ghamisi

Haze contamination in hyperspectral remote sensing images (HSI) can lead to spatial visibility degradation and spectral distortion. Haze in HSI exhibits spatial irregularity and inhomogeneous spectral distribution, with few dehazing networks available. Current CNN and Transformer-based dehazing methods fail to balance global scene recovery, local detail retention, and computational efficiency. Inspired by the ability of Mamba to model long-range dependencies with linear complexity, we explore its potential for HSI dehazing and propose the first HSI Dehazing Mamba (HDMba) network. Specifically, we design a novel window selective scan module (WSSM) that captures local dependencies within windows and global correlations between windows by partitioning them. This approach improves the ability of conventional Mamba in local feature extraction. By modeling the local and global spectral-spatial information flow, we achieve a comprehensive analysis of hazy regions. The DehazeMamba layer (DML), constructed by WSSM, and residual DehazeMamba (RDM) blocks, composed of DMLs, are the core components of the HDMba framework. These components effectively characterize the complex distribution of haze in HSIs, aiding in scene reconstruction and dehazing. Experimental results on the Gaofen-5 HSI dataset demonstrate that HDMba outperforms other state-of-the-art methods in dehazing performance. The code will be available at https://github.com/RsAI-lab/HDMba.

6/11/2024

RS-Mamba for Large Remote Sensing Image Dense Prediction

Sijie Zhao, Hao Chen, Xueliang Zhang, Pengfeng Xiao, Lei Bai, Wanli Ouyang

Context modeling is critical for remote sensing image dense prediction tasks. Nowadays, the growing size of very-high-resolution (VHR) remote sensing images poses challenges in effectively modeling context. While transformer-based models possess global modeling capabilities, they encounter computational challenges when applied to large VHR images due to their quadratic complexity. The conventional practice of cropping large images into smaller patches results in a notable loss of contextual information. To address these issues, we propose the Remote Sensing Mamba (RSM) for dense prediction tasks in large VHR remote sensing images. RSM is specifically designed to capture the global context of remote sensing images with linear complexity, facilitating the effective processing of large VHR images. Considering that the land covers in remote sensing images are distributed in arbitrary spatial directions due to characteristics of remote sensing over-head imaging, the RSM incorporates an omnidirectional selective scan module to globally model the context of images in multiple directions, capturing large spatial features from various directions. Extensive experiments on semantic segmentation and change detection tasks across various land covers demonstrate the effectiveness of the proposed RSM. We designed simple yet effective models based on RSM, achieving state-of-the-art performance on dense prediction tasks in VHR remote sensing images without fancy training strategies. Leveraging the linear complexity and global modeling capabilities, RSM achieves better efficiency and accuracy than transformer-based models on large remote sensing images. Interestingly, we also demonstrated that our model generally performs better with a larger image size on dense prediction tasks. Our code is available at https://github.com/walking-shadow/Official_Remote_Sensing_Mamba.

4/11/2024

Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution

Yi Xiao, Qiangqiang Yuan, Kui Jiang, Yuzeng Chen, Qiang Zhang, Chia-Wen Lin

Recent progress in remote sensing image (RSI) super-resolution (SR) has exhibited remarkable performance using deep neural networks, e.g., Convolutional Neural Networks and Transformers. However, existing SR methods often suffer from either a limited receptive field or quadratic computational overhead, resulting in sub-optimal global representation and unacceptable computational costs in large-scale RSI. To alleviate these issues, we develop the first attempt to integrate the Vision State Space Model (Mamba) for RSI-SR, which specializes in processing large-scale RSI by capturing long-range dependency with linear complexity. To achieve better SR reconstruction, building upon Mamba, we devise a Frequency-assisted Mamba framework, dubbed FMSR, to explore the spatial and frequent correlations. In particular, our FMSR features a multi-level fusion architecture equipped with the Frequency Selection Module (FSM), Vision State Space Module (VSSM), and Hybrid Gate Module (HGM) to grasp their merits for effective spatial-frequency fusion. Considering that global and local dependencies are complementary and both beneficial for SR, we further recalibrate these multi-level features for accurate feature fusion via learnable scaling adaptors. Extensive experiments on AID, DOTA, and DIOR benchmarks demonstrate that our FMSR outperforms state-of-the-art Transformer-based methods HAT-L in terms of PSNR by 0.11 dB on average, while consuming only 28.05% and 19.08% of its memory consumption and complexity, respectively. Code will be available at https://github.com/XY-boy/FreMamba

8/30/2024