WaveMo: Learning Wavefront Modulations to See Through Scattering

Read original: arXiv:2404.07985 - Published 4/12/2024 by Mingyang Xie, Haiyun Guo, Brandon Y. Feng, Lingbo Jin, Ashok Veeraraghavan, Christopher A. Metzler

WaveMo: Learning Wavefront Modulations to See Through Scattering

Overview

This paper introduces WaveMo, a method for seeing through scattering media by learning wavefront modulations.
The key idea is to use a neural network to learn the optimal wavefront modulation required to recover a clear image from a scattered one.
The approach is demonstrated on various imaging tasks, such as seeing through fog and turbid water, with promising results.

Plain English Explanation

The paper describes a new technique called WaveMo that can help us see through things that normally make it hard to see clearly, like fog or murky water. The key insight is that by using a neural network, we can learn the best way to adjust the light waves entering the camera to undo the distorting effects of the scattering medium.

Imagine you're trying to take a picture through a window that's all fogged up. Normally, the scattered light would make the image blurry and hard to make out. But with WaveMo, the system can figure out the right way to tweak the light waves so that when they reach the camera sensor, they form a clear, undistorted image.

This could be really useful for all sorts of applications, like high-performance-real-world-optical-computing-trained or flying-photons-rendering-novel-views-propagating-light, where being able to see clearly through scattering media is important. It could also help with diffusion-based-point-cloud-super-resolution-mmwave applications that rely on imaging.

The key is that the neural network can learn the optimal wavefront modulation, which is how the light waves need to be adjusted, just by looking at example images of the distorted and clear scenes. This is a powerful approach that could have a big impact on imaging through challenging environments.

Technical Explanation

The paper introduces a deep learning approach called WaveMo that can recover clear images from ones distorted by scattering media. The key insight is that by learning the optimal wavefront modulation required to undo the effects of scattering, the system can effectively "see through" the obscuring medium.

The WaveMo architecture consists of a neural network that takes in a distorted input image and outputs the necessary wavefront modulation. This modulation is then applied to the input light field, allowing a clear image to be reconstructed. The network is trained end-to-end using pairs of distorted and reference clear images.

Experiments demonstrate the effectiveness of WaveMo on a range of scattering scenarios, including seeing through fog, turbid water, and other challenging media. The method is able to outperform previous learning-based approaches, as well as traditional techniques like rf-ulm-ultrasound-localization-microscopy-learned-from and phase retrieval.

The authors also show that the learned wavefront modulations are interpretable, providing insights into how the system "sees through" the scattering. This opens up the possibility of further optimizing the approach or applying the principles to other imaging tasks.

Critical Analysis

The WaveMo approach represents a promising step forward in the field of imaging through scattering media. By learning the optimal wavefront modulation, the system is able to effectively undo the distorting effects of the scattering environment.

One potential limitation is that the training process requires pairs of distorted and clear reference images, which may not always be available in practical scenarios. The authors mention the possibility of using synthetic data, but this could introduce other challenges.

Additionally, the current implementation is limited to 2D imaging tasks. Extending the approach to 3D or dynamic scenes could be an important area for future research, as many real-world applications would require such capabilities.

While the interpretability of the learned wavefront modulations is a valuable feature, it would be interesting to further explore the reasons behind the system's success. Understanding the underlying principles could lead to even more effective strategies for seeing through scattering.

Overall, the WaveMo method represents an important advance in the field of computational imaging, and the insights gained from this work could have far-reaching applications, from video-snapshot-compressive-imaging to rf-ulm-ultrasound-localization-microscopy-learned-from. As the authors note, further research and development in this area could lead to significant breakthroughs in our ability to see through scattering media.

Conclusion

The WaveMo paper introduces a deep learning-based approach for recovering clear images from ones distorted by scattering. By learning the optimal wavefront modulation, the system is able to effectively undo the effects of the scattering medium, enabling improved imaging performance in challenging environments.

The key innovation is the use of a neural network to determine the necessary wavefront adjustment, which is then applied to the input light field. This data-driven approach outperforms traditional techniques and opens up new possibilities for computational imaging.

While the current work is focused on 2D imaging tasks, the insights gained from this research could have far-reaching implications, from high-performance-real-world-optical-computing-trained to diffusion-based-point-cloud-super-resolution-mmwave applications. As the field of computational imaging continues to advance, techniques like WaveMo will play an increasingly important role in our ability to see through the physical world's complexities.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

WaveMo: Learning Wavefront Modulations to See Through Scattering

Mingyang Xie, Haiyun Guo, Brandon Y. Feng, Lingbo Jin, Ashok Veeraraghavan, Christopher A. Metzler

Imaging through scattering media is a fundamental and pervasive challenge in fields ranging from medical diagnostics to astronomy. A promising strategy to overcome this challenge is wavefront modulation, which induces measurement diversity during image acquisition. Despite its importance, designing optimal wavefront modulations to image through scattering remains under-explored. This paper introduces a novel learning-based framework to address the gap. Our approach jointly optimizes wavefront modulations and a computationally lightweight feedforward proxy reconstruction network. This network is trained to recover scenes obscured by scattering, using measurements that are modified by these modulations. The learned modulations produced by our framework generalize effectively to unseen scattering scenarios and exhibit remarkable versatility. During deployment, the learned modulations can be decoupled from the proxy network to augment other more computationally expensive restoration algorithms. Through extensive experiments, we demonstrate our approach significantly advances the state of the art in imaging through scattering media. Our project webpage is at https://wavemo-2024.github.io/.

4/12/2024

RF-ULM: Ultrasound Localization Microscopy Learned from Radio-Frequency Wavefronts

Christopher Hahne, Georges Chabouh, Arthur Chavignon, Olivier Couture, Raphael Sznitman

In Ultrasound Localization Microscopy (ULM), achieving high-resolution images relies on the precise localization of contrast agent particles across a series of beamformed frames. However, our study uncovers an enormous potential: The process of delay-and-sum beamforming leads to an irreversible reduction of Radio-Frequency (RF) channel data, while its implications for localization remain largely unexplored. The rich contextual information embedded within RF wavefronts, including their hyperbolic shape and phase, offers great promise for guiding Deep Neural Networks (DNNs) in challenging localization scenarios. To fully exploit this data, we propose to directly localize scatterers in RF channel data. Our approach involves a custom super-resolution DNN using learned feature channel shuffling, non-maximum suppression, and a semi-global convolutional block for reliable and accurate wavefront localization. Additionally, we introduce a geometric point transformation that facilitates seamless mapping to the B-mode coordinate space. To understand the impact of beamforming on ULM, we validate the effectiveness of our method by conducting an extensive comparison with State-Of-The-Art (SOTA) techniques. We present the inaugural in vivo results from a wavefront-localizing DNN, highlighting its real-world practicality. Our findings show that RF-ULM bridges the domain shift between synthetic and real datasets, offering a considerable advantage in terms of precision and complexity. To enable the broader research community to benefit from our findings, our code and the associated SOTA methods are made available at https://github.com/hahnec/rf-ulm.

4/9/2024

👨‍🏫

Scattering-induced entropy boost for highly-compressed optical sensing and encryption

Xinrui Zhan, Xuyang Chang, Daoyu Li, Rong Yan, Yinuo Zhang, Liheng Bian

Image sensing often relies on a high-quality machine vision system with a large field of view and high resolution. It requires fine imaging optics, has high computational costs, and requires a large communication bandwidth between image sensors and computing units. In this paper, we propose a novel image-free sensing framework for resource-efficient image classification, where the required number of measurements can be reduced by up to two orders of magnitude. In the proposed framework for single-pixel detection, the optical field for a target is first scattered by an optical diffuser and then two-dimensionally modulated by a spatial light modulator. The optical diffuser simultaneously serves as a compressor and an encryptor for the target information, effectively narrowing the field of view and improving the system's security. The one-dimensional sequence of intensity values, which is measured with time-varying patterns on the spatial light modulator, is then used to extract semantic information based on end-to-end deep learning. The proposed sensing framework is shown to obtain over a 95% accuracy at sampling rates of 1% and 5% for classification on the MNIST dataset and the recognition of Chinese license plates, respectively, and the framework is up to 24% more efficient than the approach without an optical diffuser. The proposed framework represents a significant breakthrough in high-throughput machine intelligence for scene analysis with low bandwidth, low costs, and strong encryption.

9/9/2024

Back-Projection Diffusion: Solving the Wideband Inverse Scattering Problem with Diffusion Models

Borong Zhang, Mart'in Guerra, Qin Li, Leonardo Zepeda-N'u~nez

We present Wideband back-projection diffusion, an end-to-end probabilistic framework for approximating the posterior distribution induced by the inverse scattering map from wideband scattering data. This framework leverages conditional diffusion models coupled with the underlying physics of wave-propagation and symmetries in the problem, to produce highly accurate reconstructions. The framework introduces a factorization of the score function into a physics-based latent representation inspired by the filtered back-propagation formula and a conditional score function conditioned on this latent representation. These two steps are also constrained to obey symmetries in the formulation while being amenable to compression by imposing the rank structure found in the filtered back-projection formula. As a result, empirically, our framework is able to provide sharp reconstructions effortlessly, even recovering sub-Nyquist features in the multiple-scattering regime. It has low-sample and computational complexity, its number of parameters scales sub-linearly with the target resolution, and it has stable training dynamics.

8/12/2024