Scattering-induced entropy boost for highly-compressed optical sensing and encryption

Read original: arXiv:2301.06084 - Published 9/9/2024 by Xinrui Zhan, Xuyang Chang, Daoyu Li, Rong Yan, Yinuo Zhang, Liheng Bian

👨‍🏫

Overview

Image sensing often relies on high-quality machine vision systems with large fields of view and high resolutions.
This requires fine imaging optics, high computational costs, and large communication bandwidth between sensors and computing units.
This paper proposes a novel image-free sensing framework for resource-efficient image classification.
The proposed framework can reduce the required number of measurements by up to two orders of magnitude.

Plain English Explanation

The paper describes a new way to classify images without actually capturing a full high-resolution image. This can save a lot of resources like computing power and data transfer bandwidth.

In the proposed system, the light from the target object is first scattered by an optical diffuser. This diffuser acts as a kind of "compressor" and "encryptor" for the target information, narrowing the field of view and improving security.

The scattered light is then modulated by a spatial light modulator using time-varying patterns. A one-dimensional sequence of intensity values measured from this modulated light is used to extract semantic information using deep learning.

This approach is shown to achieve over 95% accuracy for classifying handwritten digits and recognizing Chinese license plates, using only 1% and 5% of the measurements required by a traditional approach without the optical diffuser. The proposed framework is up to 24% more efficient.

Technical Explanation

The core of the proposed image-free sensing framework is the use of an optical diffuser to compress and encrypt the target information before modulating it with a spatial light modulator.

The optical diffuser simultaneously serves as a compressor and an encryptor for the target information, effectively narrowing the field of view and improving the system's security. The one-dimensional sequence of intensity values, measured with time-varying patterns on the spatial light modulator, is then used to extract semantic information based on end-to-end deep learning.

The experiments show this framework achieving over 95% accuracy for classifying handwritten digits from the MNIST dataset and recognizing Chinese license plates, using only 1% and 5% of the measurements required by a traditional approach without the optical diffuser. The proposed framework is up to 24% more efficient than the approach without the diffuser.

Critical Analysis

The paper presents a novel and promising approach to image classification that significantly reduces the required resources. However, some potential limitations and areas for further research are:

The experiments are limited to relatively simple image classification tasks. It's unclear how well the framework would scale to more complex computer vision problems.
The security benefits of the optical diffuser encryption are not rigorously analyzed. More work may be needed to fully understand the security properties and potential vulnerabilities.
The computational complexity and training requirements of the end-to-end deep learning approach are not discussed in detail. This will be an important consideration for real-world deployments.

Overall, the proposed framework represents an interesting and impactful contribution to the field of resource-efficient machine intelligence. Further research and development will be needed to fully realize its potential.

Conclusion

This paper introduces a novel image-free sensing framework that can dramatically reduce the resources required for image classification tasks. By using an optical diffuser to compress and encrypt the target information, the system can achieve high accuracy with only a fraction of the measurements needed by traditional approaches.

The proposed framework has the potential to enable high-throughput machine intelligence for scene analysis with low bandwidth, low costs, and strong encryption. While further research is needed to address certain limitations, this work represents a significant breakthrough in the field of resource-efficient computer vision.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

👨‍🏫

Scattering-induced entropy boost for highly-compressed optical sensing and encryption

Xinrui Zhan, Xuyang Chang, Daoyu Li, Rong Yan, Yinuo Zhang, Liheng Bian

Image sensing often relies on a high-quality machine vision system with a large field of view and high resolution. It requires fine imaging optics, has high computational costs, and requires a large communication bandwidth between image sensors and computing units. In this paper, we propose a novel image-free sensing framework for resource-efficient image classification, where the required number of measurements can be reduced by up to two orders of magnitude. In the proposed framework for single-pixel detection, the optical field for a target is first scattered by an optical diffuser and then two-dimensionally modulated by a spatial light modulator. The optical diffuser simultaneously serves as a compressor and an encryptor for the target information, effectively narrowing the field of view and improving the system's security. The one-dimensional sequence of intensity values, which is measured with time-varying patterns on the spatial light modulator, is then used to extract semantic information based on end-to-end deep learning. The proposed sensing framework is shown to obtain over a 95% accuracy at sampling rates of 1% and 5% for classification on the MNIST dataset and the recognition of Chinese license plates, respectively, and the framework is up to 24% more efficient than the approach without an optical diffuser. The proposed framework represents a significant breakthrough in high-throughput machine intelligence for scene analysis with low bandwidth, low costs, and strong encryption.

9/9/2024

🧠

Pixel super-resolved lensless on-chip sensor with scattering multiplexing

Xuyang Chang, Shaowei Jiang, Yongcun Hu, Liheng Bian

Lensless on-chip microscopy has shown great potential for biomedical imaging due to its large-area and high-throughput imaging capabilities. By combining the pixel super-resolution (PSR) technique, it can improve the resolution beyond the limit of the imaging detector. However, existing PSR techniques are restricted to the feature size and crosstalk of modulation components (such as spatial light modulator), which cannot efficiently encode target information. Besides, the reconstruction algorithms suffer from the trade-off between image quality, reconstruction resolution and computational efficiency. In this work, we constructed a novel integrated lensless on-chip sensor via scattering multiplexing, and reported a robust PSR algorithm for sample reconstruction. The sensor employed a scattering layer as a modulator, which was permanently integrated with the detector. Benefiting from the high-degree-of-freedom reconstruction of the scattering layer, we realized fine wavefront modulation with a small feature size. The integration engineering avoided repetitious calibration and reduce the measurement complexity. The reported PSR algorithm combines both model-driven and data-driven strategies to efficiently exploit the high-frequency information from the fine modulation. A series of experiments validated that the reported sensor provides a low-cost solution for large-scale microscopic imaging, with significant advantages in resolution, image contrast and noise robustness.

9/6/2024

Deep Optics for Video Snapshot Compressive Imaging

Ping Wang, Lishun Wang, Xin Yuan

Video snapshot compressive imaging (SCI) aims to capture a sequence of video frames with only a single shot of a 2D detector, whose backbones rest in optical modulation patterns (also known as masks) and a computational reconstruction algorithm. Advanced deep learning algorithms and mature hardware are putting video SCI into practical applications. Yet, there are two clouds in the sunshine of SCI: i) low dynamic range as a victim of high temporal multiplexing, and ii) existing deep learning algorithms' degradation on real system. To address these challenges, this paper presents a deep optics framework to jointly optimize masks and a reconstruction network. Specifically, we first propose a new type of structural mask to realize motion-aware and full-dynamic-range measurement. Considering the motion awareness property in measurement domain, we develop an efficient network for video SCI reconstruction using Transformer to capture long-term temporal dependencies, dubbed Res2former. Moreover, sensor response is introduced into the forward model of video SCI to guarantee end-to-end model training close to real system. Finally, we implement the learned structural masks on a digital micro-mirror device. Experimental results on synthetic and real data validate the effectiveness of the proposed framework. We believe this is a milestone for real-world video SCI. The source code and data are available at https://github.com/pwangcs/DeepOpticsSCI.

4/9/2024

🎯

Multidimensional Compressed Sensing for Spectral Light Field Imaging

Wen Cao, Ehsan Miandji, Jonas Unger

This paper considers a compressive multi-spectral light field camera model that utilizes a one-hot spectralcoded mask and a microlens array to capture spatial, angular, and spectral information using a single monochrome sensor. We propose a model that employs compressed sensing techniques to reconstruct the complete multi-spectral light field from undersampled measurements. Unlike previous work where a light field is vectorized to a 1D signal, our method employs a 5D basis and a novel 5D measurement model, hence, matching the intrinsic dimensionality of multispectral light fields. We mathematically and empirically show the equivalence of 5D and 1D sensing models, and most importantly that the 5D framework achieves orders of magnitude faster reconstruction while requiring a small fraction of the memory. Moreover, our new multidimensional sensing model opens new research directions for designing efficient visual data acquisition algorithms and hardware.

5/2/2024