Optimizing Retinal Prosthetic Stimuli with Conditional Invertible Neural Networks

Read original: arXiv:2403.04884 - Published 7/16/2024 by Yuli Wu, Julian Wittmann, Peter Walter, Johannes Stegmaier

Overview

This paper explores the use of conditional invertible neural networks (cINNs) to optimize the stimuli of retinal prostheses, with the aim of improving the quality of vision that retinal implants provide.
The researchers use a normalizing flow-based invertible network as a surrogate for a computational model of the visual system, and exploit its invertibility to encode camera input into electrical stimuli on the electrode array.
In simulation, the approach yields better visual reconstruction quality than simpler baselines, which suggests it could improve retinal prostheses for people who have lost vision to retinal diseases or injuries.

Plain English Explanation

Retinal prostheses, or artificial retinas, are devices that can be implanted in the eye to restore some vision for people who have lost their sight due to conditions like macular degeneration or retinitis pigmentosa. These prostheses work by electrically stimulating the remaining healthy cells in the retina, which then send signals to the brain that are interpreted as visual information.

However, finding the best electrical stimulation patterns to provide the clearest and most useful vision is a challenging task. This paper describes a new approach using a type of artificial intelligence called a conditional invertible neural network. This AI model can accurately predict how the retina will respond to different electrical stimulation patterns.

By using this predictive model, the researchers were able to figure out the specific stimulation patterns that would produce the most desirable visual outcomes for the patient. This is similar to how an inverse rendering technique can reconstruct the original 3D scene from a 2D image.

Optimizing the retinal stimulation in this way could lead to significant improvements in the quality of vision provided by retinal prostheses. Patients may be able to see more clearly and have an easier time navigating their environment. This advance could have a major positive impact on the lives of people with severe vision loss.

Technical Explanation

The key innovation in this paper is the use of a conditional invertible neural network (cINN) to model the relationship between electrical stimulation patterns applied to the retina and the resulting neural responses.

Invertible neural networks are models whose mapping can be computed in both the forward and the reverse direction. According to the paper's abstract, this invertibility lets the network act as a surrogate for the computational model of the visual system while also encoding input camera signals into optimized electrical stimuli on the electrode array.
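To make the idea of running a network in both directions concrete, here is a minimal sketch of a single conditional affine coupling layer, a common building block of flow-based invertible networks. It is an illustration only: the class name `ConditionalAffineCoupling`, the tiny linear subnetwork, and all dimensions are assumptions and are not taken from the paper.

```python
import numpy as np


class ConditionalAffineCoupling:
    """One affine coupling layer whose scale and shift depend on a condition.

    Hypothetical illustration: the linear 'subnetwork' and all sizes are
    assumptions, not the architecture used in the paper.
    """

    def __init__(self, dim, cond_dim, seed=0):
        assert dim % 2 == 0, "dim must be even so the input splits in half"
        rng = np.random.default_rng(seed)
        self.half = dim // 2
        # Linear map from [first half, condition] to (log-scale, shift).
        self.W = rng.normal(scale=0.1, size=(self.half + cond_dim, 2 * self.half))
        self.b = np.zeros(2 * self.half)

    def _scale_shift(self, x1, cond):
        h = np.concatenate([x1, cond], axis=-1) @ self.W + self.b
        log_s, t = h[..., : self.half], h[..., self.half :]
        return np.tanh(log_s), t  # tanh keeps the log-scale bounded

    def forward(self, x, cond):
        # The first half passes through unchanged; the second half is
        # scaled and shifted using values computed from the first half.
        x1, x2 = x[..., : self.half], x[..., self.half :]
        log_s, t = self._scale_shift(x1, cond)
        return np.concatenate([x1, x2 * np.exp(log_s) + t], axis=-1)

    def inverse(self, y, cond):
        # Because y1 == x1, the same scale and shift can be recomputed
        # and undone exactly, without inverting the subnetwork itself.
        y1, y2 = y[..., : self.half], y[..., self.half :]
        log_s, t = self._scale_shift(y1, cond)
        return np.concatenate([y1, (y2 - t) * np.exp(-log_s)], axis=-1)


layer = ConditionalAffineCoupling(dim=8, cond_dim=3)
rng = np.random.default_rng(1)
x = rng.normal(size=(4, 8))      # batch of 4 inputs
cond = rng.normal(size=(4, 3))   # one condition vector per input
y = layer.forward(x, cond)
x_rec = layer.inverse(y, cond)
max_err = float(np.max(np.abs(x - x_rec)))
print("max round-trip error:", max_err)
```

The design point is that the inverse never has to invert the subnetwork: it only recomputes the scale and shift from the untouched half and undoes them, which is why the round trip is exact up to floating-point error.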

According to the abstract, the networks are optimized in an unsupervised manner and evaluated with a physiologically validated simulation tool rather than with data from implanted devices. The goal is to produce stimuli whose simulated visual percepts reconstruct the input image as faithfully as possible.

This approach is similar to how conditional invertible networks have been used to efficiently reconstruct sound fields from limited sensor data. By leveraging the invertibility of the neural network, the researchers were able to find the optimal stimuli without having to exhaustively search all possibilities.
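The contrast between inverting a model and searching over stimuli can be illustrated with a deliberately simple stand-in. In the sketch below, a small invertible linear map plays the role of the stimulus-to-response model; this toy map, the function `respond`, and all sizes are assumptions made for illustration and are far simpler than the networks in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 6  # hypothetical number of electrodes

# Toy invertible "stimulus -> response" map (an assumption, not the
# paper's model): the identity plus a small random perturbation.
A = np.eye(n) + 0.1 * rng.normal(size=(n, n))


def respond(stimulus):
    """Toy forward model: response produced by a stimulus vector."""
    return A @ stimulus


target = rng.normal(size=n)  # desired response

# Inverse pass: a single linear solve recovers the stimulus directly.
stimulus = np.linalg.solve(A, target)
residual = float(np.linalg.norm(respond(stimulus) - target))

# For comparison, a brute-force grid with only 5 amplitude levels per
# electrode would already require this many forward evaluations:
levels = 5
grid_size = levels ** n

print("residual of inverted stimulus:", residual)
print("grid points for exhaustive search:", grid_size)
```

Even in this small example the grid grows exponentially with the number of electrodes, while the inverse costs one solve. An invertible network offers the same kind of shortcut for a nonlinear model.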

The results showed that the flow-based invertible network and its conditional extension achieved better visual reconstruction quality, across several metrics, than trivial downsampling, linear models, and feed-forward convolutional neural networks. This suggests that the approach could enhance the capabilities of retinal prostheses, although the comparison was made in simulation.
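The paper's abstract lists trivial downsampling among the baselines it compares against. The sketch below shows what such a baseline might look like: an image is average-pooled down to an electrode grid, and the loss of detail is measured by upsampling again. The grid size, the helper names, and the use of mean squared error are assumptions for illustration; the paper evaluates with a physiologically validated simulation tool and its own metrics.

```python
import numpy as np


def downsample_to_grid(image, grid_shape):
    """Average-pool an image to one value per (hypothetical) electrode."""
    h, w = image.shape
    gh, gw = grid_shape
    assert h % gh == 0 and w % gw == 0, "image must tile evenly into the grid"
    blocks = image.reshape(gh, h // gh, gw, w // gw)
    return blocks.mean(axis=(1, 3))


def upsample_nearest(grid, image_shape):
    """Repeat each electrode value over the block of pixels it covers."""
    gh, gw = grid.shape
    h, w = image_shape
    return np.repeat(np.repeat(grid, h // gh, axis=0), w // gw, axis=1)


def mse(a, b):
    return float(np.mean((a - b) ** 2))


# A coarse vertical edge that happens to align with the 4x4 grid.
edge = np.zeros((8, 8))
edge[:, 4:] = 1.0

# A single-pixel checkerboard: detail finer than the grid can carry.
checker = (np.indices((8, 8)).sum(axis=0) % 2).astype(float)

stim = downsample_to_grid(edge, (4, 4))
edge_error = mse(edge, upsample_nearest(stim, edge.shape))
checker_error = mse(
    checker, upsample_nearest(downsample_to_grid(checker, (4, 4)), checker.shape)
)

print("edge error:", edge_error)        # 0.0
print("checker error:", checker_error)  # 0.25
```

The coarse edge survives downsampling, while the fine checkerboard collapses to uniform gray. That loss of fine detail is the kind of limitation a learned encoder is meant to reduce.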

Critical Analysis

One limitation of the research is that it was based on simulated retinal responses rather than real-world data from implanted devices. While simulation can be a useful tool, it is important to validate the approach using actual retinal prosthesis data to ensure the findings translate to real-world conditions.

Additionally, the paper does not address the potential challenges of implementing this optimization technique in a clinical setting. Issues around computational complexity, inference speed, and robustness to noise or variations in individual patients' anatomy and physiology would need to be carefully considered.

Further research is also needed to fully understand the perceptual experience of patients using retinal prostheses optimized in this way. While the model aims to produce natural-looking visual percepts, the subjective quality of vision may depend on factors beyond just the objective measures used in this study.

Overall, this paper presents a promising new approach to improving retinal prostheses, but additional work is needed to translate the findings into practical clinical applications that can meaningfully enhance the lives of people with severe vision loss.

Conclusion

This research demonstrates the potential of using conditional invertible neural networks to optimize electrical stimulation patterns for retinal prostheses. By modeling the relationship between stimuli and the resulting percepts with an invertible network, the proposed approach can derive stimuli that yield better simulated visual reconstructions than the baselines it was compared against.

If validated and refined further, this AI-powered optimization technique could lead to significant improvements in the functionality and user experience of retinal implants. This could in turn greatly enhance the quality of life for people suffering from debilitating retinal diseases or injuries. The findings highlight the powerful role that machine learning can play in advancing neural engineering and assistive technologies for vision restoration.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Optimizing Retinal Prosthetic Stimuli with Conditional Invertible Neural Networks

Yuli Wu, Julian Wittmann, Peter Walter, Johannes Stegmaier

Implantable retinal prostheses offer a promising solution to restore partial vision by circumventing damaged photoreceptor cells in the retina and directly stimulating the remaining functional retinal cells. However, the information transmission between the camera and retinal cells is often limited by the low resolution of the electrode array and the lack of specificity for different ganglion cell types, resulting in suboptimal stimulations. In this work, we propose to utilize normalizing flow-based conditional invertible neural networks to optimize retinal implant stimulation in an unsupervised manner. The invertibility of these networks allows us to use them as a surrogate for the computational model of the visual system, while also encoding input camera signals into optimized electrical stimuli on the electrode array. Compared to other methods, such as trivial downsampling, linear models, and feed-forward convolutional neural networks, the flow-based invertible neural network and its conditional extension yield better visual reconstruction qualities w.r.t. various metrics using a physiologically validated simulation tool.

7/16/2024

Network Inversion of Convolutional Neural Nets

Pirzada Suhail, Amit Sethi

Neural networks have emerged as powerful tools across various applications, yet their decision-making process often remains opaque, leading to them being perceived as black boxes. This opacity raises concerns about their interpretability and reliability, especially in safety-critical scenarios. Network inversion techniques offer a solution by allowing us to peek inside these black boxes, revealing the features and patterns learned by the networks behind their decision-making processes and thereby provide valuable insights into how neural networks arrive at their conclusions, making them more interpretable and trustworthy. This paper presents a simple yet effective approach to network inversion using a carefully conditioned generator that learns the data distribution in the input space of the trained neural network, enabling the reconstruction of inputs that would most likely lead to the desired outputs. To capture the diversity in the input space for a given output, instead of simply revealing the conditioning labels to the generator, we hideously encode the conditioning label information into vectors, further exemplified by heavy dropout in the generation process and minimisation of cosine similarity between the features corresponding to the generated images. The paper concludes with immediate applications of Network Inversion including in interpretability, explainability and generation of adversarial samples.

7/26/2024

Generative artificial intelligence in ophthalmology: multimodal retinal images for the diagnosis of Alzheimer's disease with convolutional neural networks

I. R. Slootweg, M. Thach, K. R. Curro-Tafili, F. D. Verbraak, F. H. Bouwman, Y. A. L. Pijnenburg, J. F. Boer, J. H. P. de Kwisthout, L. Bagheriye, P. J. González

Background/Aim. This study aims to predict Amyloid Positron Emission Tomography (AmyloidPET) status with multimodal retinal imaging and convolutional neural networks (CNNs) and to improve the performance through pretraining with synthetic data. Methods. Fundus autofluorescence, optical coherence tomography (OCT), and OCT angiography images from 328 eyes of 59 AmyloidPET positive subjects and 108 AmyloidPET negative subjects were used for classification. Denoising Diffusion Probabilistic Models (DDPMs) were trained to generate synthetic images and unimodal CNNs were pretrained on synthetic data and finetuned on real data or trained solely on real data. Multimodal classifiers were developed to combine predictions of the four unimodal CNNs with patient metadata. Class activation maps of the unimodal classifiers provided insight into the network's attention to inputs. Results. DDPMs generated diverse, realistic images without memorization. Pretraining unimodal CNNs with synthetic data improved AUPR at most from 0.350 to 0.579. Integration of metadata in multimodal CNNs improved AUPR from 0.486 to 0.634, which was the best overall classifier. Class activation maps highlighted relevant retinal regions which correlated with AD. Conclusion. Our method for generating and leveraging synthetic data has the potential to improve AmyloidPET prediction from multimodal retinal imaging. A DDPM can generate realistic and unique multimodal synthetic retinal images. Our best performing unimodal and multimodal classifiers were not pretrained on synthetic data, however pretraining with synthetic data slightly improved classification performance for two out of the four modalities.

6/27/2024

Tilt your Head: Activating the Hidden Spatial-Invariance of Classifiers

Johann Schmidt, Sebastian Stober

Deep neural networks are applied in more and more areas of everyday life. However, they still lack essential abilities, such as robustly dealing with spatially transformed input signals. Approaches to mitigate this severe robustness issue are limited to two pathways: Either models are implicitly regularised by increased sample variability (data augmentation) or explicitly constrained by hard-coded inductive biases. The limiting factor of the former is the size of the data space, which renders sufficient sample coverage intractable. The latter is limited by the engineering effort required to develop such inductive biases for every possible scenario. Instead, we take inspiration from human behaviour, where percepts are modified by mental or physical actions during inference. We propose a novel technique to emulate such an inference process for neural nets. This is achieved by traversing a sparsified inverse transformation tree during inference using parallel energy-based evaluations. Our proposed inference algorithm, called Inverse Transformation Search (ITS), is model-agnostic and equips the model with zero-shot pseudo-invariance to spatially transformed inputs. We evaluated our method on several benchmark datasets, including a synthesised ImageNet test set. ITS outperforms the utilised baselines on all zero-shot test scenarios.

5/28/2024