Search | arXiv e-print repository

Efficient Spatial Estimation of Perceptual Thresholds for Retinal Implants via Gaussian Process Regression

Authors: Roksana Sadeghi, Michael Beyeler

Abstract: Retinal prostheses restore vision by electrically stimulating surviving neurons, but calibrating perceptual thresholds (i.e., the minimum stimulus intensity required for perception) remains a time-intensive challenge, especially for high-electrode-count devices. Since neighboring electrodes exhibit spatial correlations, we propose a Gaussian Process Regression (GPR) framework to predict thresholds… ▽ More Retinal prostheses restore vision by electrically stimulating surviving neurons, but calibrating perceptual thresholds (i.e., the minimum stimulus intensity required for perception) remains a time-intensive challenge, especially for high-electrode-count devices. Since neighboring electrodes exhibit spatial correlations, we propose a Gaussian Process Regression (GPR) framework to predict thresholds at unsampled locations while leveraging uncertainty estimates to guide adaptive sampling. Using perceptual threshold data from four Argus II users, we show that GPR with a Matern kernel provides more accurate threshold predictions than a Radial Basis Function (RBF) kernel (p < .001, Wilcoxon signed-rank test). In addition, spatially optimized sampling yielded lower prediction error than uniform random sampling for Participants 1 and 3 (p < .05). While adaptive sampling dynamically selects electrodes based on model uncertainty, its accuracy gains over spatial sampling were not statistically significant (p > .05), though it approached significance for Participant 1 (p = .074). These findings establish GPR with spatial sampling as a scalable, efficient approach to retinal prosthesis calibration, minimizing patient burden while maintaining predictive accuracy. More broadly, this framework offers a generalizable solution for adaptive calibration in neuroprosthetic devices with spatially structured stimulation thresholds, paving the way for faster, more personalized system fitting in future high-channel-count implants. Clinical relevance: Gaussian Progress Regression offers a scalable path toward faster, more personalized calibration procedures for future high-channel-count neuroprosthetic devices. △ Less

Submitted 28 April, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

arXiv:2403.12990 [pdf, other]

Beyond Sight: Probing Alignment Between Image Models and Blind V1

Authors: Jacob Granley, Galen Pogoncheff, Alfonso Rodil, Leili Soo, Lily Marie Turkstra, Lucas Gil Nadolskis, Arantxa Alfaro Saez, Cristina Soto Sanchez, Eduardo Fernandez Jover, Michael Beyeler

Abstract: Neural activity in the visual cortex of blind humans persists in the absence of visual stimuli. However, little is known about the preservation of visual representation capacity in these cortical regions, which could have significant implications for neural interfaces such as visual prostheses. In this work, we present a series of analyses on the shared representations between evoked neural activi… ▽ More Neural activity in the visual cortex of blind humans persists in the absence of visual stimuli. However, little is known about the preservation of visual representation capacity in these cortical regions, which could have significant implications for neural interfaces such as visual prostheses. In this work, we present a series of analyses on the shared representations between evoked neural activity in the primary visual cortex (V1) of a blind human with an intracortical visual prosthesis, and latent visual representations computed in deep neural networks (DNNs). In the absence of natural visual input, we examine two alternative forms of inducing neural activity: electrical stimulation and mental imagery. We first quantitatively demonstrate that latent DNN activations are aligned with neural activity measured in blind V1. On average, DNNs with higher ImageNet accuracy or higher sighted primate neural predictivity are more predictive of blind V1 activity. We further probe blind V1 alignment in ResNet-50 and propose a proof-of-concept approach towards interpretability of blind V1 neurons. The results of these studies suggest the presence of some form of natural visual processing in blind V1 during electrically evoked visual perception and present unique directions in mechanistically understanding and interfacing with blind V1. △ Less

Submitted 4 March, 2024; originally announced March 2024.

Comments: Accepted preprint version

arXiv:2306.13104 [pdf, other]

Human-in-the-Loop Optimization for Deep Stimulus Encoding in Visual Prostheses

Authors: Jacob Granley, Tristan Fauvel, Matthew Chalk, Michael Beyeler

Abstract: Neuroprostheses show potential in restoring lost sensory function and enhancing human capabilities, but the sensations produced by current devices often seem unnatural or distorted. Exact placement of implants and differences in individual perception lead to significant variations in stimulus response, making personalized stimulus optimization a key challenge. Bayesian optimization could be used t… ▽ More Neuroprostheses show potential in restoring lost sensory function and enhancing human capabilities, but the sensations produced by current devices often seem unnatural or distorted. Exact placement of implants and differences in individual perception lead to significant variations in stimulus response, making personalized stimulus optimization a key challenge. Bayesian optimization could be used to optimize patient-specific stimulation parameters with limited noisy observations, but is not feasible for high-dimensional stimuli. Alternatively, deep learning models can optimize stimulus encoding strategies, but typically assume perfect knowledge of patient-specific variations. Here we propose a novel, practically feasible approach that overcomes both of these fundamental limitations. First, a deep encoder network is trained to produce optimal stimuli for any individual patient by inverting a forward model mapping electrical stimuli to visual percepts. Second, a preferential Bayesian optimization strategy utilizes this encoder to optimize patient-specific parameters for a new patient, using a minimal number of pairwise comparisons between candidate stimuli. We demonstrate the viability of this approach on a novel, state-of-the-art visual prosthesis model. We show that our approach quickly learns a personalized stimulus encoder, leads to dramatic improvements in the quality of restored vision, and is robust to noisy patient feedback and misspecifications in the underlying forward model. Overall, our results suggest that combining the strengths of deep learning and Bayesian optimization could significantly improve the perceptual experience of patients fitted with visual prostheses and may prove a viable solution for a range of neuroprosthetic technologies. △ Less

Submitted 27 October, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

Comments: Camera ready version

ACM Class: I.2.10

arXiv:2305.11275 [pdf, other]

Explaining V1 Properties with a Biologically Constrained Deep Learning Architecture

Authors: Galen Pogoncheff, Jacob Granley, Michael Beyeler

Abstract: Convolutional neural networks (CNNs) have recently emerged as promising models of the ventral visual stream, despite their lack of biological specificity. While current state-of-the-art models of the primary visual cortex (V1) have surfaced from training with adversarial examples and extensively augmented data, these models are still unable to explain key neural properties observed in V1 that aris… ▽ More Convolutional neural networks (CNNs) have recently emerged as promising models of the ventral visual stream, despite their lack of biological specificity. While current state-of-the-art models of the primary visual cortex (V1) have surfaced from training with adversarial examples and extensively augmented data, these models are still unable to explain key neural properties observed in V1 that arise from biological circuitry. To address this gap, we systematically incorporated neuroscience-derived architectural components into CNNs to identify a set of mechanisms and architectures that comprehensively explain neural activity in V1. We show drastic improvements in model-V1 alignment driven by the integration of architectural components that simulate center-surround antagonism, local receptive fields, tuned normalization, and cortical magnification. Upon enhancing task-driven CNNs with a collection of these specialized components, we uncover models with latent representations that yield state-of-the-art explanation of V1 neural activity and tuning properties. Our results highlight an important advancement in the field of NeuroAI, as we systematically establish a set of architectural components that contribute to unprecedented explanation of V1. The neuroscience insights that could be gleaned from increasingly accurate in-silico models of the brain have the potential to greatly advance the fields of both neuroscience and artificial intelligence. △ Less

Submitted 25 May, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

arXiv:2212.00081 [pdf, other]

Efficient multi-scale representation of visual objects using a biologically plausible spike-latency code and winner-take-all inhibition

Authors: Melani Sanchez-Garcia, Tushar Chauhan, Benoit R. Cottereau, Michael Beyeler

Abstract: Deep neural networks have surpassed human performance in key visual challenges such as object recognition, but require a large amount of energy, computation, and memory. In contrast, spiking neural networks (SNNs) have the potential to improve both the efficiency and biological plausibility of object recognition systems. Here we present a SNN model that uses spike-latency coding and winner-take-al… ▽ More Deep neural networks have surpassed human performance in key visual challenges such as object recognition, but require a large amount of energy, computation, and memory. In contrast, spiking neural networks (SNNs) have the potential to improve both the efficiency and biological plausibility of object recognition systems. Here we present a SNN model that uses spike-latency coding and winner-take-all inhibition (WTA-I) to efficiently represent visual stimuli using multi-scale parallel processing. Mimicking neuronal response properties in early visual cortex, images were preprocessed with three different spatial frequency (SF) channels, before they were fed to a layer of spiking neurons whose synaptic weights were updated using spike-timing-dependent-plasticity (STDP). We investigate how the quality of the represented objects changes under different SF bands and WTA-I schemes. We demonstrate that a network of 200 spiking neurons tuned to three SFs can efficiently represent objects with as little as 15 spikes per neuron. Studying how core object recognition may be implemented using biologically plausible learning rules in SNNs may not only further our understanding of the brain, but also lead to novel and efficient artificial vision systems. △ Less

Submitted 30 November, 2022; originally announced December 2022.

Comments: MSG and TC are co-first authors. BRC and MB are co-last authors. arXiv admin note: text overlap with arXiv:2205.10338

arXiv:2209.13561 [pdf, other]

Adapting Brain-Like Neural Networks for Modeling Cortical Visual Prostheses

Authors: Jacob Granley, Alexander Riedel, Michael Beyeler

Abstract: Cortical prostheses are devices implanted in the visual cortex that attempt to restore lost vision by electrically stimulating neurons. Currently, the vision provided by these devices is limited, and accurately predicting the visual percepts resulting from stimulation is an open challenge. We propose to address this challenge by utilizing 'brain-like' convolutional neural networks (CNNs), which ha… ▽ More Cortical prostheses are devices implanted in the visual cortex that attempt to restore lost vision by electrically stimulating neurons. Currently, the vision provided by these devices is limited, and accurately predicting the visual percepts resulting from stimulation is an open challenge. We propose to address this challenge by utilizing 'brain-like' convolutional neural networks (CNNs), which have emerged as promising models of the visual system. To investigate the feasibility of adapting brain-like CNNs for modeling visual prostheses, we developed a proof-of-concept model to predict the perceptions resulting from electrical stimulation. We show that a neurologically-inspired decoding of CNN activations produces qualitatively accurate phosphenes, comparable to phosphenes reported by real patients. Overall, this is an essential first step towards building brain-like models of electrical stimulation, which may not just improve the quality of vision provided by cortical prostheses but could also further our understanding of the neural code of vision. △ Less

Submitted 27 September, 2022; originally announced September 2022.

arXiv:2203.02493 [pdf, other]

Greedy Optimization of Electrode Arrangement for Epiretinal Prostheses

Authors: Ashley Bruce, Michael Beyeler

Abstract: Visual neuroprostheses are the only FDA-approved technology for the treatment of retinal degenerative blindness. Although recent work has demonstrated a systematic relationship between electrode location and the shape of the elicited visual percept, this knowledge has yet to be incorporated into retinal prosthesis design, where electrodes are typically arranged on either a rectangular or hexagonal… ▽ More Visual neuroprostheses are the only FDA-approved technology for the treatment of retinal degenerative blindness. Although recent work has demonstrated a systematic relationship between electrode location and the shape of the elicited visual percept, this knowledge has yet to be incorporated into retinal prosthesis design, where electrodes are typically arranged on either a rectangular or hexagonal grid. Here we optimize the intraocular placement of epiretinal electrodes using dictionary learning. Importantly, the optimization process is informed by a previously established and psychophysically validated model of simulated prosthetic vision. We systematically evaluate three different electrode placement strategies across a wide range of possible phosphene shapes and recommend electrode arrangements that maximize visual subfield coverage. In the near future, our work may guide the prototyping of next-generation neuroprostheses. △ Less

Submitted 30 June, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

arXiv:1702.06665 [pdf]

doi 10.1523/JNEUROSCI.0396-16.2016

Visual response properties of MSTd emerge from a sparse population code

Authors: Michael Beyeler, Nikil Dutt, Jeffrey L. Krichmar

Abstract: Neurons in the dorsal subregion of the medial superior temporal (MSTd) area respond to large, complex patterns of retinal flow, implying a role in the analysis of self-motion. Some neurons are selective for the expanding radial motion that occurs as an observer moves through the environment ("heading"), and computational models can account for this finding. However, ample evidence suggests that MS… ▽ More Neurons in the dorsal subregion of the medial superior temporal (MSTd) area respond to large, complex patterns of retinal flow, implying a role in the analysis of self-motion. Some neurons are selective for the expanding radial motion that occurs as an observer moves through the environment ("heading"), and computational models can account for this finding. However, ample evidence suggests that MSTd neurons may exhibit a continuum of visual response selectivity to large-field motion stimuli, but the underlying computational principles by which these response properties are derived remain poorly understood. Here we describe a computational model of MSTd based on the hypothesis that neurons in MSTd efficiently encode the continuum of large-field retinal flow patterns on the basis of inputs received from neurons in MT, with receptive fields that resemble basis vectors recovered with nonnegative matrix factorization (NMF). These assumptions are sufficient to quantitatively simulate neurophysiological response properties of MSTd cells such as radial, circular, and spiral motion tuning, suggesting that these properties might simply be a by-product of MSTd neurons performing dimensionality reduction on their inputs. At the population level, model MSTd accurately predicts heading using a sparse distributed code, consistent with the idea that biological MSTd might operate in a sparseness regime well-suited to efficiently encode a number of self-motion variables. The present work provides an alternative to the template-model view of MSTd, and offers a biologically plausible account of the receptive field structure across a wide range of visual response properties in MSTd. △ Less

Submitted 21 February, 2017; originally announced February 2017.

Showing 1–8 of 8 results for author: Beyeler, M