Search | arXiv e-print repository

Object-based active inference

Authors: Ruben S. van Bergen, Pablo L. Lanillos

Abstract: The world consists of objects: distinct entities possessing independent properties and dynamics. For agents to interact with the world intelligently, they must translate sensory inputs into the bound-together features that describe each object. These object-based representations form a natural basis for planning behavior. Active inference (AIF) is an influential unifying account of perception and action, but existing AIF models have not leveraged this important inductive bias. To remedy this, we introduce 'object-based active inference' (OBAI), marrying AIF with recent deep object-based neural networks. OBAI represents distinct objects with separate variational beliefs, and uses selective attention to route inputs to their corresponding object slots. Object representations are endowed with independent action-based dynamics. The dynamics and generative model are learned from experience with a simple environment (active multi-dSprites). We show that OBAI learns to correctly segment the action-perturbed objects from video input, and to manipulate these objects towards arbitrary goals.

Submitted 2 September, 2022; originally announced September 2022.

arXiv:2003.12128 [pdf, ps, other]

doi 10.1016/j.conb.2020.11.009

Going in circles is the way forward: the role of recurrence in visual inference

Authors: Ruben S. van Bergen, Nikolaus Kriegeskorte

Abstract: Biological visual systems exhibit abundant recurrent connectivity. State-of-the-art neural network models for visual recognition, by contrast, rely heavily or exclusively on feedforward computation. Any finite-time recurrent neural network (RNN) can be unrolled along time to yield an equivalent feedforward neural network (FNN). This important insight suggests that computational neuroscientists may not need to engage recurrent computation, and that computer-vision engineers may be limiting themselves to a special case of FNN if they build recurrent models. Here we argue, to the contrary, that FNNs are a special case of RNNs and that computational neuroscientists and engineers should engage recurrence to understand how brains and machines can (1) achieve greater and more flexible computational depth, (2) compress complex computations into limited hardware, (3) integrate priors and priorities into visual inference through expectation and attention, (4) exploit sequential dependencies in their data for better inference and prediction, and (5) leverage the power of iterative computation.

Submitted 16 November, 2020; v1 submitted 26 March, 2020; originally announced March 2020.

arXiv:1708.04860 [pdf]

doi 10.1016/j.neuroimage.2017.08.015

Modeling correlated noise is necessary to decode uncertainty

Authors: R. S. van Bergen, J. F. M. Jehee

Abstract: Brain decoding algorithms form an important part of the arsenal of analysis tools available to neuroscientists, allowing for a more detailed study of the kind of information represented in patterns of cortical activity. While most current decoding algorithms focus on estimating a single, most likely stimulus from the pattern of noisy fMRI responses, the presence of noise causes this estimate to be uncertain. This uncertainty in stimulus estimates is a potentially highly relevant aspect of cortical stimulus processing, and features prominently in Bayesian or probabilistic models of neural coding. Here, we focus on sensory uncertainty and how best to extract this information with fMRI. We first demonstrate in simulations that decoding algorithms that take into account correlated noise between fMRI voxels better recover the amount of uncertainty (quantified as the width of a probability distribution over possible stimuli) associated with the decoded estimate. Furthermore, we show that not all correlated variability should be treated equally, as modeling tuning-dependent correlations has the greatest impact on decoding performance. Next, we examine actual noise correlations in human visual cortex, and find that shared variability in areas V1-V3 depends on the tuning properties of fMRI voxels. In line with our simulations, accounting for this shared noise between similarly tuned voxels produces important benefits in decoding. Our findings underscore the importance of accurate noise models in fMRI decoding approaches, and suggest a statistically feasible method to incorporate the most relevant forms of shared noise.

Submitted 16 August, 2017; originally announced August 2017.

Showing 1–3 of 3 results for author: van Bergen, R S