Showing 1–48 of 48 results for author: Buckley, C L

  1. arXiv:2505.13124  [pdf, ps, other]

    cs.LG cs.AI cs.NE

    $μ$PC: Scaling Predictive Coding to 100+ Layer Networks

    Authors: Francesco Innocenti, El Mehdi Achour, Christopher L. Buckley

    Abstract: The biological implausibility of backpropagation (BP) has motivated many alternative, brain-inspired algorithms that attempt to rely only on local information, such as predictive coding (PC) and equilibrium propagation. However, these algorithms have notoriously struggled to train very deep networks, preventing them from competing with BP in large-scale settings. Indeed, scaling PC networks (PCNs)…

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 34 pages, 41 figures

    ACM Class: I.2.6

  2. arXiv:2412.03676  [pdf, other]

    cs.NE cs.AI cs.LG

    JPC: Flexible Inference for Predictive Coding Networks in JAX

    Authors: Francesco Innocenti, Paul Kinghorn, Will Yun-Farmbrough, Miguel De Llanza Varona, Ryan Singh, Christopher L. Buckley

    Abstract: We introduce JPC, a JAX library for training neural networks with Predictive Coding. JPC provides a simple, fast and flexible interface to train a variety of PC networks (PCNs) including discriminative, generative and hybrid models. Unlike existing libraries, JPC leverages ordinary differential equation solvers to integrate the gradient flow inference dynamics of PCNs. We find that a second-order…

    Submitted 4 December, 2024; originally announced December 2024.

    Comments: 9 pages, 7 figures

  3. arXiv:2410.03592  [pdf, other]

    cs.CV cs.AI

    Variational Bayes Gaussian Splatting

    Authors: Toon Van de Maele, Ozan Catal, Alexander Tschantz, Christopher L. Buckley, Tim Verbelen

    Abstract: Recently, 3D Gaussian Splatting has emerged as a promising approach for modeling 3D scenes using mixtures of Gaussians. The predominant optimization method for these models relies on backpropagating gradients through a differentiable rendering pipeline, which struggles with catastrophic forgetting when dealing with continuous streams of data. To address this limitation, we propose Variational Baye…

    Submitted 4 October, 2024; originally announced October 2024.

  4. arXiv:2409.14216  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    R-AIF: Solving Sparse-Reward Robotic Tasks from Pixels with Active Inference and World Models

    Authors: Viet Dung Nguyen, Zhizhuo Yang, Christopher L. Buckley, Alexander Ororbia

    Abstract: Although research has produced promising results demonstrating the utility of active inference (AIF) in Markov decision processes (MDPs), there is relatively less work that builds AIF models in the context of environments and problems that take the form of partially observable Markov decision processes (POMDPs). In POMDP scenarios, the agent must infer the unobserved environmental state from raw s…

    Submitted 21 September, 2024; originally announced September 2024.

    Comments: 20 pages, 2 algorithms, 2 tables, 5 figures, submitted to ICRA 2025

    MSC Class: 68T40 (Primary) 68T07; 68T37; 68T05 (Secondary) ACM Class: I.2.9; I.2.10; G.3; I.2.6

  5. arXiv:2409.08892  [pdf, other]

    cs.AI q-bio.NC

    Exploring Action-Centric Representations Through the Lens of Rate-Distortion Theory

    Authors: Miguel de Llanza Varona, Christopher L. Buckley, Beren Millidge

    Abstract: Organisms have to keep track of the information in the environment that is relevant for adaptive behaviour. Transmitting information in an economical and efficient way becomes crucial for limited-resourced agents living in high-dimensional environments. The efficient coding hypothesis claims that organisms seek to maximize the information about the sensory input in an efficient manner. Under Bayes…

    Submitted 13 September, 2024; originally announced September 2024.

    Journal ref: 4th International Workshop on Active Inference, 2023

  6. arXiv:2409.01066  [pdf, other]

    cs.AI eess.SY

    Learning in Hybrid Active Inference Models

    Authors: Poppy Collis, Ryan Singh, Paul F Kinghorn, Christopher L Buckley

    Abstract: An open problem in artificial intelligence is how systems can flexibly learn discrete abstractions that are useful for solving inherently continuous problems. Previous work in computational neuroscience has considered this functional integration of discrete and continuous variables during decision-making under the formalism of active inference (Parr, Friston & de Vries, 2017; Parr & Friston, 2018)…

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: 11 pages (+ appendix). Accepted to the International Workshop on Active Inference 2024. arXiv admin note: substantial text overlap with arXiv:2408.10970

  7. arXiv:2408.11979  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Only Strict Saddles in the Energy Landscape of Predictive Coding Networks?

    Authors: Francesco Innocenti, El Mehdi Achour, Ryan Singh, Christopher L. Buckley

    Abstract: Predictive coding (PC) is an energy-based learning algorithm that performs iterative inference over network activities before updating weights. Recent work suggests that PC can converge in fewer learning steps than backpropagation thanks to its inference procedure. However, these advantages are not always observed, and the impact of PC inference on learning is not theoretically well understood. He…

    Submitted 8 November, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: 35 pages, 12 figures

    ACM Class: I.2.6

  8. arXiv:2408.10970  [pdf, other]

    cs.AI eess.SY

    Hybrid Recurrent Models Support Emergent Descriptions for Hierarchical Planning and Control

    Authors: Poppy Collis, Ryan Singh, Paul F Kinghorn, Christopher L Buckley

    Abstract: An open problem in artificial intelligence is how systems can flexibly learn discrete abstractions that are useful for solving inherently continuous problems. Previous work has demonstrated that a class of hybrid state-space model known as recurrent switching linear dynamical systems (rSLDS) discover meaningful behavioural units via the piecewise linear decomposition of complex continuous dynamics…

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 4 pages, 3 figures

  9. arXiv:2312.07547  [pdf, other]

    q-bio.NC cs.LG

    Active Inference and Intentional Behaviour

    Authors: Karl J. Friston, Tommaso Salvatori, Takuya Isomura, Alexander Tschantz, Alex Kiefer, Tim Verbelen, Magnus Koudahl, Aswin Paul, Thomas Parr, Adeel Razi, Brett Kagan, Christopher L. Buckley, Maxwell J. D. Ramstead

    Abstract: Recent advances in theoretical biology suggest that basal cognition and sentient behaviour are emergent properties of in vitro cell cultures and neuronal networks, respectively. Such neuronal networks spontaneously learn structured behaviours in the absence of reward or reinforcement. In this paper, we characterise this kind of self-organisation through the lens of the free energy principle, i.e.,…

    Submitted 16 December, 2023; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: 33 pages, 9 figures

  10. arXiv:2311.10300  [pdf, other]

    cs.LG cs.AI

    Supervised structure learning

    Authors: Karl J. Friston, Lancelot Da Costa, Alexander Tschantz, Alex Kiefer, Tommaso Salvatori, Victorita Neacsu, Magnus Koudahl, Conor Heins, Noor Sajid, Dimitrije Markovic, Thomas Parr, Tim Verbelen, Christopher L Buckley

    Abstract: This paper concerns structure learning or discovery of discrete generative models. It focuses on Bayesian model selection and the assimilation of training data or content, with a special emphasis on the order in which data are ingested. A key move - in the ensuing schemes - is to place priors on the selection of models, based upon expected free energy. In this setting, expected free energy reduces…

    Submitted 16 November, 2023; originally announced November 2023.

  11. arXiv:2311.03893  [pdf, other]

    cs.AI q-bio.NC

    Understanding Tool Discovery and Tool Innovation Using Active Inference

    Authors: Poppy Collis, Paul F Kinghorn, Christopher L Buckley

    Abstract: The ability to invent new tools has been identified as an important facet of our ability as a species to problem solve in dynamic and novel environments. While the use of tools by artificial agents presents a challenging task and has been widely identified as a key goal in the field of autonomous robotics, far less research has tackled the invention of new tools by agents. In this paper, (1) we ar…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 13 pages, 8 pages, accepted for International Workshop on Active Inference 2023, due to be published in IWAI 2023, CCIS 1915 proceedings (Springer) 2024

  12. arXiv:2309.04653  [pdf, other]

    q-bio.NC nlin.AO

    Relative representations for cognitive graphs

    Authors: Alex B. Kiefer, Christopher L. Buckley

    Abstract: Although the latent spaces learned by distinct neural networks are not generally directly comparable, recent work in machine learning has shown that it is possible to use the similarities and differences among latent space vectors to derive "relative representations" with comparable representational power to their "absolute" counterparts, and which are nearly identical across models trained on sim…

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 19 pages, 1 table, 6 figures. Accepted paper at the 4th International Workshop on Active Inference (Ghent, Belgium 2023)

  13. arXiv:2308.07870  [pdf, other]

    cs.AI cs.LG cs.NE

    A Survey on Brain-Inspired Deep Learning via Predictive Coding

    Authors: Tommaso Salvatori, Ankur Mali, Christopher L. Buckley, Thomas Lukasiewicz, Rajesh P. N. Rao, Karl Friston, Alexander Ororbia

    Abstract: Artificial intelligence (AI) is rapidly becoming one of the key technologies of this century. The majority of results in AI thus far have been achieved using deep neural networks trained with the error backpropagation learning algorithm. However, the ubiquitous adoption of this approach has highlighted some important limitations such as substantial computational cost, difficulty in quantifying unc…

    Submitted 23 January, 2025; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: 37 Pages, 9 Figures

  14. arXiv:2305.18188  [pdf, other]

    cs.NE cs.AI cs.LG

    Understanding Predictive Coding as an Adaptive Trust-Region Method

    Authors: Francesco Innocenti, Ryan Singh, Christopher L. Buckley

    Abstract: Predictive coding (PC) is a brain-inspired local learning algorithm that has recently been suggested to provide advantages over backpropagation (BP) in biologically relevant scenarios. While theoretical work has mainly focused on showing how PC can approximate BP in various limits, the putative benefits of "natural" PC are less understood. Here we develop a theory of PC as an adaptive trust-region…

    Submitted 29 May, 2023; originally announced May 2023.

  15. arXiv:2304.04556  [pdf, ps, other]

    cs.LG cs.AI cs.NE

    Attention: Marginal Probability is All You Need?

    Authors: Ryan Singh, Christopher L. Buckley

    Abstract: Attention mechanisms are a central property of cognitive systems allowing them to selectively deploy cognitive resources in a flexible manner. Attention has been long studied in the neurosciences and there are numerous phenomenological models that try to capture its core properties. Recently attentional mechanisms have become a dominating architectural choice of machine learning and are the centra…

    Submitted 7 April, 2023; originally announced April 2023.

  16. arXiv:2302.08582  [pdf, other]

    cs.CL cs.LG

    Pretraining Language Models with Human Preferences

    Authors: Tomasz Korbak, Kejian Shi, Angelica Chen, Rasika Bhalerao, Christopher L. Buckley, Jason Phang, Samuel R. Bowman, Ethan Perez

    Abstract: Language models (LMs) are pretrained to imitate internet text, including content that would violate human preferences if generated by an LM: falsehoods, offensive comments, personally identifiable information, low-quality or buggy code, and more. Here, we explore alternative objectives for pretraining LMs in a way that also guides them to generate text aligned with human preferences. We benchmark…

    Submitted 14 June, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  17. arXiv:2212.01354  [pdf, other]

    cs.AI cs.MA nlin.AO

    Designing Ecosystems of Intelligence from First Principles

    Authors: Karl J Friston, Maxwell J D Ramstead, Alex B Kiefer, Alexander Tschantz, Christopher L Buckley, Mahault Albarracin, Riddhi J Pitliya, Conor Heins, Brennan Klein, Beren Millidge, Dalton A R Sakthivadivel, Toby St Clere Smithe, Magnus Koudahl, Safae Essafi Tremblay, Capm Petersen, Kaiser Fung, Jason G Fox, Steven Swanson, Dan Mapes, Gabriel René

    Abstract: This white paper lays out a vision of research and development in the field of artificial intelligence for the next decade (and beyond). Its denouement is a cyber-physical ecosystem of natural and synthetic sense-making, in which humans are integral participants -- what we call "shared intelligence". This vision is premised on active inference, a formulation of adaptive behavior that can be read…

    Submitted 11 January, 2024; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: 23+18 pages, one figure, one six page appendix

    Journal ref: Collective Intelligence, 3(1), 2024

  18. arXiv:2209.02567  [pdf, other]

    q-bio.NC

    Capsule Networks as Generative Models

    Authors: Alex B. Kiefer, Beren Millidge, Alexander Tschantz, Christopher L. Buckley

    Abstract: Capsule networks are a neural network architecture specialized for visual scene recognition. Features and pose information are extracted from a scene and then dynamically routed through a hierarchy of vector-valued nodes called 'capsules' to create an implicit scene graph, with the ultimate aim of learning vision directly as inverse graphics. Despite these intuitions, however, capsule networks are…

    Submitted 6 October, 2022; v1 submitted 6 September, 2022; originally announced September 2022.

    Comments: Accepted at the 3rd International Workshop on Active Inference, 19th Sept 2022, Grenoble; This version: added reference, corrected typographical error; final submitted version

  19. arXiv:2208.07114  [pdf, other]

    cs.AI q-bio.NC

    Preventing Deterioration of Classification Accuracy in Predictive Coding Networks

    Authors: Paul F Kinghorn, Beren Millidge, Christopher L Buckley

    Abstract: Predictive Coding Networks (PCNs) aim to learn a generative model of the world. Given observations, this generative model can then be inverted to infer the causes of those observations. However, when training PCNs, a noticeable pathology is often observed where inference accuracy peaks and then declines with further training. This cannot be explained by overfitting since both training and test acc…

    Submitted 1 September, 2022; v1 submitted 15 August, 2022; originally announced August 2022.

    Comments: preprint of IWAI 2022 conference paper. this version clarifies comments in final paragraph of section 3

  20. arXiv:2207.12914  [pdf, other]

    q-bio.NC cond-mat.dis-nn nlin.CD

    Knitting a Markov blanket is hard when you are out-of-equilibrium: two examples in canonical nonequilibrium models

    Authors: Miguel Aguilera, Ángel Poc-López, Conor Heins, Christopher L. Buckley

    Abstract: Bayesian theories of biological and brain function speculate that Markov blankets (a conditional independence separating a system from external states) play a key role for facilitating inference-like behaviour in living systems. Although it has been suggested that Markov blankets are commonplace in sparsely connected, nonequilibrium complex systems, this has not been studied in detail. Here, we sh…

    Submitted 26 July, 2022; originally announced July 2022.

  21. arXiv:2207.09897  [pdf, other]

    cs.AI

    Successor Representation Active Inference

    Authors: Beren Millidge, Christopher L Buckley

    Abstract: Recent work has uncovered close links between classical reinforcement learning algorithms, Bayesian filtering, and Active Inference which lets us understand value functions in terms of Bayesian posteriors. An alternative, but less explored, model-free RL algorithm is the successor representation, which expresses the value function in terms of a successor matrix of expected future state occ…

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: 20/07/22 initial upload

  22. arXiv:2205.11275  [pdf, other]

    cs.LG stat.ML

    RL with KL penalties is better viewed as Bayesian inference

    Authors: Tomasz Korbak, Ethan Perez, Christopher L Buckley

    Abstract: Reinforcement learning (RL) is frequently employed in fine-tuning large language models (LMs), such as GPT-3, to penalize them for undesirable features of generated sequences, such as offensiveness, social bias, harmfulness or falsehood. The RL formulation involves treating the LM as a policy and updating it to maximise the expected value of a reward function which captures human preferences, such…

    Submitted 21 October, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: Findings of EMNLP 2022

  23. arXiv:2204.02169  [pdf, other]

    q-bio.NC cs.AI cs.LG

    Hybrid Predictive Coding: Inferring, Fast and Slow

    Authors: Alexander Tschantz, Beren Millidge, Anil K Seth, Christopher L Buckley

    Abstract: Predictive coding is an influential model of cortical neural activity. It proposes that perceptual beliefs are furnished by sequentially minimising "prediction errors" - the differences between predicted and observed data. Implicit in this proposal is the idea that perception requires multiple cycles of neural activity. This is at odds with evidence that several aspects of visual perception - incl…

    Submitted 6 April, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: 05/04/22 initial upload. 06/04/22 added acknowledgements section

  24. arXiv:2112.01871  [pdf, other]

    cs.RO cs.AI cs.LG

    Active Inference in Robotics and Artificial Agents: Survey and Challenges

    Authors: Pablo Lanillos, Cristian Meo, Corrado Pezzato, Ajith Anil Meera, Mohamed Baioumy, Wataru Ohata, Alexander Tschantz, Beren Millidge, Martijn Wisse, Christopher L. Buckley, Jun Tani

    Abstract: Active inference is a mathematical framework which originated in computational neuroscience as a theory of how the brain implements action, perception and learning. Recently, it has been shown to be a promising approach to the problems of state-estimation and control under uncertainty, as well as a foundation for the construction of goal-driven behaviours in robotics and artificial agents in gener…

    Submitted 3 December, 2021; originally announced December 2021.

    Comments: This manuscript is under review in a IEEE journal

  25. arXiv:2109.00866  [pdf, other]

    cs.AI q-bio.NC

    Habitual and Reflective Control in Hierarchical Predictive Coding

    Authors: Paul F. Kinghorn, Beren Millidge, Christopher L. Buckley

    Abstract: In cognitive science, behaviour is often separated into two types. Reflexive control is habitual and immediate, whereas reflective is deliberative and time consuming. We examine the argument that Hierarchical Predictive Coding (HPC) can explain both types of behaviour as a continuum operating across a multi-layered network, removing the need for separate circuits in the brain. On this view, "fast"…

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: 02/09/2021 Initial Upload

  26. arXiv:2108.13343  [pdf, other]

    cs.AI

    A Mathematical Walkthrough and Discussion of the Free Energy Principle

    Authors: Beren Millidge, Anil Seth, Christopher L Buckley

    Abstract: The Free-Energy-Principle (FEP) is an influential and controversial theory which postulates a deep and powerful connection between the stochastic thermodynamics of self-organization and learning through variational inference. Specifically, it claims that any self-organizing system which can be statistically separated from its environment, and which maintains itself at a non-equilibrium steady stat…

    Submitted 1 October, 2021; v1 submitted 30 August, 2021; originally announced August 2021.

    Comments: 30/08/21 initial upload; 02/10/21 minor maths fixes

  27. arXiv:2107.12979  [pdf, other]

    cs.AI cs.NE q-bio.NC

    Predictive Coding: a Theoretical and Experimental Review

    Authors: Beren Millidge, Anil Seth, Christopher L Buckley

    Abstract: Predictive coding offers a potentially unifying account of cortical function -- postulating that the core function of the brain is to minimize prediction errors with respect to a generative model of the world. The theory is closely related to the Bayesian brain framework and, over the last two decades, has gained substantial influence in the fields of theoretical and cognitive neuroscience. A larg…

    Submitted 12 July, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: 27/07/21 initial upload; 14/01/22 maths fix; 05/07/22 maths fix; 12/07/22 text fixes

  28. How particular is the physics of the free energy principle?

    Authors: Miguel Aguilera, Beren Millidge, Alexander Tschantz, Christopher L. Buckley

    Abstract: The free energy principle (FEP) states that any dynamical system can be interpreted as performing Bayesian inference upon its surrounding environment. In this work, we examine in depth the assumptions required to derive the FEP in the simplest possible set of systems -- weakly-coupled non-equilibrium linear stochastic systems. Specifically, we explore (i) how general the requirements imposed on th…

    Submitted 19 May, 2022; v1 submitted 24 May, 2021; originally announced May 2021.

    Journal ref: Physics of Life Reviews. Volume 40, March 2022, Pages 24-50

  29. arXiv:2010.06219  [pdf, other]

    cs.AI cs.NE stat.ML

    Investigating the Scalability and Biological Plausibility of the Activation Relaxation Algorithm

    Authors: Beren Millidge, Alexander Tschantz, Anil Seth, Christopher L Buckley

    Abstract: The recently proposed Activation Relaxation (AR) algorithm provides a simple and robust approach for approximating the backpropagation of error algorithm using only local learning rules. Unlike competing schemes, it converges to the exact backpropagation gradients, and utilises only a single type of computational unit and a single backwards relaxation phase. We have previously shown that the algor…

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: 13/10/20 initial upload

  30. arXiv:2010.01047  [pdf, other]

    q-bio.NC cs.AI stat.ML

    Relaxing the Constraints on Predictive Coding Models

    Authors: Beren Millidge, Alexander Tschantz, Anil Seth, Christopher L Buckley

    Abstract: Predictive coding is an influential theory of cortical function which posits that the principal computation the brain performs, which underlies both perception and learning, is the minimization of prediction errors. While motivated by high-level notions of variational inference, detailed neurophysiological models of cortical microcircuits which can implement its computations have been developed.…

    Submitted 10 October, 2020; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: 02/10/20 initial upload; 10/10/20 minor fixes

  31. arXiv:2009.05359  [pdf, other]

    cs.NE cs.AI cs.LG q-bio.NC

    Activation Relaxation: A Local Dynamical Approximation to Backpropagation in the Brain

    Authors: Beren Millidge, Alexander Tschantz, Anil K Seth, Christopher L Buckley

    Abstract: The backpropagation of error algorithm (backprop) has been instrumental in the recent success of deep learning. However, a key question remains as to whether backprop can be formulated in a manner suitable for implementation in neural circuitry. The primary challenge is to ensure that any candidate formulation uses only local information, rather than relying on global signals as in standard backpr…

    Submitted 10 October, 2020; v1 submitted 11 September, 2020; originally announced September 2020.

    Comments: initial upload; revised version (updated abstract, related work) 28-09-20; 05/10/20: revised for ICLR submission; 10/10/20: minor revisions

  32. arXiv:2007.05838  [pdf, other]

    cs.LG cs.AI stat.ML

    Control as Hybrid Inference

    Authors: Alexander Tschantz, Beren Millidge, Anil K. Seth, Christopher L. Buckley

    Abstract: The field of reinforcement learning can be split into model-based and model-free methods. Here, we unify these approaches by casting model-free policy optimisation as amortised variational inference, and model-based planning as iterative variational inference, within a 'control as hybrid inference' (CHI) framework. We present an implementation of CHI which naturally mediates the balance between it…

    Submitted 11 July, 2020; originally announced July 2020.

  33. arXiv:2006.12964  [pdf, other]

    cs.AI stat.ML

    On the Relationship Between Active Inference and Control as Inference

    Authors: Beren Millidge, Alexander Tschantz, Anil K Seth, Christopher L Buckley

    Abstract: Active Inference (AIF) is an emerging framework in the brain sciences which suggests that biological agents act to minimise a variational bound on model evidence. Control-as-Inference (CAI) is a framework within reinforcement learning which casts decision making as a variational inference problem. While these frameworks both consider action selection through the lens of variational inference, thei…

    Submitted 29 June, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: final workshop version

  34. Predictions in the eye of the beholder: an active inference account of Watt governors

    Authors: Manuel Baltieri, Christopher L. Buckley, Jelle Bruineberg

    Abstract: Active inference introduces a theory describing action-perception loops via the minimisation of variational (and expected) free energy or, under simplifying assumptions, (weighted) prediction error. Recently, active inference has been proposed as part of a new and unifying framework in the cognitive sciences: predictive processing. Predictive processing is often associated with traditional computa…

    Submitted 25 June, 2020; v1 submitted 20 June, 2020; originally announced June 2020.

    Comments: Accepted at ALife 2020

  35. arXiv:2006.10524  [pdf, other]

    cs.LG cs.AI stat.ML

    Reinforcement Learning as Iterative and Amortised Inference

    Authors: Beren Millidge, Alexander Tschantz, Anil K Seth, Christopher L Buckley

    Abstract: There are several ways to categorise reinforcement learning (RL) algorithms, such as either model-based or model-free, policy-based or planning-based, on-policy or off-policy, and online or offline. Broad classification schemes such as these help provide a unified perspective on disparate techniques and can contextualise and guide the development of new algorithms. In this paper, we utilise the co…

    Submitted 5 July, 2020; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: initial upload; 05-07-20 -- updated with minor corrections

  36. arXiv:2006.04182  [pdf, other]

    cs.LG cs.NE

    Predictive Coding Approximates Backprop along Arbitrary Computation Graphs

    Authors: Beren Millidge, Alexander Tschantz, Christopher L. Buckley

    Abstract: Backpropagation of error (backprop) is a powerful algorithm for training machine learning architectures through end-to-end differentiation. However, backprop is often criticised for lacking biological plausibility. Recently, it has been shown that backprop in multilayer-perceptrons (MLPs) can be approximated using predictive coding, a biologically-plausible process theory of cortical computation w…

    Submitted 5 October, 2020; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: Submitted to NeurIPS 2020. Updated Acknowledgements. 11/06/20: fixed typos in maths -- 11/07/20: minor corrections; 05/10/20: major rewrite for ICLR

  37. arXiv:2005.06269  [pdf, other]

    q-bio.NC

    On Kalman-Bucy filters, linear quadratic control and active inference

    Authors: Manuel Baltieri, Christopher L. Buckley

    Abstract: Linear Quadratic Gaussian (LQG) control is a framework first introduced in control theory that provides an optimal solution to linear problems of regulation in the presence of uncertainty. This framework combines Kalman-Bucy filters for the estimation of hidden states with Linear Quadratic Regulators for the control of their dynamics. Nowadays, LQG is also a common paradigm in neuroscience, where…

    Submitted 13 May, 2020; originally announced May 2020.

  38. arXiv:2004.08128  [pdf, ps, other]

    cs.AI

    Whence the Expected Free Energy?

    Authors: Beren Millidge, Alexander Tschantz, Christopher L Buckley

    Abstract: The Expected Free Energy (EFE) is a central quantity in the theory of active inference. It is the quantity that all active inference agents are mandated to minimize through action, and its decomposition into extrinsic and intrinsic value terms is key to the balance of exploration and exploitation that active inference agents evince. Despite its importance, the mathematical origins of this quantity…

    Submitted 28 September, 2020; v1 submitted 17 April, 2020; originally announced April 2020.

    Comments: 24 pages, 0 figures. Reuploaded to correct typos in the original. Update 05-07-20 -- minor corrections. Update 28-09-20 -- Final version accepted by Neural Computation

  39. arXiv:2002.12636  [pdf, other]

    cs.LG cs.AI cs.IT eess.SY stat.ML

    Reinforcement Learning through Active Inference

    Authors: Alexander Tschantz, Beren Millidge, Anil K. Seth, Christopher L. Buckley

    Abstract: The central tenet of reinforcement learning (RL) is that agents seek to maximize the sum of cumulative rewards. In contrast, active inference, an emerging framework within cognitive and computational neuroscience, proposes that agents act to maximize the evidence for a biased generative model. Here, we illustrate how ideas from active inference can augment traditional RL approaches by (i) furnishi…

    Submitted 28 February, 2020; originally announced February 2020.

  40. arXiv:1911.10601  [pdf, other]

    cs.LG cs.AI cs.IT eess.SY stat.ML

    Scaling active inference

    Authors: Alexander Tschantz, Manuel Baltieri, Anil K. Seth, Christopher L. Buckley

    Abstract: In reinforcement learning (RL), agents often operate in partially observed and uncertain environments. Model-based RL suggests that this is best achieved by learning and exploiting a probabilistic model of the world. 'Active inference' is an emerging normative framework in cognitive and computational neuroscience that offers a unifying account of how biological agents achieve this. On this framewo…

    Submitted 24 November, 2019; originally announced November 2019.

  41. Generative models as parsimonious descriptions of sensorimotor loops

    Authors: Manuel Baltieri, Christopher L. Buckley

    Abstract: The Bayesian brain hypothesis, predictive processing and variational free energy minimisation are typically used to describe perceptual processes based on accurate generative models of the world. However, generative models need not be veridical representations of the environment. We suggest that they can (and should) be used to describe sensorimotor relationships relevant for behaviour rather than…

    Submitted 29 April, 2019; originally announced April 2019.

    Comments: Commentary on Brette (2019) https://doi.org/10.1017/S0140525X19000049

    Journal ref: Behav Brain Sci 42 (2019) e218

  42. Nonmodular architectures of cognitive systems based on active inference

    Authors: Manuel Baltieri, Christopher L. Buckley

    Abstract: In psychology and neuroscience it is common to describe cognitive systems as input/output devices where perceptual and motor functions are implemented in a purely feedforward, open-loop fashion. On this view, perception and action are often seen as encapsulated modules with limited interaction between them. While embodied and enactive approaches to cognitive science have challenged the idealisatio…

    Submitted 22 March, 2019; originally announced March 2019.

    Comments: Accepted at IJCNN 2019

  43. The modularity of action and perception revisited using control theory and active inference

    Authors: Manuel Baltieri, Christopher L. Buckley

    Abstract: The assumption that action and perception can be investigated independently is entrenched in theories, models and experimental approaches across the brain and mind sciences. In cognitive science, this has been a central point of contention between computationalist and 4Es (enactive, embodied, extended and embedded) theories of cognition, with the former embracing the "classical sandwich", modular,…

    Submitted 7 June, 2018; originally announced June 2018.

    Comments: Accepted at the International conference on Artificial Life, Tokyo, 2018

  44. An active inference implementation of phototaxis

    Authors: Manuel Baltieri, Christopher L. Buckley

    Abstract: Active inference is emerging as a possible unifying theory of perception and action in cognitive and computational neuroscience. On this theory, perception is a process of inferring the causes of sensory data by minimising the error between actual sensations and those predicted by an inner \emph{generative} (probabilistic) model. Action on the other hand is drawn as a process that modifies the wor…

    Submitted 6 July, 2017; originally announced July 2017.

    Comments: 8 pages, 3 figures, accepted at ECAL (European Conference on Artificial Life) 2017, Lyon, France

  45. arXiv:1705.09156  [pdf, other]

    q-bio.NC

    The free energy principle for action and perception: A mathematical review

    Authors: Christopher L. Buckley, Chang Sub Kim, Simon McGregor, Anil K. Seth

    Abstract: The 'free energy principle' (FEP) has been suggested to provide a unified theory of the brain, integrating data and theory relating to action, perception, and learning. The theory and implementation of the FEP combines insights from Helmholtzian 'perception as inference', machine learning theory, and statistical thermodynamics. Here, we provide a detailed mathematical evaluation of a suggested bio…

    Submitted 24 May, 2017; originally announced May 2017.

    Comments: 77 pages, 2 figures

  46. arXiv:1602.08881  [pdf]

    q-bio.NC

    Brain State Control by Closed-Loop Environmental Feedback

    Authors: Christopher L. Buckley, Satohiro Tajima, Toru Yanagawa, Kana Takakura, Yasuo Nagasaka, Naotaka Fujii, Taro Toyoizumi

    Abstract: Brain state regulates sensory processing and motor control for adaptive behavior. Internal mechanisms of brain state control are well studied, but the role of external modulation from the environment is not well understood. Here, we examined the role of closed-loop environmental (CLE) feedback, in comparison to open-loop sensory input, on brain state and behavior in diverse vertebrate systems. In…

    Submitted 29 February, 2016; originally announced February 2016.

  47. arXiv:1503.04187  [pdf, ps, other]

    cs.AI

    A Minimal Active Inference Agent

    Authors: Simon McGregor, Manuel Baltieri, Christopher L. Buckley

    Abstract: Research on the so-called "free-energy principle" (FEP) in cognitive neuroscience is becoming increasingly high-profile. To date, introductions to this theory have proved difficult for many readers to follow, but it depends mainly upon two relatively simple ideas: firstly that normative or teleological values can be expressed as probability distributions (active inference), and secondly that appr…

    Submitted 13 March, 2015; originally announced March 2015.

  48. arXiv:1011.5334  [pdf, ps, other]

    q-bio.NC

    A Graph Theoretic Interpretation of Neural Complexity

    Authors: L. Barnett, C. L. Buckley, S. Bullock

    Abstract: One of the central challenges facing modern neuroscience is to explain the ability of the nervous system to coherently integrate information across distinct functional modules in the absence of a central executive. To this end Tononi et al. [Proc. Nat. Acad. Sci. USA 91, 5033 (1994)] proposed a measure of neural complexity that purports to capture this property based on mutual information between…

    Submitted 29 November, 2010; v1 submitted 24 November, 2010; originally announced November 2010.

    Comments: submitted Phys. Rev. E, Nov. 2010