Skip to main content

Showing 1–8 of 8 results for author: Aceituno, P V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.03889  [pdf, ps, other

    cs.LG nlin.CD

    Temporal horizons in forecasting: a performance-learnability trade-off

    Authors: Pau Vilimelis Aceituno, Jack William Miller, Noah Marti, Youssef Farag, Victor Boussange

    Abstract: When training autoregressive models to forecast dynamical systems, a critical question arises: how far into the future should the model be trained to predict? Too short a horizon may miss long-term trends, while too long a horizon can impede convergence due to accumulating prediction errors. In this work, we formalize this trade-off by analyzing how the geometry of the loss landscape depends on th… ▽ More

    Submitted 19 June, 2025; v1 submitted 4 June, 2025; originally announced June 2025.

    Comments: 33 pages, 12 figures

  2. arXiv:2502.10927  [pdf, ps, other

    cs.LG

    The underlying structures of self-attention: symmetry, directionality, and emergent dynamics in Transformer training

    Authors: Matteo Saponati, Pascal Sager, Pau Vilimelis Aceituno, Thilo Stadelmann, Benjamin Grewe

    Abstract: Self-attention is essential to Transformer architectures, yet how information is embedded in the self-attention matrices and how different objective functions impact this process remains unclear. We present a mathematical framework to analyze self-attention matrices by deriving the structures governing their weight updates. Using this framework, we demonstrate that bidirectional training induces s… ▽ More

    Submitted 3 June, 2025; v1 submitted 15 February, 2025; originally announced February 2025.

  3. arXiv:2407.18838  [pdf, other

    cs.NE cs.LG

    The Role of Temporal Hierarchy in Spiking Neural Networks

    Authors: Filippo Moro, Pau Vilimelis Aceituno, Laura Kriener, Melika Payvand

    Abstract: Spiking Neural Networks (SNNs) have the potential for rich spatio-temporal signal processing thanks to exploiting both spatial and temporal parameters. The temporal dynamics such as time constants of the synapses and neurons and delays have been recently shown to have computational benefits that help reduce the overall number of parameters required in the network and increase the accuracy of the S… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: 16 pages, 9 figures, pre-print

  4. arXiv:2212.04316  [pdf, other

    cs.NE cs.CV q-bio.NC

    Bio-Inspired, Task-Free Continual Learning through Activity Regularization

    Authors: Francesco Lässig, Pau Vilimelis Aceituno, Martino Sorbaro, Benjamin F. Grewe

    Abstract: The ability to sequentially learn multiple tasks without forgetting is a key skill of biological brains, whereas it represents a major challenge to the field of deep learning. To avoid catastrophic forgetting, various continual learning (CL) approaches have been devised. However, these usually require discrete task boundaries. This requirement seems biologically implausible and often limits the ap… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

  5. arXiv:2210.09818  [pdf, other

    cs.LG

    Disentangling the Predictive Variance of Deep Ensembles through the Neural Tangent Kernel

    Authors: Seijin Kobayashi, Pau Vilimelis Aceituno, Johannes von Oswald

    Abstract: Identifying unfamiliar inputs, also known as out-of-distribution (OOD) detection, is a crucial property of any decision making process. A simple and empirically validated technique is based on deep ensembles where the variance of predictions over different neural networks acts as a substitute for input uncertainty. Nevertheless, a theoretical understanding of the inductive biases leading to the pe… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

  6. arXiv:2106.07887  [pdf, other

    cs.LG

    Credit Assignment in Neural Networks through Deep Feedback Control

    Authors: Alexander Meulemans, Matilde Tristany Farinha, Javier García Ordóñez, Pau Vilimelis Aceituno, João Sacramento, Benjamin F. Grewe

    Abstract: The success of deep learning sparked interest in whether the brain learns by using similar techniques for assigning credit to each synaptic weight for its contribution to the network output. However, the majority of current attempts at biologically-plausible learning methods are either non-local in time, require highly specific connectivity motives, or have no clear link to any known mathematical… ▽ More

    Submitted 17 January, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: 14 pages and 4 figures in the main manuscript; 49 pages and 15 figures in the supplementary materials

    MSC Class: 68T07 ACM Class: I.2.6

  7. arXiv:2105.02504  [pdf, other

    cs.IT

    Minimizing costs of communication with random constant weight codes

    Authors: Pau Vilimelis Aceituno

    Abstract: We present a framework for minimizing costs in constant weight codes while maintaining a certain amount of differentiable codewords. Our calculations are based on a combinatorial view of constant weight codes and relay on simple approximations.

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: 5 pages 2 figures

  8. arXiv:1707.02469  [pdf, other

    cs.LG cs.NE

    Tailoring Artificial Neural Networks for Optimal Learning

    Authors: Pau Vilimelis Aceituno, Yan Gang, Yang-Yu Liu

    Abstract: As one of the most important paradigms of recurrent neural networks, the echo state network (ESN) has been applied to a wide range of fields, from robotics to medicine, finance, and language processing. A key feature of the ESN paradigm is its reservoir --- a directed and weighted network of neurons that projects the input time series into a high dimensional space where linear regression or classi… ▽ More

    Submitted 25 February, 2020; v1 submitted 8 July, 2017; originally announced July 2017.

    Comments: 19 pages, 10 figures