Skip to main content

Showing 1–8 of 8 results for author: Cartea, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.22270  [pdf, ps, other

    cs.LG

    Weighted Conditional Flow Matching

    Authors: Sergio Calvo-Ordonez, Matthieu Meunier, Alvaro Cartea, Christoph Reisinger, Yarin Gal, Jose Miguel Hernandez-Lobato

    Abstract: Conditional flow matching (CFM) has emerged as a powerful framework for training continuous normalizing flows due to its computational efficiency and effectiveness. However, standard CFM often produces paths that deviate significantly from straight-line interpolations between prior and target distributions, making generation slower and less accurate due to the need for fine discretization at infer… ▽ More

    Submitted 29 July, 2025; originally announced July 2025.

    Comments: Working paper

  2. arXiv:2506.11898  [pdf, ps, other

    cs.LG stat.ML

    Scalable Generalized Bayesian Online Neural Network Training for Sequential Decision Making

    Authors: Gerardo Duran-Martin, Leandro Sánchez-Betancourt, Álvaro Cartea, Kevin Murphy

    Abstract: We introduce scalable algorithms for online learning and generalized Bayesian inference of neural network parameters, designed for sequential decision making tasks. Our methods combine the strengths of frequentist and Bayesian filtering, which include fast low-rank updates via a block-diagonal approximation of the parameter error covariance, and a well-defined posterior predictive distribution tha… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

    Journal ref: Advances in Neural Information Processing Systems 39 (NeurIPS 2025)

  3. arXiv:2502.20966  [pdf, other

    stat.ML cs.LG

    Post-Hoc Uncertainty Quantification in Pre-Trained Neural Networks via Activation-Level Gaussian Processes

    Authors: Richard Bergna, Stefan Depeweg, Sergio Calvo Ordonez, Jonathan Plenk, Alvaro Cartea, Jose Miguel Hernandez-Lobato

    Abstract: Uncertainty quantification in neural networks through methods such as Dropout, Bayesian neural networks and Laplace approximations is either prone to underfitting or computationally demanding, rendering these approaches impractical for large-scale datasets. In this work, we address these shortcomings by shifting the focus from uncertainty in the weight space to uncertainty at the activation level,… ▽ More

    Submitted 28 February, 2025; originally announced February 2025.

    Comments: 10 pages, 8 figures, 7th Symposium on Advances in Approximate Bayesian Inference

  4. arXiv:2502.01556  [pdf, other

    cs.LG stat.ML

    Observation Noise and Initialization in Wide Neural Networks

    Authors: Sergio Calvo-Ordoñez, Jonathan Plenk, Richard Bergna, Alvaro Cartea, Jose Miguel Hernandez-Lobato, Konstantina Palla, Kamil Ciosek

    Abstract: Performing gradient descent in a wide neural network is equivalent to computing the posterior mean of a Gaussian Process with the Neural Tangent Kernel (NTK-GP), for a specific choice of prior mean and with zero observation noise. However, existing formulations of this result have two limitations: i) the resultant NTK-GP assumes no noise in the observed target variables, which can result in subopt… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

    Comments: Work under review, 22 pages

  5. arXiv:2405.20799  [pdf, other

    stat.ML cs.LG

    Rough Transformers: Lightweight and Continuous Time Series Modelling through Signature Patching

    Authors: Fernando Moreno-Pino, Álvaro Arroyo, Harrison Waldon, Xiaowen Dong, Álvaro Cartea

    Abstract: Time-series data in real-world settings typically exhibit long-range dependencies and are observed at non-uniform intervals. In these settings, traditional sequence-based recurrent models struggle. To overcome this, researchers often replace recurrent architectures with Neural ODE-based models to account for irregularly sampled data and use Transformer-based architectures to account for long-range… ▽ More

    Submitted 11 January, 2025; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: NeurIPS 2024 Conference (Camera Ready Version)

  6. arXiv:2403.10288  [pdf, other

    stat.ML cs.AI cs.LG

    Rough Transformers for Continuous and Efficient Time-Series Modelling

    Authors: Fernando Moreno-Pino, Álvaro Arroyo, Harrison Waldon, Xiaowen Dong, Álvaro Cartea

    Abstract: Time-series data in real-world medical settings typically exhibit long-range dependencies and are observed at non-uniform intervals. In such contexts, traditional sequence-based recurrent models struggle. To overcome this, researchers replace recurrent architectures with Neural ODE-based models to model irregularly sampled data and use Transformer-based architectures to account for long-range depe… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  7. arXiv:2312.05827  [pdf, other

    q-fin.TR cs.LG

    Detecting Toxic Flow

    Authors: Álvaro Cartea, Gerardo Duran-Martin, Leandro Sánchez-Betancourt

    Abstract: This paper develops a framework to predict toxic trades that a broker receives from her clients. Toxic trades are predicted with a novel online Bayesian method which we call the projection-based unification of last-layer and subspace estimation (PULSE). PULSE is a fast and statistically-efficient online procedure to train a Bayesian neural network sequentially. We employ a proprietary dataset of f… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 27 pages, 18 figures

  8. arXiv:2206.14666  [pdf, other

    cs.LG q-fin.CP q-fin.PM q-fin.RM q-fin.TR

    Conditionally Elicitable Dynamic Risk Measures for Deep Reinforcement Learning

    Authors: Anthony Coache, Sebastian Jaimungal, Álvaro Cartea

    Abstract: We propose a novel framework to solve risk-sensitive reinforcement learning (RL) problems where the agent optimises time-consistent dynamic spectral risk measures. Based on the notion of conditional elicitability, our methodology constructs (strictly consistent) scoring functions that are used as penalizers in the estimation procedure. Our contribution is threefold: we (i) devise an efficient appr… ▽ More

    Submitted 1 May, 2023; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: 41 pages, 7 figures