Skip to main content

Showing 1–4 of 4 results for author: Ferraz, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.18589  [pdf, other

    cs.CV

    Unified Uncertainty-Aware Diffusion for Multi-Agent Trajectory Modeling

    Authors: Guillem Capellera, Antonio Rubio, Luis Ferraz, Antonio Agudo

    Abstract: Multi-agent trajectory modeling has primarily focused on forecasting future states, often overlooking broader tasks like trajectory completion, which are crucial for real-world applications such as correcting tracking data. Existing methods also generally predict agents' states without offering any state-wise measure of uncertainty. Moreover, popular multi-modal sampling methods lack any error pro… ▽ More

    Submitted 29 March, 2025; v1 submitted 24 March, 2025; originally announced March 2025.

    Comments: Accepted to CVPR 2025 conference

  2. arXiv:2410.17785  [pdf, other

    cs.CV cs.MA

    TranSPORTmer: A Holistic Approach to Trajectory Understanding in Multi-Agent Sports

    Authors: Guillem Capellera, Luis Ferraz, Antonio Rubio, Antonio Agudo, Francesc Moreno-Noguer

    Abstract: Understanding trajectories in multi-agent scenarios requires addressing various tasks, including predicting future movements, imputing missing observations, inferring the status of unseen agents, and classifying different global states. Traditional data-driven approaches often handle these tasks separately with specialized models. We introduce TranSPORTmer, a unified transformer-based framework ca… ▽ More

    Submitted 9 November, 2024; v1 submitted 23 October, 2024; originally announced October 2024.

    Comments: Accepted to ACCV 2024

  3. arXiv:2406.19852  [pdf, other

    cs.CV cs.MA

    FootBots: A Transformer-based Architecture for Motion Prediction in Soccer

    Authors: Guillem Capellera, Luis Ferraz, Antonio Rubio, Antonio Agudo, Francesc Moreno-Noguer

    Abstract: Motion prediction in soccer involves capturing complex dynamics from player and ball interactions. We present FootBots, an encoder-decoder transformer-based architecture addressing motion prediction and conditioned motion prediction through equivariance properties. FootBots captures temporal and social dynamics using set attention blocks and multi-attention block decoder. Our evaluation utilizes t… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: Published as a conference paper at IEEE ICIP 2024

  4. arXiv:1412.6537  [pdf, other

    cs.CV

    Fracking Deep Convolutional Image Descriptors

    Authors: Edgar Simo-Serra, Eduard Trulls, Luis Ferraz, Iasonas Kokkinos, Francesc Moreno-Noguer

    Abstract: In this paper we propose a novel framework for learning local image descriptors in a discriminative manner. For this purpose we explore a siamese architecture of Deep Convolutional Neural Networks (CNN), with a Hinge embedding loss on the L2 distance between descriptors. Since a siamese architecture uses pairs rather than single image patches to train, there exist a large number of positive sample… ▽ More

    Submitted 25 February, 2015; v1 submitted 19 December, 2014; originally announced December 2014.