Showing 1–3 of 3 results for author: Santos-Villafranca, M

  1. arXiv:2506.06026

    cs.CV

    O-MaMa @ EgoExo4D Correspondence Challenge: Learning Object Mask Matching between Egocentric and Exocentric Views

    Authors: Lorenzo Mur-Labadia, Maria Santos-Villafranca, Alejandro Perez-Yus, Jesus Bermudez-Cameo, Ruben Martinez-Cantin, Jose J. Guerrero

    Abstract: The goal of the correspondence task is to segment specific objects across different views. This technical report re-defines cross-image segmentation by treating it as a mask matching task. Our method consists of: (1) A Mask-Context Encoder that pools dense DINOv2 semantic features to obtain discriminative object-level representations from FastSAM mask candidates, (2) an Ego$\leftrightarrow$Exo Cro…

    Submitted 6 June, 2025; originally announced June 2025.

  2. arXiv:2504.08578

    cs.CV

    Knowledge Distillation for Multimodal Egocentric Action Recognition Robust to Missing Modalities

    Authors: Maria Santos-Villafranca, Dustin Carrión-Ojeda, Alejandro Perez-Yus, Jesus Bermudez-Cameo, Jose J. Guerrero, Simone Schaub-Meyer

    Abstract: Action recognition is an essential task in egocentric vision due to its wide range of applications across many fields. While deep learning methods have been proposed to address this task, most rely on a single modality, typically video. However, including additional modalities may improve the robustness of the approaches to common issues in egocentric videos, such as blurriness and occlusions. Rec…

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: Project Page: https://visinf.github.io/KARMMA

  3. arXiv:2402.01456

    cs.CV

    Convolution kernel adaptation to calibrated fisheye

    Authors: Bruno Berenguel-Baeta, Maria Santos-Villafranca, Jesus Bermudez-Cameo, Alejandro Perez-Yus, Jose J. Guerrero

    Abstract: Convolution kernels are the basic structural component of convolutional neural networks (CNNs). In the last years there has been a growing interest in fisheye cameras for many applications. However, the radially symmetric projection model of these cameras produces high distortions that affect the performance of CNNs, especially when the field of view is very large. In this work, we tackle this pro…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Previously presented at BMVC: https://proceedings.bmvc2023.org/721/