Showing 1–50 of 111 results for author: Gonzalez, M

Searching in archive cs.
  1. arXiv:2507.04746  [pdf, ps, other]

    cs.CL

    A Tale of Two Scripts: Transliteration and Post-Correction for Judeo-Arabic

    Authors: Juan Moreno Gonzalez, Bashar Alhafni, Nizar Habash

    Abstract: Judeo-Arabic refers to Arabic variants historically spoken by Jewish communities across the Arab world, primarily during the Middle Ages. Unlike standard Arabic, it is written in Hebrew script by Jewish writers and for Jewish audiences. Transliterating Judeo-Arabic into Arabic script is challenging due to ambiguous letter mappings, inconsistent orthographic conventions, and frequent code-switching…

    Submitted 7 July, 2025; originally announced July 2025.

  2. arXiv:2507.00999  [pdf, ps, other]

    cs.CL

    La Leaderboard: A Large Language Model Leaderboard for Spanish Varieties and Languages of Spain and Latin America

    Authors: María Grandury, Javier Aula-Blasco, Júlia Falcão, Clémentine Fourrier, Miguel González, Gonzalo Martínez, Gonzalo Santamaría, Rodrigo Agerri, Nuria Aldama, Luis Chiruzzo, Javier Conde, Helena Gómez, Marta Guerrero, Guido Ivetta, Natalia López, Flor Miriam Plaza-del-Arco, María Teresa Martín-Valdivia, Helena Montoro, Carmen Muñoz, Pedro Reviriego, Leire Rosado, Alejandro Vaca, María Estrella Vallecillo-Rodríguez, Jorge Vallego, Irune Zubiaga

    Abstract: Leaderboards showcase the current capabilities and limitations of Large Language Models (LLMs). To motivate the development of LLMs that represent the linguistic and cultural diversity of the Spanish-speaking community, we present La Leaderboard, the first open-source leaderboard to evaluate generative LLMs in languages and language varieties of Spain and Latin America. La Leaderboard is a communi…

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: Accepted at ACL 2025 Main

  3. arXiv:2506.22439  [pdf, ps, other]

    cs.CL cs.AI

    Psycholinguistic Word Features: a New Approach for the Evaluation of LLMs Alignment with Humans

    Authors: Javier Conde, Miguel González, María Grandury, Gonzalo Martínez, Pedro Reviriego, Mar Brysbaert

    Abstract: The evaluation of LLMs has so far focused primarily on how well they can perform different tasks such as reasoning, question-answering, paraphrasing, or translating. For most of these tasks, performance can be measured with objective metrics, such as the number of correct answers. However, other language features are not easily quantified. For example, arousal, concreteness, or gender associated w…

    Submitted 29 May, 2025; originally announced June 2025.

    Comments: Accepted for the GEM2 workshop at ACL 2025

  4. arXiv:2506.17989  [pdf, ps, other]

    cs.LG

    Data Curation Matters: Model Collapse and Spurious Shift Performance Prediction from Training on Uncurated Text Embeddings

    Authors: Lucas Mattioli, Youness Ait Hadichou, Sabrina Chaouche, Martin Gonzalez

    Abstract: Training models on uncurated Text Embeddings (TEs) derived from raw tabular data can lead to a severe failure mode known as model collapse, where predictions converge to a single class regardless of input. By comparing models trained with identical hyper-parameter configurations on both raw tabular data and their TE-derived counterparts, we find that collapse is a consistent failure mode in the la…

    Submitted 22 June, 2025; originally announced June 2025.

    Comments: 37 pages. Multiple figures

  5. arXiv:2505.24802  [pdf, ps, other]

    cs.LG

    ByzFL: Research Framework for Robust Federated Learning

    Authors: Marc González, Rachid Guerraoui, Rafael Pinot, Geovani Rizk, John Stephan, François Taïani

    Abstract: We present ByzFL, an open-source Python library for developing and benchmarking robust federated learning (FL) algorithms. ByzFL provides a unified and extensible framework that includes implementations of state-of-the-art robust aggregators, a suite of configurable attacks, and tools for simulating a variety of FL scenarios, including heterogeneous data distributions, multiple training algorithms…

    Submitted 30 May, 2025; originally announced May 2025.

  6. arXiv:2505.18978  [pdf, other]

    cs.CL

    AI4Math: A Native Spanish Benchmark for University-Level Mathematical Reasoning in Large Language Models

    Authors: Miguel Angel Peñaloza Perez, Bruno Lopez Orozco, Jesus Tadeo Cruz Soto, Michelle Bruno Hernandez, Miguel Angel Alvarado Gonzalez, Sandra Malagon

    Abstract: Existing mathematical reasoning benchmarks are predominantly English-only or translation-based, which can introduce semantic drift and mask language-specific reasoning errors. To address this, we present AI4Math, a benchmark of 105 original university-level math problems natively authored in Spanish. The dataset spans seven advanced domains (Algebra, Calculus, Geometry, Probability, Number Theory,…

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 36 pages, 5 figures

    MSC Class: 68; ACM Class: I.2

  7. arXiv:2505.15988  [pdf, ps, other]

    cs.DC

    An Ecosystem of Services for FAIR Computational Workflows

    Authors: Sean R. Wilkinson, Johan Gustafsson, Finn Bacall, Khalid Belhajjame, Salvador Capella, Jose Maria Fernandez Gonzalez, Jacob Fosso Tande, Luiz Gadelha, Daniel Garijo, Patricia Grubel, Bjorn Grüning, Farah Zaib Khan, Sehrish Kanwal, Simone Leo, Stuart Owen, Luca Pireddu, Line Pouchard, Laura Rodríguez-Navas, Beatriz Serrano-Solano, Stian Soiland-Reyes, Baiba Vilne, Alan Williams, Merridee Ann Wouters, Frederik Coppens, Carole Goble

    Abstract: Computational workflows, regardless of their portability or maturity, represent major investments of both effort and expertise. They are first class, publishable research objects in their own right. They are key to sharing methodological know-how for reuse, reproducibility, and transparency. Consequently, the application of the FAIR principles to workflows is inevitable to enable them to be Findab…

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 41 pages, 4 figures, 3 tables; to appear as chapter in upcoming book

  8. arXiv:2505.12145  [pdf, ps, other]

    cs.SI

    Trajectory-Integrated Accessibility Analysis of Public Electric Vehicle Charging Stations

    Authors: Yi Ju, Jiaman Wu, Zhihan Su, Lunlong Li, Jinhua Zhao, Marta C. González, Scott J. Moura

    Abstract: Electric vehicle (EV) charging infrastructure is crucial for advancing EV adoption, managing charging loads, and ensuring equitable transportation electrification. However, there remains a notable gap in comprehensive accessibility metrics that integrate the mobility of the users. This study introduces a novel accessibility metric, termed Trajectory-Integrated Public EVCS Accessibility (TI-acs), a…

    Submitted 17 May, 2025; originally announced May 2025.

    Comments: 19 pages, 8 figures

  9. arXiv:2505.10862  [pdf, ps, other]

    cs.CL

    Have Multimodal Large Language Models (MLLMs) Really Learned to Tell the Time on Analog Clocks?

    Authors: Tairan Fu, Miguel González, Javier Conde, Elena Merino-Gómez, Pedro Reviriego

    Abstract: Multimodal Large Language Models which can answer complex questions on an image struggle to tell the time on analog clocks. This is probably due to the lack of images with clocks at different times in their training set. In this work we explore this issue with one of the latest MLLMs: GPT-4.1 to understand why MLLMs fail to tell the time and whether fine-tuning can solve the problem. The results s…

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: 6 pages, 5 figures, 2 tables

    ACM Class: I.2.7

  10. arXiv:2505.09319  [pdf, other]

    cs.PF

    Statistical Modeling and Uncertainty Estimation of LLM Inference Systems

    Authors: Kaustabha Ray, Nelson Mimura Gonzalez, Bruno Wassermann, Rachel Tzoref-Brill, Dean H. Lorenz

    Abstract: Large Language Model (LLM) inference systems present significant challenges in statistical performance characterization due to dynamic workload variations, diverse hardware architectures, and complex interactions between model size, batch processing, and throughput requirements. Accurate statistical characterization enables better workload scheduling, adaptive resource provisioning, and cost-aware…

    Submitted 14 May, 2025; originally announced May 2025.

  11. arXiv:2505.05331  [pdf, ps, other]

    cs.CV q-bio.NC stat.CO

    Aesthetics Without Semantics

    Authors: C. Alejandro Parraga, Olivier Penacchio, Marcos Muňoz Gonzalez, Bogdan Raducanu, Xavier Otazu

    Abstract: While it is easy for human observers to judge an image as beautiful or ugly, aesthetic decisions result from a combination of entangled perceptual and cognitive (semantic) factors, making the understanding of aesthetic judgements particularly challenging from a scientific point of view. Furthermore, our research shows a prevailing bias in current databases, which include mostly beautiful images, f…

    Submitted 12 June, 2025; v1 submitted 8 May, 2025; originally announced May 2025.

    Comments: Parts of this work were presented in abstract format at the Vision Science of Art Conference (VSAC2016), the Iberian Conference on Perception (CIP2022), and the European Conference on Visual Perception (ECVP2022). See Perception 51, No1 (Suppl.) pp139, 2022.

  12. arXiv:2504.16609  [pdf, other]

    cs.IR

    Information Leakage of Sentence Embeddings via Generative Embedding Inversion Attacks

    Authors: Antonios Tragoudaras, Theofanis Aslanidis, Emmanouil Georgios Lionis, Marina Orozco González, Panagiotis Eustratiadis

    Abstract: Text data are often encoded as dense vectors, known as embeddings, which capture semantic, syntactic, contextual, and domain-specific information. These embeddings, widely adopted in various applications, inherently contain rich information that may be susceptible to leakage under certain attacks. The GEIA framework highlights vulnerabilities in sentence embeddings, demonstrating that they can rev…

    Submitted 23 April, 2025; originally announced April 2025.

    Comments: This is a preprint of our paper accepted at SIGIR 2025

  13. arXiv:2504.01208  [pdf, other]

    eess.IV cs.AI cs.CV

    Lightweight Deep Models for Dermatological Disease Detection: A Study on Instance Selection and Channel Optimization

    Authors: Ian Mateos Gonzalez, Estefani Jaramilla Nava, Abraham Sánchez Morales, Jesús García-Ramírez, Ricardo Ramos-Aguilar

    Abstract: The identification of dermatological disease is an important problem in Mexico according to different studies. Several works in the literature use datasets from different repositories without studying the behavior of the data, especially in the medical imaging domain. In this work, we propose a methodology to preprocess the dermaMNIST dataset in order to improve its quality for the classification stage…

    Submitted 1 April, 2025; originally announced April 2025.

    Comments: Submitted to Mexican Conference on Pattern Recognition 2025

  14. arXiv:2504.00979  [pdf]

    cs.CV

    Artificial Intelligence-Assisted Prostate Cancer Diagnosis for Reduced Use of Immunohistochemistry

    Authors: Anders Blilie, Nita Mulliqi, Xiaoyi Ji, Kelvin Szolnoky, Sol Erika Boman, Matteo Titus, Geraldine Martinez Gonzalez, José Asenjo, Marcello Gambacorta, Paolo Libretti, Einar Gudlaugsson, Svein R. Kjosavik, Lars Egevad, Emiel A. M. Janssen, Martin Eklund, Kimmo Kartasalo

    Abstract: Prostate cancer diagnosis heavily relies on histopathological evaluation, which is subject to variability. While immunohistochemical staining (IHC) assists in distinguishing benign from malignant tissue, it involves increased work, higher costs, and diagnostic delays. Artificial intelligence (AI) presents a promising solution to reduce reliance on IHC by accurately classifying atypical glands and…

    Submitted 31 March, 2025; originally announced April 2025.

    Comments: 29 pages, 5 figures and 3 tables

  15. arXiv:2502.21264  [pdf]

    cs.CV cs.AI

    Foundation Models -- A Panacea for Artificial Intelligence in Pathology?

    Authors: Nita Mulliqi, Anders Blilie, Xiaoyi Ji, Kelvin Szolnoky, Henrik Olsson, Sol Erika Boman, Matteo Titus, Geraldine Martinez Gonzalez, Julia Anna Mielcarz, Masi Valkonen, Einar Gudlaugsson, Svein R. Kjosavik, José Asenjo, Marcello Gambacorta, Paolo Libretti, Marcin Braun, Radzislaw Kordek, Roman Łowicki, Kristina Hotakainen, Päivi Väre, Bodil Ginnerup Pedersen, Karina Dalsgaard Sørensen, Benedicte Parm Ulhøi, Pekka Ruusuvuori, Brett Delahunt, et al. (6 additional authors not shown)

    Abstract: The role of artificial intelligence (AI) in pathology has evolved from aiding diagnostics to uncovering predictive morphological patterns in whole slide images (WSIs). Recently, foundation models (FMs) leveraging self-supervised pre-training have been widely advocated as a universal solution for diverse downstream tasks. However, open questions remain about their clinical applicability and general…

    Submitted 3 March, 2025; v1 submitted 28 February, 2025; originally announced February 2025.

    Comments: 50 pages, 15 figures and an appendix (study protocol) which is previously published, see https://doi.org/10.1101/2024.07.04.24309948; updated authors list format

  16. Speed and Conversational Large Language Models: Not All Is About Tokens per Second

    Authors: Javier Conde, Miguel González, Pedro Reviriego, Zhen Gao, Shanshan Liu, Fabrizio Lombardi

    Abstract: The speed of open-weights large language models (LLMs) and its dependency on the task at hand, when run on GPUs, is studied to present a comparative analysis of the speed of the most popular open LLMs.

    Submitted 23 February, 2025; originally announced February 2025.

    Journal ref: Computer (Volume: 57, Issue: 8, August 2024)

  17. arXiv:2502.13231  [pdf, other]

    math.CA cs.DM

    Las funciones booleanas y el lema de Bonami

    Authors: María José González, Paul MacManus, María Cristina Pereyra

    Abstract: In this expository article, we study the relation between the boolean functions and the hypercontractivity theorems of Aline Bonami. We focus on the social choice theory, and present some of the most important results in the area, such as the Friedgut-Kalai-Naor (FKN) and the Kahn-Kalai-Linial (KKL) theorems, and the famous Fourier Entropy/Influence conjecture. -- En este artículo expositivo e…

    Submitted 18 February, 2025; originally announced February 2025.

    Comments: 37 pages, in Spanish, 2 photos

    MSC Class: 43.01; 68R01

    Journal ref: La Gaceta de la RSME, Vol. 28 (2025), Núm. 1, Págs. 51-87

  18. arXiv:2501.06561  [pdf, other]

    cs.AI

    Where to Go Next Day: Multi-scale Spatial-Temporal Decoupled Model for Mid-term Human Mobility Prediction

    Authors: Zongyuan Huang, Weipeng Wang, Shaoyu Huang, Marta C. Gonzalez, Yaohui Jin, Yanyan Xu

    Abstract: Predicting individual mobility patterns is crucial across various applications. While current methods mainly focus on predicting the next location for personalized services like recommendations, they often fall short in supporting broader applications such as traffic management and epidemic control, which require longer period forecasts of human mobility. This study addresses mid-term mobility pre…

    Submitted 11 January, 2025; originally announced January 2025.

  19. arXiv:2412.17440  [pdf, other]

    cs.AI

    The Role of XAI in Transforming Aeronautics and Aerospace Systems

    Authors: Francisco Javier Cantero Zorita, Mikel Galafate, Javier M. Moguerza, Isaac Martín de Diego, M. Teresa Gonzalez, Gema Gutierrez Peña

    Abstract: Recent advancements in Artificial Intelligence (AI) have transformed decision-making in aeronautics and aerospace. These advancements in AI have brought with them the need to understand the reasons behind the predictions generated by AI systems and models, particularly by professionals in these sectors. In this context, the emergence of eXplainable Artificial Intelligence (XAI) has helped bridge t…

    Submitted 23 December, 2024; originally announced December 2024.

  20. arXiv:2410.17648  [pdf, other]

    cs.LG

    Towards Active Participant Centric Vertical Federated Learning: Some Representations May Be All You Need

    Authors: Jon Irureta, Jon Imaz, Aizea Lojo, Javier Fernandez-Marques, Marco González, Iñigo Perona

    Abstract: Existing Vertical FL (VFL) methods often struggle with realistic and unaligned data partitions, and incur high communication costs and significant operational complexity. This work introduces a novel approach to VFL, Active Participant Centric VFL (APC-VFL), that excels in scenarios where data samples among participants are partially aligned at training. Among its strengths, APC-VFL only requi…

    Submitted 19 February, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

  21. arXiv:2407.18745  [pdf, other]

    cs.LG

    FairAIED: Navigating Fairness, Bias, and Ethics in Educational AI Applications

    Authors: Sribala Vidyadhari Chinta, Zichong Wang, Zhipeng Yin, Nhat Hoang, Matthew Gonzalez, Tai Le Quy, Wenbin Zhang

    Abstract: The integration of Artificial Intelligence (AI) into education has transformative potential, providing tailored learning experiences and creative instructional approaches. However, the inherent biases in AI algorithms hinder this improvement by unintentionally perpetuating prejudice against specific demographics, especially in human-centered applications like education. This survey delves deeply i…

    Submitted 26 July, 2024; originally announced July 2024.

  22. Recursive InPainting (RIP): how much information is lost under recursive inferences?

    Authors: Javier Conde, Miguel González, Gonzalo Martínez, Fernando Moral, Elena Merino-Gómez, Pedro Reviriego

    Abstract: The rapid adoption of generative artificial intelligence (AI) is accelerating content creation and modification. For example, variations of a given content, be it text or images, can be created almost instantly and at a low cost. This will soon lead to the majority of text and images being created directly by AI models or by humans assisted by AI. This poses new risks; for example, AI-generated co…

    Submitted 25 May, 2025; v1 submitted 27 June, 2024; originally announced July 2024.

    Comments: AI & Soc (2025)

  23. arXiv:2406.12346  [pdf, other]

    cs.AR

    Towards the Certification of Hybrid Architectures: Analysing Interference on Hardware Accelerators through PML

    Authors: Benjamin Lesage, Frédéric Boniol, Kevin Delmas, Adrien Gauffriau, Alfonso Mascarenas Gonzalez, Claire Pagetti

    Abstract: The emergence of Deep Neural Network (DNN) and machine learning-based applications paved the way for a new generation of hybrid hardware platforms. Hybrid platforms embed several cores and accelerators in a small package. However, in order to satisfy the Size, Weight and Power (SWaP) constraints, limited and shared resources are integrated. This paper presents an overview of the standards applicab…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 12th European Congress on Embedded Real Time Software and Systems (ERTS 2024), Jun 2024, Toulouse, France

  24. arXiv:2405.10054  [pdf, other]

    cs.LG eess.SY

    A finite-sample generalization bound for stable LPV systems

    Authors: Daniel Racz, Martin Gonzalez, Mihaly Petreczky, Andras Benczur, Balint Daroczy

    Abstract: One of the main theoretical challenges in learning dynamical systems from data is providing upper bounds on the generalization error, that is, the difference between the expected prediction error and the empirical prediction error measured on some finite sample. In machine learning, a popular class of such bounds are the so-called Probably Approximately Correct (PAC) bounds. In this paper, we deri…

    Submitted 21 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: 8 pages, 1 figure, under review

    MSC Class: 68; ACM Class: I.2.0

  25. arXiv:2404.16208  [pdf, other]

    cs.ET cs.AI

    GPU-RANC: A CUDA Accelerated Simulation Framework for Neuromorphic Architectures

    Authors: Sahil Hassan, Michael Inouye, Miguel C. Gonzalez, Ilkin Aliyev, Joshua Mack, Maisha Hafiz, Ali Akoglu

    Abstract: Open-source simulation tools play a crucial role for neuromorphic application engineers and hardware architects to investigate performance bottlenecks and explore design optimizations before committing to silicon. Reconfigurable Architecture for Neuromorphic Computing (RANC) is one such tool that offers the ability to execute pre-trained Spiking Neural Network (SNN) models within a unified ecosystem t…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Accepted for publication in Neuro-Inspired Computational Elements (NICE) Workshop 2024

  26. Open Conversational LLMs do not know most Spanish words

    Authors: Javier Conde, Miguel González, Nina Melero, Raquel Ferrando, Gonzalo Martínez, Elena Merino-Gómez, José Alberto Hernández, Pedro Reviriego

    Abstract: The growing interest in Large Language Models (LLMs) and in particular in conversational models with which users can interact has led to the development of a large number of open-source chat LLMs. These models are evaluated on a wide range of benchmarks to assess their capabilities in answering questions or solving problems on almost any possible topic or to test their ability to reason or interpr…

    Submitted 24 September, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: Procesamiento del Lenguaje Natural, 73, 95-108

    Journal ref: Procesamiento del Lenguaje Natural, n. 73, 2024. http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6603

  27. arXiv:2402.15243  [pdf, other]

    cs.RO eess.SY

    Safety-Conscious Pushing on Diverse Oriented Surfaces with Underactuated Aerial Vehicles

    Authors: Tong Hui, Manuel J. Fernandez Gonzalez, Matteo Fumagalli

    Abstract: Pushing tasks performed by aerial manipulators can be used for contact-based industrial inspections. Underactuated aerial vehicles are widely employed in aerial manipulation due to their widespread availability and relatively low cost. Industrial infrastructures often consist of diverse oriented work surfaces. When interacting with such surfaces, the coupled gravity compensation and interaction fo…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: Accepted to the 2024 IEEE International Conference on Robotics and Automation (ICRA2024)

  28. arXiv:2401.16247  [pdf, other]

    cs.CL cs.CY

    Towards Red Teaming in Multimodal and Multilingual Translation

    Authors: Christophe Ropers, David Dale, Prangthip Hansanti, Gabriel Mejia Gonzalez, Ivan Evtimov, Corinne Wong, Christophe Touret, Kristina Pereyra, Seohyun Sonia Kim, Cristian Canton Ferrer, Pierre Andrews, Marta R. Costa-jussà

    Abstract: Assessing performance in Natural Language Processing is becoming increasingly complex. One particular challenge is the potential for evaluation datasets to overlap with training data, either directly or indirectly, which can lead to skewed results and overestimation of model performance. As a consequence, human evaluation is gaining increasing interest as a means to assess the performance and reli…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2312.05187

    ACM Class: I.2.7

  29. arXiv:2312.05187  [pdf, other]

    cs.CL cs.SD eess.AS

    Seamless: Multilingual Expressive and Streaming Speech Translation

    Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, et al. (40 additional authors not shown)

    Abstract: Large-scale automatic speech translation systems today lack key features that help machine-mediated communication feel seamless when compared to human-to-human dialogue. In this work, we introduce a family of models that enable end-to-end expressive and multilingual translations in a streaming fashion. First, we contribute an improved version of the massively multilingual and multimodal SeamlessM4…

    Submitted 8 December, 2023; originally announced December 2023.

  30. arXiv:2311.18491  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs

    Authors: Violeta Menéndez González, Andrew Gilbert, Graeme Phillipson, Stephen Jolly, Simon Hadfield

    Abstract: In the field of media production, video editing techniques play a pivotal role. Recent approaches have had great success at performing novel view image synthesis of static scenes. But adding temporal information adds an extra layer of complexity. Previous models have focused on implicitly representing static and dynamic scenes using NeRF. These models achieve impressive results but are costly at t…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: VUA BMVC 2023

  31. arXiv:2311.11742  [pdf, other]

    eess.IV cs.CV

    Fuzzy Information Seeded Region Growing for Automated Lesions After Stroke Segmentation in MR Brain Images

    Authors: Mario Pascual González

    Abstract: In the realm of medical imaging, precise segmentation of stroke lesions from brain MRI images stands as a critical challenge with significant implications for patient diagnosis and treatment. Addressing this, our study introduces an innovative approach using a Fuzzy Information Seeded Region Growing (FISRG) algorithm. Designed to effectively delineate the complex and irregular boundaries of stroke…

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 10 pages, 14 figures. Associated code and data available at: https://github.com/Mawio02/FISRG-for-Automated-Lesion-After-Stroke-Segmentation-in-MRI

    MSC Class: 92C55

  32. arXiv:2308.16599  [pdf, other]

    cs.LG physics.soc-ph

    Using machine learning to understand causal relationships between urban form and travel CO2 emissions across continents

    Authors: Felix Wagner, Florian Nachtigall, Lukas Franken, Nikola Milojevic-Dupont, Rafael H. M. Pereira, Nicolas Koch, Jakob Runge, Marta Gonzalez, Felix Creutzig

    Abstract: Climate change mitigation in urban mobility requires policies reconfiguring urban form to increase accessibility and facilitate low-carbon modes of transport. However, current policy research has insufficiently assessed urban form effects on car travel at three levels: (1) Causality -- Can causality be established beyond theoretical and correlation-based analyses? (2) Generalizability -- Do relati…

    Submitted 15 December, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: 32 pages, 24 figures, 6 tables

  33. arXiv:2308.11596  [pdf, other]

    cs.CL

    SeamlessM4T: Massively Multilingual & Multimodal Machine Translation

    Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Cora Meglioli, David Dale, Ning Dong, Paul-Ambroise Duquenne, Hady Elsahar, Hongyu Gong, Kevin Heffernan, John Hoffman, Christopher Klaiber, Pengwei Li, Daniel Licht, Jean Maillard, Alice Rakotoarison, Kaushik Ram Sadagopan, Guillaume Wenzek, Ethan Ye, Bapi Akula, Peng-Jen Chen, Naji El Hachem, Brian Ellis, Gabriel Mejia Gonzalez, Justin Haaheim, et al. (43 additional authors not shown)

    Abstract: What does it take to create the Babel Fish, a tool that can help individuals translate speech between any two languages? While recent breakthroughs in text-based models have pushed machine translation coverage beyond 200 languages, unified speech-to-speech translation models have yet to achieve similar strides. More specifically, conventional speech-to-speech translation systems rely on cascaded s…

    Submitted 24 October, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    ACM Class: I.2.7

  34. arXiv:2308.02534  [pdf, other]

    cs.CV cs.AI

    Exploring the Role of Explainability in AI-Assisted Embryo Selection

    Authors: Lucia Urcelay, Daniel Hinjos, Pablo A. Martin-Torres, Marta Gonzalez, Marta Mendez, Salva Cívico, Sergio Álvarez-Napagao, Dario Garcia-Gasulla

    Abstract: In Vitro Fertilization is among the most widespread treatments for infertility. One of its main challenges is the evaluation and selection of embryos for implantation, a process with large inter- and intra-clinician variability. Deep learning based methods are gaining attention, but their opaque nature compromises their acceptance in the clinical context, where transparency in the decision making i…

    Submitted 1 August, 2023; originally announced August 2023.

  35. arXiv:2306.14258  [pdf, ps, other]

    cs.LG math.OC

    A Neural RDE approach for continuous-time non-Markovian stochastic control problems

    Authors: Melker Hoglund, Emilio Ferrucci, Camilo Hernandez, Aitor Muguruza Gonzalez, Cristopher Salvi, Leandro Sanchez-Betancourt, Yufei Zhang

    Abstract: We propose a novel framework for solving continuous-time non-Markovian stochastic control problems by means of neural rough differential equations (Neural RDEs) introduced in Morrill et al. (2021). Non-Markovianity naturally arises in control problems due to the time delay effects in the system coefficients or the driving noises, which leads to optimal control strategies depending explicitly on th…

    Submitted 25 June, 2023; originally announced June 2023.

    Comments: Accepted at ICML 2023, Workshop on New Frontiers in Learning, Control, and Dynamical Systems

  36. arXiv:2306.06194  [pdf, other]

    cs.LG

    Share, Collaborate, Benchmark: Advancing Travel Demand Research through rigorous open-source collaboration

    Authors: Juan D. Caicedo, Carlos Guirado, Marta C. González, Joan L. Walker

    Abstract: This research foregrounds general practices in travel demand research, emphasizing the need to change our ways. A critical barrier preventing travel demand literature from effectively informing policy is the volume of publications without clear, consolidated benchmarks, making it difficult for researchers and policymakers to gather insights and use models to guide decision-making. By emphasizing r…

    Submitted 14 July, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: 18 pages, 8 figures

  37. arXiv:2305.14267  [pdf, other]

    cs.LG cs.CV math.NA

    SEEDS: Exponential SDE Solvers for Fast High-Quality Sampling from Diffusion Models

    Authors: Martin Gonzalez, Nelson Fernandez, Thuy Tran, Elies Gherbi, Hatem Hajri, Nader Masmoudi

    Abstract: A potent class of generative models known as Diffusion Probabilistic Models (DPMs) has become prominent. A forward diffusion process gradually adds noise to data, while a model learns to gradually denoise. Sampling from pre-trained DPMs is obtained by solving differential equations (DE) defined by the learnt model, a process which has been shown to be prohibitively slow. Numerous efforts on speeding-up…

    Submitted 26 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 60 pages. Camera-Ready version for the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

    MSC Class: I.2.6

  38. arXiv:2211.07301  [pdf, other

    cs.CV cs.GR cs.LG

    SVS: Adversarial refinement for sparse novel view synthesis

    Authors: Violeta Menéndez González, Andrew Gilbert, Graeme Phillipson, Stephen Jolly, Simon Hadfield

    Abstract: This paper proposes Sparse View Synthesis. This is a view synthesis problem where the number of reference views is limited, and the baseline between target and reference view is significant. Under these conditions, current radiance field methods fail catastrophically due to inescapable artifacts such as 3D floating blobs, blurring and structural duplication, whenever the number of reference views is…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: BMVC 2022

  39. arXiv:2210.10865  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Robotic Table Wiping via Reinforcement Learning and Whole-body Trajectory Optimization

    Authors: Thomas Lew, Sumeet Singh, Mario Prats, Jeffrey Bingham, Jonathan Weisz, Benjie Holson, Xiaohan Zhang, Vikas Sindhwani, Yao Lu, Fei Xia, Peng Xu, Tingnan Zhang, Jie Tan, Montserrat Gonzalez

    Abstract: We propose a framework to enable multipurpose assistive mobile robots to autonomously wipe tables to clean spills and crumbs. This problem is challenging, as it requires planning wiping actions while reasoning over uncertain latent dynamics of crumbs and spills captured via high-dimensional visual observations. Simultaneously, we must guarantee constraint satisfaction to enable safe deployment in…

    Submitted 19 October, 2022; originally announced October 2022.

  40. arXiv:2210.09817  [pdf, other

    cs.LG

    Universal hidden monotonic trend estimation with contrastive learning

    Authors: Edouard Pineau, Sébastien Razakarivony, Mauricio Gonzalez, Anthony Schrapffer

    Abstract: In this paper, we describe a universal method for extracting the underlying monotonic trend factor from time series data. We propose an approach related to the Mann-Kendall test, a standard monotonic trend detection method, and call it contrastive trend estimation (CTE). We show that the CTE method identifies any hidden trend underlying temporal data while avoiding the standard assumptions used for…

    Submitted 23 April, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

  41. arXiv:2209.07888  [pdf, other

    cs.CV cs.RO

    TwistSLAM++: Fusing multiple modalities for accurate dynamic semantic SLAM

    Authors: Mathieu Gonzalez, Eric Marchand, Amine Kacete, Jérôme Royan

    Abstract: Most classical SLAM systems rely on the static scene assumption, which limits their applicability in real-world scenarios. Recent SLAM frameworks have been proposed to simultaneously track the camera and moving objects. However, they are often unable to estimate the canonical pose of the objects and exhibit a low object tracking accuracy. To solve this problem we propose TwistSLAM++, a semantic, dy…

    Submitted 22 March, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

  42. arXiv:2207.04672  [pdf

    cs.CL cs.AI

    No Language Left Behind: Scaling Human-Centered Machine Translation

    Authors: NLLB Team, Marta R. Costa-jussà, James Cross, Onur Çelebi, Maha Elbayad, Kenneth Heafield, Kevin Heffernan, Elahe Kalbassi, Janice Lam, Daniel Licht, Jean Maillard, Anna Sun, Skyler Wang, Guillaume Wenzek, Al Youngblood, Bapi Akula, Loic Barrault, Gabriel Mejia Gonzalez, Prangthip Hansanti, John Hoffman, Semarley Jarrett, Kaushik Ram Sadagopan, Dirk Rowe, Shannon Spruit, Chau Tran, et al. (14 additional authors not shown)

    Abstract: Driven by the goal of eradicating language barriers on a global scale, machine translation has solidified itself as a key focus of artificial intelligence research today. However, such efforts have coalesced around a small subset of languages, leaving behind the vast majority of mostly low-resource languages. What does it take to break the 200 language barrier while ensuring safe, high quality res…

    Submitted 25 August, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: 190 pages

    MSC Class: 68T50 ACM Class: I.2.7

  43. arXiv:2207.02132  [pdf, other

    cs.LG

    Deterministic Decoupling of Global Features and its Application to Data Analysis

    Authors: Eduardo Martinez-Enriquez, Maria del Mar Gonzalez, Javier Portilla

    Abstract: We introduce a method for deterministic decoupling of global features and show its applicability to improve data analysis performance, as well as to open new avenues for feature transfer. We propose a new formalism that is based on defining transformations on submanifolds, by following trajectories along the feature gradients. Through these transformations we define a normalization that, we demons…

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: 29 pages, 12 figures

    ACM Class: I.4.7; I.5.1

  44. arXiv:2206.08237  [pdf, other

    cs.LG

    Noisy Learning for Neural ODEs Acts as a Robustness Locus Widening

    Authors: Martin Gonzalez, Hatem Hajri, Loic Cantat, Mihaly Petreczky

    Abstract: We investigate the problems and challenges of evaluating the robustness of Differential Equation-based (DE) networks against synthetic distribution shifts. We propose a novel and simple accuracy metric which can be used to evaluate intrinsic robustness and to validate dataset corruption simulators. We also propose methodology recommendations, intended for evaluating the many faces of neural DEs' r…

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: Accepted at ICML 2022 Workshop "PODS"

    ACM Class: I.2.6

  45. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  46. Realization Theory Of Recurrent Neural ODEs Using Polynomial System Embeddings

    Authors: Martin Gonzalez, Thibault Defourneau, Hatem Hajri, Mihaly Petreczky

    Abstract: In this paper we show that neural ODE analogs of recurrent (ODE-RNN) and Long Short-Term Memory (ODE-LSTM) networks can be algorithmically embedded into the class of polynomial systems. This embedding preserves input-output behavior and can suitably be extended to other neural DE architectures. We then use realization theory of polynomial systems to provide necessary conditions for an input-outp…

    Submitted 1 August, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: 10 pages. Corrected typos and added references

    Journal ref: Systems & Control Letters 173 (2023)

  47. arXiv:2205.07014  [pdf, other

    cs.CV cs.GR cs.LG

    SaiNet: Stereo aware inpainting behind objects with generative networks

    Authors: Violeta Menéndez González, Andrew Gilbert, Graeme Phillipson, Stephen Jolly, Simon Hadfield

    Abstract: In this work, we present an end-to-end network for stereo-consistent image inpainting with the objective of inpainting large missing regions behind objects. The proposed model consists of an edge-guided UNet-like network using Partial Convolutions. We enforce multi-view stereo consistency by introducing a disparity loss. More importantly, we develop a training scheme where the model is learned fro…

    Submitted 14 May, 2022; originally announced May 2022.

    Comments: Presented at AI4CC workshop at CVPR

  48. TwistSLAM: Constrained SLAM in Dynamic Environment

    Authors: Mathieu Gonzalez, Eric Marchand, Amine Kacete, Jérôme Royan

    Abstract: Classical visual simultaneous localization and mapping (SLAM) algorithms usually assume the environment to be rigid. This assumption limits the applicability of those algorithms as they are unable to accurately estimate the camera poses and world structure in real-life scenes containing moving objects (e.g. cars, bikes, pedestrians). To tackle this issue, we propose TwistSLAM: a semantic, dy…

    Submitted 27 September, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: This work has been accepted at IEEE Robotics and Automation Letters

  49. arXiv:2112.11853  [pdf, other

    cs.CV

    Geodesic squared exponential kernel for non-rigid shape registration

    Authors: Florent Jousse, Xavier Pennec, Hervé Delingette, Matilde Gonzalez

    Abstract: This work addresses the problem of non-rigid registration of 3D scans, which is at the core of shape modeling techniques. Firstly, we propose a new kernel based on geodesic distances for the Gaussian Process Morphable Models (GPMMs) framework. The use of geodesic distances in the kernel makes it better adapted to the topological and geometric characteristics of the surface and leads to more realis…

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: 2021 16th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021) Proceedings, Dec 2021, Jodhpur, India

  50. arXiv:2110.14122  [pdf, other

    stat.ML cs.IT cs.LG

    Data-Driven Representations for Testing Independence: Modeling, Analysis and Connection with Mutual Information Estimation

    Authors: Mauricio E. Gonzalez, Jorge F. Silva, Miguel Videla, Marcos E. Orchard

    Abstract: This work addresses testing the independence of two continuous and finite-dimensional random variables from the design of a data-driven partition. The empirical log-likelihood statistic is adopted to approximate the sufficient statistics of an oracle test against independence (that knows the two hypotheses). It is shown that approximating the sufficient statistics of the oracle test offers a learn…

    Submitted 26 October, 2021; originally announced October 2021.