Skip to main content

Showing 1–50 of 131 results for author: Gómez, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.16501  [pdf, ps, other

    cs.PF

    Performance of Confidential Computing GPUs

    Authors: Antonio Martínez Ibarra, Julian James Stephen, Aurora González Vidal, K. R. Jayaram, Antonio Fernando Skarmeta Gómez

    Abstract: This work examines latency, throughput, and other metrics when performing inference on confidential GPUs. We explore different traffic patterns and scheduling strategies using a single Virtual Machine with one NVIDIA H100 GPU, to perform relaxed batch inferences on multiple Large Language Models (LLMs), operating under the constraint of swapping models in and out of memory, which necessitates effi… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: 6 pages, 7 tables. Accepted in conference IEEE ICDCS 2025

  2. arXiv:2505.08751  [pdf, other

    cs.CL cs.CV cs.LG

    Aya Vision: Advancing the Frontier of Multilingual Multimodality

    Authors: Saurabh Dash, Yiyang Nan, John Dang, Arash Ahmadian, Shivalika Singh, Madeline Smith, Bharat Venkitesh, Vlad Shmyhlo, Viraat Aryabumi, Walter Beller-Morales, Jeremy Pekmez, Jason Ozuzu, Pierre Richemond, Acyr Locatelli, Nick Frosst, Phil Blunsom, Aidan Gomez, Ivan Zhang, Marzieh Fadaee, Manoj Govindassamy, Sudip Roy, Matthias Gallé, Beyza Ermis, Ahmet Üstün, Sara Hooker

    Abstract: Building multimodal language models is fundamentally challenging: it requires aligning vision and language modalities, curating high-quality instruction data, and avoiding the degradation of existing text-only capabilities once vision is introduced. These difficulties are further magnified in the multilingual setting, where the need for multimodal data in different languages exacerbates existing d… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

  3. arXiv:2505.06927  [pdf, ps, other

    math.OC cs.LG stat.ML

    Stability Regularized Cross-Validation

    Authors: Ryan Cory-Wright, Andrés Gómez

    Abstract: We revisit the problem of ensuring strong test-set performance via cross-validation. Motivated by the generalization theory literature, we propose a nested k-fold cross-validation scheme that selects hyperparameters by minimizing a weighted sum of the usual cross-validation metric and an empirical model-stability measure. The weight on the stability term is itself chosen via a nested cross-validat… ▽ More

    Submitted 11 May, 2025; originally announced May 2025.

    Comments: Some of this material previously appeared in 2306.14851v2, which we have split into two papers (this one and 2306.14851v3), because it contained two ideas that need separate papers

  4. arXiv:2505.05857  [pdf, ps, other

    cs.LG math.OC stat.ML

    Mixed-Integer Optimization for Responsible Machine Learning

    Authors: Nathan Justin, Qingshi Sun, Andrés Gómez, Phebe Vayanos

    Abstract: In the last few decades, Machine Learning (ML) has achieved significant success across domains ranging from healthcare, sustainability, and the social sciences, to criminal justice and finance. But its deployment in increasingly sophisticated, critical, and sensitive areas affecting individuals, the groups they belong to, and society as a whole raises critical concerns around fairness, transparenc… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

    Comments: 56 pages, 10 figures

  5. arXiv:2504.07910  [pdf, other

    cs.LG

    Hodge Laplacians and Hodge Diffusion Maps

    Authors: Alvaro Almeida Gomez, Jorge Duque Franco

    Abstract: We introduce Hodge Diffusion Maps, a novel manifold learning algorithm designed to analyze and extract topological information from high-dimensional data-sets. This method approximates the exterior derivative acting on differential forms, thereby providing an approximation of the Hodge Laplacian operator. Hodge Diffusion Maps extend existing non-linear dimensionality reduction techniques, includin… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: 53 Pages, comments are welcome!

    MSC Class: 68P05; 68T10; 68T45; 68W25

  6. arXiv:2504.00698  [pdf

    cs.CL cs.AI cs.LG

    Command A: An Enterprise-Ready Large Language Model

    Authors: Team Cohere, :, Aakanksha, Arash Ahmadian, Marwan Ahmed, Jay Alammar, Milad Alizadeh, Yazeed Alnumay, Sophia Althammer, Arkady Arkhangorodsky, Viraat Aryabumi, Dennis Aumiller, Raphaël Avalos, Zahara Aviv, Sammie Bae, Saurabh Baji, Alexandre Barbet, Max Bartolo, Björn Bebensee, Neeral Beladia, Walter Beller-Morales, Alexandre Bérard, Andrew Berneshawi, Anna Bialas, Phil Blunsom , et al. (205 additional authors not shown)

    Abstract: In this report we describe the development of Command A, a powerful large language model purpose-built to excel at real-world enterprise use cases. Command A is an agent-optimised and multilingual-capable model, with support for 23 languages of global business, and a novel hybrid architecture balancing efficiency with top of the range performance. It offers best-in-class Retrieval Augmented Genera… ▽ More

    Submitted 14 April, 2025; v1 submitted 1 April, 2025; originally announced April 2025.

    Comments: 55 pages

  7. arXiv:2503.22357  [pdf, other

    cs.CV

    EchoFlow: A Foundation Model for Cardiac Ultrasound Image and Video Generation

    Authors: Hadrien Reynaud, Alberto Gomez, Paul Leeson, Qingjie Meng, Bernhard Kainz

    Abstract: Advances in deep learning have significantly enhanced medical image analysis, yet the availability of large-scale medical datasets remains constrained by patient privacy concerns. We present EchoFlow, a novel framework designed to generate high-quality, privacy-preserving synthetic echocardiogram images and videos. EchoFlow comprises four key components: an adversarial variational autoencoder for… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

    Comments: This work has been submitted to the IEEE for possible publication

  8. arXiv:2503.12012  [pdf, other

    cs.LG math.OC stat.ML

    Mixed-feature Logistic Regression Robust to Distribution Shifts

    Authors: Qingshi Sun, Nathan Justin, Andres Gomez, Phebe Vayanos

    Abstract: Logistic regression models are widely used in the social and behavioral sciences and in high-stakes domains, due to their simplicity and interpretability properties. At the same time, such domains are permeated by distribution shifts, where the distribution generating the data changes between training and deployment. In this paper, we study a distributionally robust logistic regression problem tha… ▽ More

    Submitted 15 March, 2025; originally announced March 2025.

    Comments: The 28th International Conference on Artificial Intelligence and Statistics (AISTATS), 2025

  9. arXiv:2502.12713  [pdf, other

    cs.CV

    Uncertainty Propagation for Echocardiography Clinical Metric Estimation via Contour Sampling

    Authors: Thierry Judge, Olivier Bernard, Woo-Jin Cho Kim, Alberto Gomez, Arian Beqiri, Agisilaos Chartsias, Pierre-Marc Jodoin

    Abstract: Echocardiography plays a fundamental role in the extraction of important clinical parameters (e.g. left ventricular volume and ejection fraction) required to determine the presence and severity of heart-related conditions. When deploying automated techniques for computing these parameters, uncertainty estimation is crucial for assessing their utility. Since clinical parameters are usually derived… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

    Comments: 10 pages, submitted to IEEE TMI

  10. arXiv:2501.06108  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.LG

    Inferring High-Order Couplings with Neural Networks

    Authors: Aurélien Decelle, Alfonso de Jesús Navas Gómez, Beatriz Seoane

    Abstract: Maximum entropy methods, based on the inverse Ising/Potts problem from statistical mechanics, are essential for modeling interactions between pairs of variables in data-driven problems across disciplines such as bioinformatics, ecology, and neuroscience. Despite their considerable success, these methods typically fail to capture higher-order interactions that are often essential for understanding… ▽ More

    Submitted 10 February, 2025; v1 submitted 10 January, 2025; originally announced January 2025.

    Comments: 16 Pages and 5 Figures

  11. arXiv:2412.17116  [pdf, other

    cs.LG cs.CY math.OC stat.ML

    Fair and Accurate Regression: Strong Formulations and Algorithms

    Authors: Anna Deza, Andrés Gómez, Alper Atamtürk

    Abstract: This paper introduces mixed-integer optimization methods to solve regression problems that incorporate fairness metrics. We propose an exact formulation for training fair regression models. To tackle this computationally hard problem, we study the polynomially-solvable single-factor and single-observation subproblems as building blocks and derive their closed convex hull descriptions. Strong formu… ▽ More

    Submitted 22 December, 2024; originally announced December 2024.

  12. arXiv:2412.04261  [pdf, other

    cs.CL

    Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier

    Authors: John Dang, Shivalika Singh, Daniel D'souza, Arash Ahmadian, Alejandro Salamanca, Madeline Smith, Aidan Peppin, Sungjin Hong, Manoj Govindassamy, Terrence Zhao, Sandra Kublik, Meor Amer, Viraat Aryabumi, Jon Ander Campos, Yi-Chern Tan, Tom Kocmi, Florian Strub, Nathan Grinsztajn, Yannis Flet-Berliac, Acyr Locatelli, Hangyu Lin, Dwarak Talupuru, Bharat Venkitesh, David Cairuz, Bowen Yang , et al. (20 additional authors not shown)

    Abstract: We introduce the Aya Expanse model family, a new generation of 8B and 32B parameter multilingual language models, aiming to address the critical challenge of developing highly performant multilingual models that match or surpass the capabilities of monolingual models. By leveraging several years of research at Cohere For AI and Cohere, including advancements in data arbitrage, multilingual prefere… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

  13. arXiv:2411.11190  [pdf, other

    eess.IV cs.CV

    DeepSPV: An Interpretable Deep Learning Pipeline for 3D Spleen Volume Estimation from 2D Ultrasound Images

    Authors: Zhen Yuan, David Stojanovski, Lei Li, Alberto Gomez, Haran Jogeesvaran, Esther Puyol-Antón, Baba Inusa, Andrew P. King

    Abstract: Splenomegaly, the enlargement of the spleen, is an important clinical indicator for various associated medical conditions, such as sickle cell disease (SCD). Spleen length measured from 2D ultrasound is the most widely used metric for characterising spleen size. However, it is still considered a surrogate measure, and spleen volume remains the gold standard for assessing spleen size. Accurate sple… ▽ More

    Submitted 17 November, 2024; originally announced November 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2308.08038

  14. arXiv:2409.19371  [pdf, other

    eess.IV cs.CV

    Efficient Semantic Diffusion Architectures for Model Training on Synthetic Echocardiograms

    Authors: David Stojanovski, Mariana da Silva, Pablo Lamata, Arian Beqiri, Alberto Gomez

    Abstract: We investigate the utility of diffusion generative models to efficiently synthesise datasets that effectively train deep learning models for image analysis. Specifically, we propose novel $Γ$-distribution Latent Denoising Diffusion Models (LDMs) designed to generate semantically guided synthetic cardiac ultrasound images with improved computational efficiency. We also investigate the potential of… ▽ More

    Submitted 28 September, 2024; originally announced September 2024.

  15. arXiv:2409.17214  [pdf, other

    cs.MA cs.GT

    Grounded Predictions of Teamwork as a One-Shot Game: A Multiagent Multi-Armed Bandits Approach

    Authors: Alejandra López de Aberasturi Gómez, Carles Sierra, Jordi Sabater-Mir

    Abstract: Humans possess innate collaborative capacities. However, effective teamwork often remains challenging. This study delves into the feasibility of collaboration within teams of rational, self-interested agents who engage in teamwork without the obligation to contribute. Drawing from psychological and game theoretical frameworks, we formalise teamwork as a one-shot aggregative game, integrating insig… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

  16. arXiv:2409.09645  [pdf, other

    cs.LG cs.AI cs.NE

    COSCO: A Sharpness-Aware Training Framework for Few-shot Multivariate Time Series Classification

    Authors: Jesus Barreda, Ashley Gomez, Ruben Puga, Kaixiong Zhou, Li Zhang

    Abstract: Multivariate time series classification is an important task with widespread domains of applications. Recently, deep neural networks (DNN) have achieved state-of-the-art performance in time series classification. However, they often require large expert-labeled training datasets which can be infeasible in practice. In few-shot settings, i.e. only a limited number of samples per class are available… ▽ More

    Submitted 15 September, 2024; originally announced September 2024.

    Comments: 5 pages, 5 figures, CIKM '24 Short Paper Track

  17. arXiv:2408.10361  [pdf, other

    eess.AS cs.SD

    ASASVIcomtech: The Vicomtech-UGR Speech Deepfake Detection and SASV Systems for the ASVspoof5 Challenge

    Authors: Juan M. Martín-Doñas, Eros Roselló, Angel M. Gomez, Aitor Álvarez, Iván López-Espejo, Antonio M. Peinado

    Abstract: This paper presents the work carried out by the ASASVIcomtech team, made up of researchers from Vicomtech and University of Granada, for the ASVspoof5 Challenge. The team has participated in both Track 1 (speech deepfake detection) and Track 2 (spoofing-aware speaker verification). This work started with an analysis of the challenge available data, which was regarded as an essential step to avoid… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: This paper was accepted at ASVspoof Workshop 2024

  18. arXiv:2407.21577  [pdf, other

    cs.CV cs.AI

    Multi-Site Class-Incremental Learning with Weighted Experts in Echocardiography

    Authors: Kit M. Bransby, Woo-jin Cho Kim, Jorge Oliveira, Alex Thorley, Arian Beqiri, Alberto Gomez, Agisilaos Chartsias

    Abstract: Building an echocardiography view classifier that maintains performance in real-life cases requires diverse multi-site data, and frequent updates with newly available data to mitigate model drift. Simply fine-tuning on new datasets results in "catastrophic forgetting", and cannot adapt to variations of view labels between sites. Alternatively, collecting all data on a single server and re-training… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: Accepted for Oral at MICCAI workshop ASMUS-2024

  19. STT-RAM-based Hierarchical In-Memory Computing

    Authors: Dhruv Gajaria, Kevin Antony Gomez, Tosiron Adegbija

    Abstract: In-memory computing promises to overcome the von Neumann bottleneck in computer systems by performing computations directly within the memory. Previous research has suggested using Spin-Transfer Torque RAM (STT-RAM) for in-memory computing due to its non-volatility, low leakage power, high density, endurance, and commercial viability. This paper explores hierarchical in-memory computing, where dif… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: Published in: IEEE Transactions on Parallel and Distributed Systems ( Volume: 35, Issue: 9, September 2024)

    Journal ref: IEEE Transactions on Parallel and Distributed Systems, vol. 35, no. 9, pp. 1615-1629, Sept. 2024

  20. arXiv:2406.19148  [pdf, other

    cs.CV cs.AI

    BackMix: Mitigating Shortcut Learning in Echocardiography with Minimal Supervision

    Authors: Kit Mills Bransby, Arian Beqiri, Woo-Jin Cho Kim, Jorge Oliveira, Agisilaos Chartsias, Alberto Gomez

    Abstract: Neural networks can learn spurious correlations that lead to the correct prediction in a validation set, but generalise poorly because the predictions are right for the wrong reason. This undesired learning of naive shortcuts (Clever Hans effect) can happen for example in echocardiogram view classification when background cues (e.g. metadata) are biased towards a class and the model learns to focu… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Accepted at MICCAI 2024 (Pre-print)

  21. arXiv:2406.17303  [pdf, other

    cs.MA

    Learnings from Implementation of a BDI Agent-based Battery-less Wireless Sensor

    Authors: Ganesh Ramanathan, Andres Gomez, Simon Mayer

    Abstract: Battery-less embedded devices powered by energy harvesting are increasingly being used in wireless sensing applications. However, their limited and often uncertain energy availability challenges designing application programs. To examine if BDI-based agent programming can address this challenge, we used it for a real-life application involving an environmental sensor that works on energy harvested… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  22. arXiv:2406.00808  [pdf, other

    cs.CV

    EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing

    Authors: Hadrien Reynaud, Qingjie Meng, Mischa Dombrowski, Arijit Ghosh, Thomas Day, Alberto Gomez, Paul Leeson, Bernhard Kainz

    Abstract: To make medical datasets accessible without sharing sensitive patient information, we introduce a novel end-to-end approach for generative de-identification of dynamic medical imaging data. Until now, generative methods have faced constraints in terms of fidelity, spatio-temporal coherence, and the length of generation, failing to capture the complete details of dataset distributions. We present a… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Accepted at MICCAI 2024

  23. arXiv:2405.15032  [pdf, other

    cs.CL

    Aya 23: Open Weight Releases to Further Multilingual Progress

    Authors: Viraat Aryabumi, John Dang, Dwarak Talupuru, Saurabh Dash, David Cairuz, Hangyu Lin, Bharat Venkitesh, Madeline Smith, Jon Ander Campos, Yi Chern Tan, Kelly Marchisio, Max Bartolo, Sebastian Ruder, Acyr Locatelli, Julia Kreutzer, Nick Frosst, Aidan Gomez, Phil Blunsom, Marzieh Fadaee, Ahmet Üstün, Sara Hooker

    Abstract: This technical report introduces Aya 23, a family of multilingual language models. Aya 23 builds on the recent release of the Aya model (Üstün et al., 2024), focusing on pairing a highly performant pre-trained model with the recently released Aya collection (Singh et al., 2024). The result is a powerful multilingual large language model serving 23 languages, expanding state-of-art language modelin… ▽ More

    Submitted 31 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  24. arXiv:2404.15320  [pdf, other

    cs.DL cs.AI cs.CL

    Using Large Language Models to Enrich the Documentation of Datasets for Machine Learning

    Authors: Joan Giner-Miguelez, Abel Gómez, Jordi Cabot

    Abstract: Recent regulatory initiatives like the European AI Act and relevant voices in the Machine Learning (ML) community stress the need to describe datasets along several key dimensions for trustworthy AI, such as the provenance processes and social concerns. However, this information is typically presented as unstructured text in accompanying documentation, hampering their automated analysis and proces… ▽ More

    Submitted 24 May, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    ACM Class: H.4.4

  25. arXiv:2404.14848  [pdf, other

    cs.RO

    Evaluating Dynamic Environment Difficulty for Obstacle Avoidance Benchmarking

    Authors: Moji Shi, Gang Chen, Álvaro Serra Gómez, Siyuan Wu, Javier Alonso-Mora

    Abstract: Dynamic obstacle avoidance is a popular research topic for autonomous systems, such as micro aerial vehicles and service robots. Accurately evaluating the performance of dynamic obstacle avoidance methods necessitates the establishment of a metric to quantify the environment's difficulty, a crucial aspect that remains unexplored. In this paper, we propose four metrics to measure the difficulty of… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  26. arXiv:2404.02251  [pdf, other

    cs.IT

    Generating gaussian pseudorandom noise with binary sequences

    Authors: Francisco-Javier Soto, Ana I. Gómez, Domingo Gómez-Pérez

    Abstract: Gaussian random number generators attract a widespread interest due to their applications in several fields. Important requirements include easy implementation, tail accuracy, and, finally, a flat spectrum. In this work, we study the applicability of uniform pseudorandom binary generators in combination with the Central Limit Theorem to propose an easy to implement, efficient and flexible algorith… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  27. Review of Distributed Quantum Computing. From single QPU to High Performance Quantum Computing

    Authors: David Barral, F. Javier Cardama, Guillermo Díaz, Daniel Faílde, Iago F. Llovo, Mariamo Mussa Juane, Jorge Vázquez-Pérez, Juan Villasuso, César Piñeiro, Natalia Costas, Juan C. Pichel, Tomás F. Pena, Andrés Gómez

    Abstract: The emerging field of quantum computing has shown it might change how we process information by using the unique principles of quantum mechanics. As researchers continue to push the boundaries of quantum technologies to unprecedented levels, distributed quantum computing raises as an obvious path to explore with the aim of boosting the computational power of current quantum systems. This paper pre… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  28. arXiv:2402.18204  [pdf, other

    cs.SD eess.AS

    ConvDTW-ACS: Audio Segmentation for Track Type Detection During Car Manufacturing

    Authors: Álvaro López-Chilet, Zhaoyi Liu, Jon Ander Gómez, Carlos Alvarez, Marivi Alonso Ortiz, Andres Orejuela Mesa, David Newton, Friedrich Wolf-Monheim, Sam Michiels, Danny Hughes

    Abstract: This paper proposes a method for Acoustic Constrained Segmentation (ACS) in audio recordings of vehicles driven through a production test track, delimiting the boundaries of surface types in the track. ACS is a variant of classical acoustic segmentation where the sequence of labels is known, contiguous and invariable, which is especially useful in this work as the test track has a standard configu… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 12 pages, 2 figures

  29. arXiv:2402.01797  [pdf, other

    cs.LG math.OC stat.CO

    Robust support vector machines via conic optimization

    Authors: Valentina Cepeda, Andrés Gómez, Shaoning Han

    Abstract: We consider the problem of learning support vector machines robust to uncertainty. It has been established in the literature that typical loss functions, including the hinge loss, are sensible to data perturbations and outliers, thus performing poorly in the setting considered. In contrast, using the 0-1 loss or a suitable non-convex approximation results in robust estimators, at the expense of la… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  30. arXiv:2401.12251  [pdf, other

    cs.LG eess.IV

    Diffusion Representation for Asymmetric Kernels

    Authors: Alvaro Almeida Gomez, Antonio Silva Neto, Jorge zubelli

    Abstract: We extend the diffusion-map formalism to data sets that are induced by asymmetric kernels. Analytical convergence results of the resulting expansion are proved, and an algorithm is proposed to perform the dimensional reduction. In this work we study data sets in which its geometry structure is induced by an asymmetric kernel. We use a priori coordinate system to represent this geometry and, thus,… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

    Journal ref: Applied Numerical Mathematics, 2021

  31. arXiv:2401.10304  [pdf, other

    cs.LG cs.AI cs.DL

    On the Readiness of Scientific Data for a Fair and Transparent Use in Machine Learning

    Authors: Joan Giner-Miguelez, Abel Gómez, Jordi Cabot

    Abstract: To ensure the fairness and trustworthiness of machine learning (ML) systems, recent legislative initiatives and relevant research in the ML community have pointed out the need to document the data used to train ML models. Besides, data-sharing practices in many scientific domains have evolved in recent years for reproducibility purposes. In this sense, academic institutions' adoption of these prac… ▽ More

    Submitted 17 December, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  32. Density Matrix Emulation of Quantum Recurrent Neural Networks for Multivariate Time Series Prediction

    Authors: José Daniel Viqueira, Daniel Faílde, Mariamo M. Juane, Andrés Gómez, David Mera

    Abstract: Quantum Recurrent Neural Networks (QRNNs) are robust candidates for modelling and predicting future values in multivariate time series. However, the effective implementation of some QRNN models is limited by the need for mid-circuit measurements. Those increase the requirements for quantum hardware, which in the current NISQ era does not allow reliable computations. Emulation arises as the main ne… ▽ More

    Submitted 30 January, 2025; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: 19 pages, 8 figures

    Journal ref: Mach. Learn.: Sci. Technol. 6, 015023 (2025)

  33. arXiv:2310.17772  [pdf, other

    cs.LG math.OC stat.ML

    Learning Optimal Classification Trees Robust to Distribution Shifts

    Authors: Nathan Justin, Sina Aghaei, Andrés Gómez, Phebe Vayanos

    Abstract: We consider the problem of learning classification trees that are robust to distribution shifts between training and testing/deployment data. This problem arises frequently in high stakes settings such as public health and social work where data is often collected using self-reported surveys which are highly sensitive to e.g., the framing of the questions, the time when and place where the survey… ▽ More

    Submitted 12 May, 2025; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 51 pages, 10 figures

  34. arXiv:2309.02292  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech cs.LG

    Inferring effective couplings with Restricted Boltzmann Machines

    Authors: Aurélien Decelle, Cyril Furtlehner, Alfonso De Jesus Navas Gómez, Beatriz Seoane

    Abstract: Generative models offer a direct way of modeling complex data. Energy-based models attempt to encode the statistical correlations observed in the data at the level of the Boltzmann weight associated with an energy function in the form of a neural network. We address here the challenge of understanding the physical interpretation of such models. In this study, we propose a simple solution by implem… ▽ More

    Submitted 24 January, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: 17 figures, 39 pages

    Journal ref: SciPost Phys. 16, 095 (2024)

  35. arXiv:2308.16767  [pdf, other

    cs.RO

    Reinforcement learning for safety-critical control of an automated vehicle

    Authors: Florian Thaler, Franz Rammerstorfer, Jon Ander Gomez, Raul Garcia Crespo, Leticia Pasqual, Markus Postl

    Abstract: We present our approach for the development, validation and deployment of a data-driven decision-making function for the automated control of a vehicle. The decisionmaking function, based on an artificial neural network is trained to steer the mobile robot SPIDER towards a predefined, static path to a target point while avoiding collisions with obstacles along the path. The training is conducted b… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

  36. arXiv:2308.03554  [pdf, other

    cs.CR

    TemporalFED: Detecting Cyberattacks in Industrial Time-Series Data Using Decentralized Federated Learning

    Authors: Ángel Luis Perales Gómez, Enrique Tomás Martínez Beltrán, Pedro Miguel Sánchez Sánchez, Alberto Huertas Celdrán

    Abstract: Industry 4.0 has brought numerous advantages, such as increasing productivity through automation. However, it also presents major cybersecurity issues such as cyberattacks affecting industrial processes. Federated Learning (FL) combined with time-series analysis is a promising cyberattack detection mechanism proposed in the literature. However, the fact of having a single point of failure and netw… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  37. arXiv:2307.15691  [pdf, other

    stat.ML cs.LG math.OC

    ODTlearn: A Package for Learning Optimal Decision Trees for Prediction and Prescription

    Authors: Patrick Vossler, Sina Aghaei, Nathan Justin, Nathanael Jo, Andrés Gómez, Phebe Vayanos

    Abstract: ODTLearn is an open-source Python package that provides methods for learning optimal decision trees for high-stakes predictive and prescriptive tasks based on the mixed-integer optimization (MIO) framework proposed in Aghaei et al. (2019) and several of its extensions. The current version of the package provides implementations for learning optimal classification trees, optimal fair classification… ▽ More

    Submitted 12 November, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: 7 pages, 2 figures

  38. arXiv:2307.13750  [pdf, other

    math.OC cs.LG

    Solution Path of Time-varying Markov Random Fields with Discrete Regularization

    Authors: Salar Fattahi, Andres Gomez

    Abstract: We study the problem of inferring sparse time-varying Markov random fields (MRFs) with different discrete and temporal regularizations on the parameters. Due to the intractability of discrete regularization, most approaches for solving this problem rely on the so-called maximum-likelihood estimation (MLE) with relaxed regularization, which neither results in ideal statistical properties nor scale… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  39. arXiv:2307.05975  [pdf, other

    math.OC cs.LG stat.ME stat.ML

    Outlier detection in regression: conic quadratic formulations

    Authors: Andrés Gómez, José Neto

    Abstract: In many applications, when building linear regression models, it is important to account for the presence of outliers, i.e., corrupted input data points. Such problems can be formulated as mixed-integer optimization problems involving cubic terms, each given by the product of a binary variable and a quadratic term of the continuous variables. Existing approaches in the literature, typically relyin… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

  40. arXiv:2307.02997  [pdf, other

    eess.IV cs.CV

    Fourier-Net+: Leveraging Band-Limited Representation for Efficient 3D Medical Image Registration

    Authors: Xi Jia, Alexander Thorley, Alberto Gomez, Wenqi Lu, Dipak Kotecha, Jinming Duan

    Abstract: U-Net style networks are commonly utilized in unsupervised image registration to predict dense displacement fields, which for high-resolution volumetric image data is a resource-intensive and time-consuming task. To tackle this challenge, we first propose Fourier-Net, which replaces the costly U-Net style expansive path with a parameter-free model-driven decoder. Instead of directly predicting a f… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: Under review. arXiv admin note: text overlap with arXiv:2211.16342

  41. arXiv:2306.15414  [pdf, other

    cs.DL

    FAIR EVA: Bringing institutional multidisciplinary repositories into the FAIR picture

    Authors: Fernando Aguilar Gómez, Isabel Bernal

    Abstract: The FAIR Principles are a set of good practices to improve the reproducibility and quality of data in an Open Science context. Different sets of indicators have been proposed to evaluate the FAIRness of digital objects, including datasets that are usually stored in repositories or data portals. However, indicators like those proposed by the Research Data Alliance are provided from a high-level per… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  42. arXiv:2306.14851  [pdf, other

    math.OC cs.LG stat.ME

    Optimal Cross-Validation for Sparse Linear Regression

    Authors: Ryan Cory-Wright, Andrés Gómez

    Abstract: Given a high-dimensional covariate matrix and a response vector, ridge-regularized sparse linear regression selects a subset of features that explains the relationship between covariates and the response in an interpretable manner. To select the sparsity and robustness of linear regressors, techniques like k-fold cross-validation are commonly used for hyperparameter tuning. However, cross-validati… ▽ More

    Submitted 11 May, 2025; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Moved stability-adjustment content to a different paper, as it was a separate idea to the main point of the paper

  43. arXiv:2306.09750  [pdf, other

    cs.LG cs.AI cs.DC cs.NI

    Fedstellar: A Platform for Decentralized Federated Learning

    Authors: Enrique Tomás Martínez Beltrán, Ángel Luis Perales Gómez, Chao Feng, Pedro Miguel Sánchez Sánchez, Sergio López Bernal, Gérôme Bovet, Manuel Gil Pérez, Gregorio Martínez Pérez, Alberto Huertas Celdrán

    Abstract: In 2016, Google proposed Federated Learning (FL) as a novel paradigm to train Machine Learning (ML) models across the participants of a federation while preserving data privacy. Since its birth, Centralized FL (CFL) has been the most used approach, where a central entity aggregates participants' models to create a global one. However, CFL presents limitations such as communication bottlenecks, sin… ▽ More

    Submitted 8 April, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

  44. arXiv:2306.04739  [pdf, other

    cs.LG

    Automatic retrieval of corresponding US views in longitudinal examinations

    Authors: Hamideh Kerdegari, Tran Huy Nhat Phung1, Van Hao Nguyen, Thi Phuong Thao Truong, Ngoc Minh Thu Le, Thanh Phuong Le, Thi Mai Thao Le, Luigi Pisani, Linda Denehy, Vital Consortium, Reza Razavi, Louise Thwaites, Sophie Yacoub, Andrew P. King, Alberto Gomez

    Abstract: Skeletal muscle atrophy is a common occurrence in critically ill patients in the intensive care unit (ICU) who spend long periods in bed. Muscle mass must be recovered through physiotherapy before patient discharge and ultrasound imaging is frequently used to assess the recovery process by measuring the muscle size over time. However, these manual measurements are subject to large variability, par… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: 10 pages, 6 figures

  45. arXiv:2305.05424  [pdf, other

    eess.IV cs.CV cs.LG

    Echo from noise: synthetic ultrasound image generation using diffusion models for real image segmentation

    Authors: David Stojanovski, Uxio Hermida, Pablo Lamata, Arian Beqiri, Alberto Gomez

    Abstract: We propose a novel pipeline for the generation of synthetic ultrasound images via Denoising Diffusion Probabilistic Models (DDPMs) guided by cardiac semantic label maps. We show that these synthetic images can serve as a viable substitute for real data in the training of deep-learning models for ultrasound image analysis tasks such as cardiac segmentation. To demonstrate the effectiveness of this… ▽ More

    Submitted 15 August, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

  46. arXiv:2304.03571  [pdf, other

    physics.flu-dyn cs.LG

    $β$-Variational autoencoders and transformers for reduced-order modelling of fluid flows

    Authors: Alberto Solera-Rico, Carlos Sanmiguel Vila, M. A. Gómez, Yuning Wang, Abdulrahman Almashjary, Scott T. M. Dawson, Ricardo Vinuesa

    Abstract: Variational autoencoder (VAE) architectures have the potential to develop reduced-order models (ROMs) for chaotic fluid flows. We propose a method for learning compact and near-orthogonal ROMs using a combination of a $β$-VAE and a transformer, tested on numerical data from a two-dimensional viscous flow in both periodic and chaotic regimes. The $β$-VAE is trained to learn a compact latent represe… ▽ More

    Submitted 15 November, 2023; v1 submitted 7 April, 2023; originally announced April 2023.

  47. Feature-Conditioned Cascaded Video Diffusion Models for Precise Echocardiogram Synthesis

    Authors: Hadrien Reynaud, Mengyun Qiao, Mischa Dombrowski, Thomas Day, Reza Razavi, Alberto Gomez, Paul Leeson, Bernhard Kainz

    Abstract: Image synthesis is expected to provide value for the translation of machine learning methods into clinical practice. Fundamental problems like model robustness, domain transfer, causal modelling, and operator training become approachable through synthetic data. Especially, heavily operator-dependant modalities like Ultrasound imaging require robust frameworks for image and video generation. So far… ▽ More

    Submitted 21 February, 2024; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: Published in MICCAI 2023 proceedings. https://link.springer.com/chapter/10.1007/978-3-031-43999-5_14

  48. arXiv:2212.14510  [pdf, other

    physics.med-ph cs.LG eess.IV

    A Machine Learning Case Study for AI-empowered echocardiography of Intensive Care Unit Patients in low- and middle-income countries

    Authors: Miguel Xochicale, Louise Thwaites, Sophie Yacoub, Luigi Pisani, Phung-Nhat Tran-Huy, Hamideh Kerdegari, Andrew King, Alberto Gomez

    Abstract: We present a Machine Learning (ML) study case to illustrate the challenges of clinical translation for a real-time AI-empowered echocardiography system with data of ICU patients in LMICs. Such ML case study includes data preparation, curation and labelling from 2D Ultrasound videos of 31 ICU patients in LMICs and model selection, validation and deployment of three thinner neural networks to classi… ▽ More

    Submitted 5 March, 2023; v1 submitted 29 December, 2022; originally announced December 2022.

  49. arXiv:2209.13569  [pdf, other

    cs.LG stat.ML

    Exploring Low Rank Training of Deep Neural Networks

    Authors: Siddhartha Rao Kamalakara, Acyr Locatelli, Bharat Venkitesh, Jimmy Ba, Yarin Gal, Aidan N. Gomez

    Abstract: Training deep neural networks in low rank, i.e. with factorised layers, is of particular interest to the community: it offers efficiency over unfactorised training in terms of both memory consumption and training time. Prior work has focused on low rank approximations of pre-trained networks and training in low rank space with additional objectives, offering various ad hoc explanations for chosen… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

  50. arXiv:2207.13424  [pdf, other

    eess.IV cs.AI cs.CV

    Efficient Pix2Vox++ for 3D Cardiac Reconstruction from 2D echo views

    Authors: David Stojanovski, Uxio Hermida, Marica Muffoletto, Pablo Lamata, Arian Beqiri, Alberto Gomez

    Abstract: Accurate geometric quantification of the human heart is a key step in the diagnosis of numerous cardiac diseases, and in the management of cardiac patients. Ultrasound imaging is the primary modality for cardiac imaging, however acquisition requires high operator skill, and its interpretation and analysis is difficult due to artifacts. Reconstructing cardiac anatomy in 3D can enable discovery of n… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: 11 pages, 4 figures, July 27 2022 submitted to 3rd International Workshop, Advances in Simplifying Medical Ultrasound (ASMUS2022), https://miccai-ultrasound.github.io/#/asmus22