Showing 1–50 of 281 results for author: Ribeiro, A

Searching in archive cs.
  1. arXiv:2507.01045  [pdf, ps, other]

    cs.LG cs.AI eess.SP

    Sensing Cardiac Health Across Scenarios and Devices: A Multi-Modal Foundation Model Pretrained on Heterogeneous Data from 1.7 Million Individuals

    Authors: Xiao Gu, Wei Tang, Jinpei Han, Veer Sangha, Fenglin Liu, Shreyank N Gowda, Antonio H. Ribeiro, Patrick Schwab, Kim Branson, Lei Clifton, Antonio Luiz P. Ribeiro, Zhangdaihong Liu, David A. Clifton

    Abstract: Cardiac biosignals, such as electrocardiograms (ECG) and photoplethysmograms (PPG), are of paramount importance for the diagnosis, prevention, and management of cardiovascular diseases, and have been extensively used in a variety of clinical tasks. Conventional deep learning approaches for analyzing these signals typically rely on homogeneous datasets and static bespoke models, limiting their robu…

    Submitted 23 June, 2025; originally announced July 2025.

  2. arXiv:2507.00647  [pdf, ps, other]

    cs.LG

    Cooperative Sheaf Neural Networks

    Authors: André Ribeiro, Ana Luiza Tenório, Juan Belieni, Amauri H. Souza, Diego Mesquita

    Abstract: Sheaf diffusion has recently emerged as a promising design pattern for graph representation learning due to its inherent ability to handle heterophilic data and avoid oversmoothing. Meanwhile, cooperative message passing has also been proposed as a way to enhance the flexibility of information diffusion by allowing nodes to independently choose whether to propagate/gather information from/to neigh…

    Submitted 1 July, 2025; originally announced July 2025.

  3. arXiv:2506.18748  [pdf, ps, other]

    eess.SP cs.LG

    Fast State-Augmented Learning for Wireless Resource Allocation with Dual Variable Regression

    Authors: Yigit Berkay Uslu, Navid NaderiAlizadeh, Mark Eisen, Alejandro Ribeiro

    Abstract: We consider resource allocation problems in multi-user wireless networks, where the goal is to optimize a network-wide utility function subject to constraints on the ergodic average performance of users. We demonstrate how a state-augmented graph neural network (GNN) parametrization for the resource allocation policy circumvents the drawbacks of the ubiquitous dual subgradient methods by represent…

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: This work has been submitted to the IEEE TSP for possible publication

  4. arXiv:2506.17486  [pdf, ps, other]

    cs.RO cs.AI cs.LG

    Distilling On-device Language Models for Robot Planning with Minimal Human Intervention

    Authors: Zachary Ravichandran, Ignacio Hounie, Fernando Cladera, Alejandro Ribeiro, George J. Pappas, Vijay Kumar

    Abstract: Large language models (LLMs) provide robots with powerful contextual reasoning abilities and a natural human interface. Yet, current LLM-enabled robots typically depend on cloud-hosted models, limiting their usability in environments with unreliable communication infrastructure, such as outdoor or industrial settings. We present PRISM, a framework for distilling small language model (SLM)-enabled…

    Submitted 20 June, 2025; originally announced June 2025.

  5. arXiv:2506.06557  [pdf, ps, other]

    cs.IR cs.LG eess.SP math.MG

    Infinity Search: Approximate Vector Search with Projections on q-Metric Spaces

    Authors: Antonio Pariente, Ignacio Hounie, Santiago Segarra, Alejandro Ribeiro

    Abstract: Despite the ubiquity of vector search applications, prevailing search algorithms overlook the metric structure of vector embeddings, treating it as a constraint rather than exploiting its underlying properties. In this paper, we demonstrate that in $q$-metric spaces, metric trees can leverage a stronger version of the triangle inequality to reduce comparisons for exact search. Notably, as $q$ appr…

    Submitted 6 June, 2025; originally announced June 2025.

  6. arXiv:2505.19387  [pdf, ps, other]

    cs.LG eess.SY math.OC

    Alignment of large language models with constrained learning

    Authors: Botong Zhang, Shuo Li, Ignacio Hounie, Osbert Bastani, Dongsheng Ding, Alejandro Ribeiro

    Abstract: We study the problem of computing an optimal large language model (LLM) policy for a constrained alignment problem, where the goal is to maximize a primary reward objective while satisfying constraints on secondary utilities. Despite the popularity of Lagrangian-based LLM policy search in constrained alignment, iterative primal-dual methods often fail to converge, and non-iterative dual-based meth…

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 48 pages, 7 figures, 7 tables

  7. arXiv:2505.15062  [pdf, other]

    cs.CL cs.AI

    Self-GIVE: Associative Thinking from Limited Structured Knowledge for Enhanced Large Language Model Reasoning

    Authors: Jiashu He, Jinxuan Fan, Bowen Jiang, Ignacio Hounie, Dan Roth, Alejandro Ribeiro

    Abstract: When addressing complex questions that require new information, people often associate the question with existing knowledge to derive a sensible answer. For instance, when evaluating whether melatonin aids insomnia, one might associate "hormones helping mental disorders" with "melatonin being a hormone and insomnia a mental disorder" to complete the reasoning. Large Language Models (LLMs) also req…

    Submitted 23 May, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

  8. arXiv:2505.09011  [pdf]

    cs.LG

    Signal-based AI-driven software solution for automated quantification of metastatic bone disease and treatment response assessment using Whole-Body Diffusion-Weighted MRI (WB-DWI) biomarkers in Advanced Prostate Cancer

    Authors: Antonio Candito, Matthew D Blackledge, Richard Holbrey, Nuria Porta, Ana Ribeiro, Fabio Zugni, Luca D'Erme, Francesca Castagnoli, Alina Dragan, Ricardo Donners, Christina Messiou, Nina Tunariu, Dow-Mu Koh

    Abstract: We developed an AI-driven software solution to quantify metastatic bone disease from WB-DWI scans. Core technologies include: (i) a weakly-supervised Residual U-Net model generating a skeleton probability map to isolate bone; (ii) a statistical framework for WB-DWI intensity normalisation, obtaining a signal-normalised b=900s/mm^2 (b900) image; and (iii) a shallow convolutional neural network that…

    Submitted 13 May, 2025; originally announced May 2025.

  9. arXiv:2505.06542  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    dcFCI: Robust Causal Discovery Under Latent Confounding, Unfaithfulness, and Mixed Data

    Authors: Adèle H. Ribeiro, Dominik Heider

    Abstract: Causal discovery is central to inferring causal relationships from observational data. In the presence of latent confounding, algorithms such as Fast Causal Inference (FCI) learn a Partial Ancestral Graph (PAG) representing the true model's Markov Equivalence Class. However, their correctness critically depends on empirical faithfulness, the assumption that observed (in)dependencies perfectly refl…

    Submitted 10 May, 2025; originally announced May 2025.

    Comments: 31 pages. This work has been submitted to the IEEE for possible publication

  10. arXiv:2504.20277  [pdf, other]

    cs.LG eess.SP

    Generative Diffusion Models for Resource Allocation in Wireless Networks

    Authors: Yigit Berkay Uslu, Samar Hadou, Shirin Saeedi Bidokhti, Alejandro Ribeiro

    Abstract: This paper proposes a supervised training algorithm for learning stochastic resource allocation policies with generative diffusion models (GDMs). We formulate the allocation problem as the maximization of an ergodic utility function subject to ergodic Quality of Service (QoS) constraints. Given samples from a stochastic expert policy that yields a near-optimal solution to the problem, we train a G…

    Submitted 28 April, 2025; originally announced April 2025.

  11. arXiv:2504.16097  [pdf, other]

    eess.SP cs.AI cs.LG

    A CNN-based Local-Global Self-Attention via Averaged Window Embeddings for Hierarchical ECG Analysis

    Authors: Arthur Buzelin, Pedro Robles Dutenhefner, Turi Rezende, Luisa G. Porfirio, Pedro Bento, Yan Aquino, Jose Fernandes, Caio Santana, Gabriela Miana, Gisele L. Pappa, Antonio Ribeiro, Wagner Meira Jr

    Abstract: Cardiovascular diseases remain the leading cause of global mortality, emphasizing the critical need for efficient diagnostic tools such as electrocardiograms (ECGs). Recent advancements in deep learning, particularly transformers, have revolutionized ECG analysis by capturing detailed waveform features as well as global rhythm patterns. However, traditional transformers struggle to effectively cap…

    Submitted 12 April, 2025; originally announced April 2025.

  12. arXiv:2503.20722  [pdf]

    cs.CV cs.LG

    A weakly-supervised deep learning model for fast localisation and delineation of the skeleton, internal organs, and spinal canal on Whole-Body Diffusion-Weighted MRI (WB-DWI)

    Authors: A. Candito, A. Dragan, R. Holbrey, A. Ribeiro, R. Donners, C. Messiou, N. Tunariu, D.-M. Koh, M. D. Blackledge. Affiliations: The Institute of Cancer Research, London, United Kingdom; The Royal Marsden NHS Foundation Trust, London, United Kingdom; University Hospital Basel, Basel, Switzerland

    Abstract: Background: Apparent Diffusion Coefficient (ADC) values and Total Diffusion Volume (TDV) from Whole-body diffusion-weighted MRI (WB-DWI) are recognized cancer imaging biomarkers. However, manual disease delineation for ADC and TDV measurements is unfeasible in clinical practice, demanding automation. As a first step, we propose an algorithm to generate fast and reproducible probability maps of the…

    Submitted 26 March, 2025; originally announced March 2025.

  13. arXiv:2502.20135  [pdf, other]

    cs.CL

    Educator Attention: How computational tools can systematically identify the distribution of a key resource for students

    Authors: Qingyang Zhang, Rose E. Wang, Ana T. Ribeiro, Dora Demszky, Susanna Loeb

    Abstract: Educator attention is critical for student success, yet how educators distribute their attention across students remains poorly understood due to data and methodological constraints. This study presents the first large-scale computational analysis of educator attention patterns, leveraging over 1 million educator utterances from virtual group tutoring sessions linked to detailed student demographi…

    Submitted 27 February, 2025; originally announced February 2025.

    Comments: The first two authors QZ and REW contributed equally. The last two authors DD and SL advised equally

  14. arXiv:2502.03081  [pdf, ps, other]

    cs.CV cs.LG

    Human-Aligned Image Models Improve Visual Decoding from the Brain

    Authors: Nona Rajabi, Antônio H. Ribeiro, Miguel Vasco, Farzaneh Taleb, Mårten Björkman, Danica Kragic

    Abstract: Decoding visual images from brain activity has significant potential for advancing brain-computer interaction and enhancing the understanding of human perception. Recent approaches align the representation spaces of images and brain activity to enable visual decoding. In this paper, we introduce the use of human-aligned image encoders to map brain signals to images. We hypothesize that these model…

    Submitted 10 June, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

    Comments: Accepted to ICML 2025

  15. arXiv:2502.02594  [pdf, other]

    cs.CE eess.SY

    Offshore Wind Turbine Tower Design and Optimization: A Review and AI-Driven Future Directions

    Authors: João Alves Ribeiro, Bruno Alves Ribeiro, Francisco Pimenta, Sérgio M. O. Tavares, Jie Zhang, Faez Ahmed

    Abstract: Offshore wind energy leverages the high intensity and consistency of oceanic winds, playing a key role in the transition to renewable energy. As energy demands grow, larger turbines are required to optimize power generation and reduce the Levelized Cost of Energy (LCoE), which represents the average cost of electricity over a project's lifetime. However, upscaling turbines introduces engineering c…

    Submitted 28 December, 2024; originally announced February 2025.

  16. arXiv:2502.01122  [pdf, other]

    cs.LG

    Learning Efficient Positional Encodings with Graph Neural Networks

    Authors: Charilaos I. Kanatsoulis, Evelyn Choi, Stephanie Jegelka, Jure Leskovec, Alejandro Ribeiro

    Abstract: Positional encodings (PEs) are essential for effective graph representation learning because they provide position awareness in inherently position-agnostic transformer architectures and increase the expressive capacity of Graph Neural Networks (GNNs). However, designing powerful and efficient PEs for graphs poses significant challenges due to the absence of canonical node ordering and the scale o…

    Submitted 3 February, 2025; originally announced February 2025.

  17. arXiv:2501.14912  [pdf, other]

    cs.LG cs.AI

    Feasible Learning

    Authors: Juan Ramirez, Ignacio Hounie, Juan Elenter, Jose Gallego-Posada, Meraj Hashemizadeh, Alejandro Ribeiro, Simon Lacoste-Julien

    Abstract: We introduce Feasible Learning (FL), a sample-centric learning paradigm where models are trained by solving a feasibility problem that bounds the loss for each training sample. In contrast to the ubiquitous Empirical Risk Minimization (ERM) framework, which optimizes for average performance, FL demands satisfactory performance on every individual data point. Since any model that meets the prescrib…

    Submitted 24 January, 2025; originally announced January 2025.

    Comments: Published at AISTATS 2025. Code available at https://github.com/juan43ramirez/feasible-learning

  18. arXiv:2501.01510  [pdf, other]

    cs.LG eess.SP q-bio.QM

    Explainable Brain Age Gap Prediction in Neurodegenerative Conditions using coVariance Neural Networks

    Authors: Saurabh Sihag, Gonzalo Mateos, Alejandro Ribeiro

    Abstract: Brain age is the estimate of biological age derived from neuroimaging datasets using machine learning algorithms. Increasing brain age gap characterized by an elevated brain age relative to the chronological age can reflect increased vulnerability to neurodegeneration and cognitive decline. Hence, brain age gap is a promising biomarker for monitoring brain health. However, black-box machi…

    Submitted 2 January, 2025; originally announced January 2025.

    Comments: Accepted at ISBI, 2025

  19. arXiv:2411.08730  [pdf]

    cs.MM cs.HC

    3D Modelling to Address Pandemic Challenges: A Project-Based Learning Methodology

    Authors: Tânia Rocha, Ana Ribeiro, Joana Oliveira, Ricardo Nunes, Diana Carvalho, Hugo Paredes, Paulo Martins

    Abstract: The use of 3D modelling in medical education is a revolutionary tool during the learning process. In fact, this type of technology enables a more interactive teaching approach, making information retention more effective and enhancing students' understanding. 3D modelling allows for the creation of precise representations of the human body, as well as interaction with three-dimensional models, giv…

    Submitted 13 November, 2024; originally announced November 2024.

  20. arXiv:2411.03038  [pdf, other]

    cs.LG

    Can Transformers Smell Like Humans?

    Authors: Farzaneh Taleb, Miguel Vasco, Antônio H. Ribeiro, Mårten Björkman, Danica Kragic

    Abstract: The human brain encodes stimuli from the environment into representations that form a sensory perception of the world. Despite recent advances in understanding visual and auditory perception, olfactory perception remains an under-explored topic in the machine learning community due to the lack of large-scale datasets annotated with labels of human olfactory perception. In this work, we ask the que…

    Submitted 5 November, 2024; originally announced November 2024.

    Comments: Spotlight paper at NeurIPS 2024

  21. arXiv:2411.01341  [pdf, ps, other]

    cs.LG eess.SP

    Convolutional Filtering with RKHS Algebras

    Authors: Alejandro Parada-Mayorga, Leopoldo Agorio, Alejandro Ribeiro, Juan Bazerque

    Abstract: In this paper, we develop a generalized theory of convolutional signal processing and neural networks for Reproducing Kernel Hilbert Spaces (RKHS). Leveraging the theory of algebraic signal processing (ASP), we show that any RKHS allows the formal definition of multiple algebraic convolutional models. We show that any RKHS induces algebras whose elements determine convolutional operators acting on…

    Submitted 1 June, 2025; v1 submitted 2 November, 2024; originally announced November 2024.

  22. arXiv:2410.12677  [pdf, other]

    stat.ML cs.CR cs.LG math.OC

    Efficient Optimization Algorithms for Linear Adversarial Training

    Authors: Antônio H. Ribeiro, Thomas B. Schön, Dave Zahariah, Francis Bach

    Abstract: Adversarial training can be used to learn models that are robust against perturbations. For linear models, it can be formulated as a convex optimization problem. Compared to methods proposed in the context of deep learning, leveraging the optimization structure allows significantly faster convergence rates. Still, the use of generic convex solvers can be inefficient for large-scale problems. Here,…

    Submitted 19 March, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: Paper accepted at AISTATS 2025

  23. arXiv:2410.08475  [pdf, ps, other]

    cs.AI cs.CL

    GIVE: Structured Reasoning of Large Language Models with Knowledge Graph Inspired Veracity Extrapolation

    Authors: Jiashu He, Mingyu Derek Ma, Jinxuan Fan, Dan Roth, Wei Wang, Alejandro Ribeiro

    Abstract: Existing approaches based on context prompting or reinforcement learning (RL) to improve the reasoning capacities of large language models (LLMs) depend on the LLMs' internal knowledge to produce reliable Chain-Of-Thought (CoT). However, no matter the size of LLMs, certain problems cannot be resolved in a single forward pass. Meanwhile, agent-based reasoning systems require access to a comprehensi…

    Submitted 29 May, 2025; v1 submitted 10 October, 2024; originally announced October 2024.

  24. arXiv:2410.04060  [pdf, other]

    cs.CL cs.AI

    LoRTA: Low Rank Tensor Adaptation of Large Language Models

    Authors: Ignacio Hounie, Charilaos Kanatsoulis, Arnuv Tandon, Alejandro Ribeiro

    Abstract: Low Rank Adaptation (LoRA) is a popular Parameter Efficient Fine Tuning (PEFT) method that effectively adapts large pre-trained models for downstream tasks. LoRA parameterizes model updates using low-rank matrices at each layer, significantly reducing the number of trainable parameters and, consequently, resource requirements during fine-tuning. However, the lower bound on the number of trainable…

    Submitted 2 February, 2025; v1 submitted 5 October, 2024; originally announced October 2024.

  25. arXiv:2410.03058  [pdf, other]

    cs.CV

    DiffKillR: Killing and Recreating Diffeomorphisms for Cell Annotation in Dense Microscopy Images

    Authors: Chen Liu, Danqi Liao, Alejandro Parada-Mayorga, Alejandro Ribeiro, Marcello DiStasio, Smita Krishnaswamy

    Abstract: The proliferation of digital microscopy images, driven by advances in automated whole slide scanning, presents significant opportunities for biomedical research and clinical diagnostics. However, accurately annotating densely packed information in these images remains a major challenge. To address this, we introduce DiffKillR, a novel framework that reframes cell annotation as the combination of a…

    Submitted 24 April, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: ICASSP 2025, Oral Presentation

  26. arXiv:2410.03017  [pdf, other]

    cs.CL

    Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise

    Authors: Rose E. Wang, Ana T. Ribeiro, Carly D. Robinson, Susanna Loeb, Dora Demszky

    Abstract: Generative AI, particularly Language Models (LMs), has the potential to transform real-world domains with societal impact, particularly where access to experts is limited. For example, in education, training novice educators with expert guidance is important for effectiveness but expensive, creating significant barriers to improving education quality at scale. This challenge disproportionately har…

    Submitted 25 January, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: Our pre-registration for this randomized controlled trial can be found here: https://osf.io/8d6ha. Demonstration code and video tutorials on how to develop your own Tutor CoPilot at this link: https://github.com/rosewang2008/tutor-copilot/. * The last two authors provided equal advising

  27. arXiv:2410.00645  [pdf, other]

    cs.LG

    LoRanPAC: Low-rank Random Features and Pre-trained Models for Bridging Theory and Practice in Continual Learning

    Authors: Liangzu Peng, Juan Elenter, Joshua Agterberg, Alejandro Ribeiro, René Vidal

    Abstract: The goal of continual learning (CL) is to train a model that can solve multiple tasks presented sequentially. Recent CL approaches have achieved strong performance by leveraging large pre-trained models that generalize well to downstream tasks. However, such methods lack theoretical guarantees, making them prone to unexpected failures. Conversely, principled CL approaches often fail to achieve com…

    Submitted 18 May, 2025; v1 submitted 1 October, 2024; originally announced October 2024.

    Comments: 47 pages, 18 figures, 16 tables (v3, accepted to ICLR 2025)

  28. arXiv:2409.20536  [pdf, other]

    cs.LG cs.CY

    Best Practices for Responsible Machine Learning in Credit Scoring

    Authors: Giovani Valdrighi, Athyrson M. Ribeiro, Jansen S. B. Pereira, Vitoria Guardieiro, Arthur Hendricks, Décio Miranda Filho, Juan David Nieto Garcia, Felipe F. Bocca, Thalita B. Veronese, Lucas Wanner, Marcos Medeiros Raimundo

    Abstract: The widespread use of machine learning in credit scoring has brought significant advancements in risk assessment and decision-making. However, it has also raised concerns about potential biases, discrimination, and lack of transparency in these automated systems. This tutorial paper performed a non-systematic literature review to guide best practices for developing responsible machine learning mod…

    Submitted 30 September, 2024; originally announced September 2024.

  29. arXiv:2409.19829  [pdf, other]

    cs.RO cs.AI eess.SY

    Generalizability of Graph Neural Networks for Decentralized Unlabeled Motion Planning

    Authors: Shreyas Muthusamy, Damian Owerko, Charilaos I. Kanatsoulis, Saurav Agarwal, Alejandro Ribeiro

    Abstract: Unlabeled motion planning involves assigning a set of robots to target locations while ensuring collision avoidance, aiming to minimize the total distance traveled. The problem forms an essential building block for multi-robot systems in applications such as exploration, surveillance, and transportation. We address this problem in a decentralized setting where each robot knows only the positions o…

    Submitted 29 September, 2024; originally announced September 2024.

    Comments: 6 pages, 6 figures, submitted to ICRA 2025

  30. arXiv:2409.05191  [pdf, ps, other]

    eess.SP cs.LG

    Generalization of Geometric Graph Neural Networks with Lipschitz Loss Functions

    Authors: Zhiyang Wang, Juan Cervino, Alejandro Ribeiro

    Abstract: In this paper, we study the generalization capabilities of geometric graph neural networks (GNNs). We consider GNNs over a geometric graph constructed from a finite set of randomly sampled points over an embedded manifold with topological information captured. We prove a generalization gap between the optimal empirical risk and the optimal statistical risk of this GNN, which decreases with the num…

    Submitted 6 June, 2025; v1 submitted 8 September, 2024; originally announced September 2024.

    Comments: 13 pages, 6 figures

  31. arXiv:2408.15094  [pdf, other]

    cs.LG cs.CV math.OC

    Constrained Diffusion Models via Dual Training

    Authors: Shervin Khalafi, Dongsheng Ding, Alejandro Ribeiro

    Abstract: Diffusion models have attained prominence for their ability to synthesize a probability distribution for a given dataset via a diffusion process, enabling the generation of new data points with high fidelity. However, diffusion processes are prone to generating samples that reflect biases in a training dataset. To address this issue, we develop constrained diffusion models by imposing diffusion co…

    Submitted 22 November, 2024; v1 submitted 27 August, 2024; originally announced August 2024.

    Comments: 31 pages, 4 figures, 4 tables

  32. arXiv:2408.13878  [pdf, other]

    cs.LG eess.SP

    Generalization of Graph Neural Networks is Robust to Model Mismatch

    Authors: Zhiyang Wang, Juan Cervino, Alejandro Ribeiro

    Abstract: Graph neural networks (GNNs) have demonstrated their effectiveness in various tasks supported by their generalization capabilities. However, the current analysis of GNN generalization relies on the assumption that training and testing data are independent and identically distributed (i.i.d). This imposes limitations on the cases where a model mismatch exists when generating testing data. In this p…

    Submitted 10 September, 2024; v1 submitted 25 August, 2024; originally announced August 2024.

    Comments: 20 pages, 6 figures. arXiv admin note: substantial text overlap with arXiv:2406.05225

  33. arXiv:2408.10015  [pdf, other]

    cs.AI math.OC

    Deterministic Policy Gradient Primal-Dual Methods for Continuous-Space Constrained MDPs

    Authors: Sergio Rozada, Dongsheng Ding, Antonio G. Marques, Alejandro Ribeiro

    Abstract: We study the problem of computing deterministic optimal policies for constrained Markov decision processes (MDPs) with continuous state and action spaces, which are widely encountered in constrained dynamical systems. Designing deterministic policy gradient methods in continuous state and action spaces is particularly challenging due to the lack of enumerable state-action pairs and the adoption of…

    Submitted 4 April, 2025; v1 submitted 19 August, 2024; originally announced August 2024.

  34. arXiv:2406.17611  [pdf, other]

    cs.LG eess.SP

    Distributed Training of Large Graph Neural Networks with Variable Communication Rates

    Authors: Juan Cervino, Md Asadullah Turja, Hesham Mostafa, Nageen Himayat, Alejandro Ribeiro

    Abstract: Training Graph Neural Networks (GNNs) on large graphs presents unique challenges due to the large memory and computing requirements. Distributed GNN training, where the graph is partitioned across multiple machines, is a common approach to training GNNs on large graphs. However, as the graph cannot generally be decomposed into small non-interacting components, data communication between the traini…

    Submitted 25 June, 2024; originally announced June 2024.

  35. arXiv:2406.05225  [pdf, ps, other]

    cs.LG stat.ML

    A Manifold Perspective on the Statistical Generalization of Graph Neural Networks

    Authors: Zhiyang Wang, Juan Cervino, Alejandro Ribeiro

    Abstract: Graph Neural Networks (GNNs) extend convolutional neural networks to operate on graphs. Despite their impressive performances in various graph learning tasks, the theoretical understanding of their generalization capability is still lacking. Previous GNN generalization bounds ignore the underlying graph structures, often leading to bounds that increase with the number of nodes -- a behavior contra…

    Submitted 6 June, 2025; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: 38 pages, 22 figures

  36. arXiv:2405.13487  [pdf, other]

    cs.CY cs.AI cs.HC

    Qualitative and quantitative analysis of student's perceptions in the use of generative AI in educational environments

    Authors: Sergio Altares-López, José M. Bengochea-Guevara, Carlos Ranz, Héctor Montes, Angela Ribeiro

    Abstract: The effective integration of generative artificial intelligence in education is a fundamental aspect to prepare future generations. The objective of this study is to analyze from a quantitative and qualitative point of view the perception of controlled student-AI interaction within the classroom. This analysis includes assessing the ethical implications and everyday use of AI tools, as well as und…

    Submitted 2 September, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: 17 pages, 7 figures, 4 tables

  37. arXiv:2405.10490  [pdf]

    stat.ME cs.AI cs.IR cs.LG math.OC

    Neural Optimization with Adaptive Heuristics for Intelligent Marketing System

    Authors: Changshuai Wei, Benjamin Zelditch, Joyce Chen, Andre Assuncao Silva T Ribeiro, Jingyi Kenneth Tay, Borja Ocejo Elizondo, Keerthi Selvaraj, Aman Gupta, Licurgo Benemann De Almeida

    Abstract: Computational marketing has become increasingly important in today's digital world, facing challenges such as massive heterogeneous data, multi-channel customer journeys, and limited marketing budgets. In this paper, we propose a general framework for marketing AI systems, the Neural Optimization with Adaptive Heuristics (NOAH) framework. NOAH is the first general framework for marketing optimizat…

    Submitted 25 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: KDD 2024

    ACM Class: G.3; G.1.6; I.2

  38. arXiv:2405.05748  [pdf, other]

    eess.SP cs.LG

    Learning to Slice Wi-Fi Networks: A State-Augmented Primal-Dual Approach

    Authors: Yiğit Berkay Uslu, Roya Doostnejad, Alejandro Ribeiro, Navid NaderiAlizadeh

    Abstract: Network slicing is a key feature in 5G/NG cellular networks that creates customized slices for different service types with various quality-of-service (QoS) requirements, which can achieve service differentiation and guarantee service-level agreement (SLA) for each service type. In Wi-Fi networks, there is limited prior work on slicing, and a potential solution is based on a multi-tenant architect…

    Submitted 27 January, 2025; v1 submitted 9 May, 2024; originally announced May 2024.

  39. arXiv:2404.03227  [pdf, other]

    eess.SP cs.LG

    Decentralized Learning Strategies for Estimation Error Minimization with Graph Neural Networks

    Authors: Xingran Chen, Navid NaderiAlizadeh, Alejandro Ribeiro, Shirin Saeedi Bidokhti

    Abstract: We address the challenge of sampling and remote estimation for autoregressive Markovian processes in a multi-hop wireless network with statistically-identical agents. Agents cache the most recent samples from others and communicate over wireless collision channels governed by an underlying graph topology. Our goal is to minimize time-average estimation error and/or age of information with decentra…

    Submitted 8 March, 2025; v1 submitted 4 April, 2024; originally announced April 2024.

  40. arXiv:2403.11844  [pdf, other]

    cs.LG eess.SP math.OC

    Near-Optimal Solutions of Constrained Learning Problems

    Authors: Juan Elenter, Luiz F. O. Chamon, Alejandro Ribeiro

    Abstract: With the widespread adoption of machine learning systems, the need to curtail their behavior has become increasingly apparent. This is evidenced by recent advancements towards developing models that satisfy robustness, safety, and fairness requirements. These requirements can be imposed (with generalization guarantees) by formulating constrained learning problems that can then be tackled by dual a…

    Submitted 18 March, 2024; originally announced March 2024.

  41. arXiv:2402.09373  [pdf, other]

    cs.LG stat.ML

    Loss Shaping Constraints for Long-Term Time Series Forecasting

    Authors: Ignacio Hounie, Javier Porras-Valenzuela, Alejandro Ribeiro

    Abstract: Several applications in time series forecasting require predicting multiple steps ahead. Despite the vast amount of literature on the topic, both classical and recent deep learning based approaches have mostly focused on minimising performance averaged over the predicted window. We observe that this can lead to disparate distributions of errors across forecasting steps, especially for recent trans…

    Submitted 11 July, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  42. arXiv:2402.07684  [pdf, other]

    q-bio.QM cs.LG stat.AP

    Towards a Foundation Model for Brain Age Prediction using coVariance Neural Networks

    Authors: Saurabh Sihag, Gonzalo Mateos, Alejandro Ribeiro

    Abstract: Brain age is the estimate of biological age derived from neuroimaging datasets using machine learning algorithms. Increasing brain age with respect to chronological age can reflect increased vulnerability to neurodegeneration and cognitive decline. In this paper, we study NeuroVNN, based on coVariance neural networks, as a paradigm for foundation model for the brain age prediction application. Neu…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Preliminary work. Contact [email protected] for the NeuroVNN model and code used for results reported in this manuscript

  43. arXiv:2401.06279  [pdf, ps, other]

    cs.LG eess.SP

    Sampling and Uniqueness Sets in Graphon Signal Processing

    Authors: Alejandro Parada-Mayorga, Alejandro Ribeiro

    Abstract: In this work, we study the properties of sampling sets on families of large graphs by leveraging the theory of graphons and graph limits. To this end, we extend to graphon signals the notion of removable and uniqueness sets, which was developed originally for the analysis of signals on graphs. We state the formal definition of a $\Lambda$-removable set and conditions under which a bandlimited graphon si…

    Submitted 1 June, 2025; v1 submitted 11 January, 2024; originally announced January 2024.

  44. arXiv:2401.04855  [pdf, other]

    cs.RO cs.LG

    LPAC: Learnable Perception-Action-Communication Loops with Applications to Coverage Control

    Authors: Saurav Agarwal, Ramya Muthukrishnan, Walker Gosrich, Vijay Kumar, Alejandro Ribeiro

    Abstract: Coverage control is the problem of navigating a robot swarm to collaboratively monitor features or a phenomenon of interest not known a priori. The problem is challenging in decentralized settings with robots that have limited communication and sensing capabilities. We propose a learnable Perception-Action-Communication (LPAC) architecture for the problem, wherein a convolution neural network (CNN…

    Submitted 8 February, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

  45. arXiv:2312.17194  [pdf, other]

    math.OC cs.LG eess.SY

    Resilient Constrained Reinforcement Learning

    Authors: Dongsheng Ding, Zhengyan Huan, Alejandro Ribeiro

    Abstract: We study a class of constrained reinforcement learning (RL) problems in which multiple constraint specifications are not identified before training. It is challenging to identify appropriate constraint specifications due to the undefined trade-off between the reward maximization objective and the constraint satisfaction, which is ubiquitous in constrained decision-making. To tackle this issue, we…

    Submitted 29 December, 2023; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: 42 pages, 25 figures; HTML converted

  46. arXiv:2312.15788  [pdf, other]

    cs.LG eess.SP

    Robust Stochastically-Descending Unrolled Networks

    Authors: Samar Hadou, Navid NaderiAlizadeh, Alejandro Ribeiro

    Abstract: Deep unrolling, or unfolding, is an emerging learning-to-optimize method that unrolls a truncated iterative algorithm in the layers of a trainable neural network. However, the convergence guarantees and generalizability of the unrolled networks are still open theoretical problems. To tackle these problems, we provide deep unrolled architectures with a stochastic descent nature by imposing descendi…

    Submitted 29 November, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

  47. arXiv:2312.02365  [pdf, other]

    eess.IV cs.CV

    MEDPSeg: Hierarchical polymorphic multitask learning for the segmentation of ground-glass opacities, consolidation, and pulmonary structures on computed tomography

    Authors: Diedre S. Carmo, Jean A. Ribeiro, Alejandro P. Comellas, Joseph M. Reinhardt, Sarah E. Gerard, Letícia Rittner, Roberto A. Lotufo

    Abstract: The COVID-19 pandemic response highlighted the potential of deep learning methods in facilitating the diagnosis, prognosis and understanding of lung diseases through automated segmentation of pulmonary structures and lesions in chest computed tomography (CT). Automated separation of lung lesion into ground-glass opacity (GGO) and consolidation is hindered due to the labor-intensive and subjective…

    Submitted 25 March, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: This manuscript is under review and might change in the future

  48. arXiv:2311.17847  [pdf, other]

    cs.DC

    FastSample: Accelerating Distributed Graph Neural Network Training for Billion-Scale Graphs

    Authors: Hesham Mostafa, Adam Grabowski, Md Asadullah Turja, Juan Cervino, Alejandro Ribeiro, Nageen Himayat

    Abstract: Training Graph Neural Networks (GNNs) on a large monolithic graph presents unique challenges as the graph cannot fit within a single machine and it cannot be decomposed into smaller disconnected components. Distributed sampling-based training distributes the graph across multiple machines and trains the GNN on small parts of the graph that are randomly sampled every training iteration. We show that…

    Submitted 29 November, 2023; originally announced November 2023.

  49. arXiv:2311.03053  [pdf, other]

    cs.CV cs.AI

    Masking Hyperspectral Imaging Data with Pretrained Models

    Authors: Elias Arbash, Andréa de Lima Ribeiro, Sam Thiele, Nina Gnann, Behnood Rasti, Margret Fuchs, Pedram Ghamisi, Richard Gloaguen

    Abstract: The presence of undesired background areas associated with potential noise and unknown spectral characteristics degrades the performance of hyperspectral data processing. Masking out unwanted regions is key to addressing this issue. Processing only regions of interest yields notable improvements in terms of computational costs, required memory, and overall performance. The proposed processing pipe…

    Submitted 6 November, 2023; originally announced November 2023.

  50. arXiv:2310.10807  [pdf, other]

    stat.ML cs.CR cs.LG math.OC

    Regularization properties of adversarially-trained linear regression

    Authors: Antônio H. Ribeiro, Dave Zachariah, Francis Bach, Thomas B. Schön

    Abstract: State-of-the-art machine learning models can be vulnerable to very small input perturbations that are adversarially constructed. Adversarial training is an effective approach to defend against it. Formulated as a min-max problem, it searches for the best solution when the training data were corrupted by the worst-case attacks. Linear models are among the simple models where vulnerabilities can be…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted (spotlight) NeurIPS 2023; A preliminary version of this work titled: "Surprises in adversarially-trained linear regression" was made available under a different identifier: arXiv:2205.12695