Skip to main content

Showing 1–50 of 105 results for author: Ribeiro, A

Searching in archive math. Search in all archives.
.
  1. arXiv:2506.06557  [pdf, ps, other

    cs.IR cs.LG eess.SP math.MG

    Infinity Search: Approximate Vector Search with Projections on q-Metric Spaces

    Authors: Antonio Pariente, Ignacio Hounie, Santiago Segarra, Alejandro Ribeiro

    Abstract: Despite the ubiquity of vector search applications, prevailing search algorithms overlook the metric structure of vector embeddings, treating it as a constraint rather than exploiting its underlying properties. In this paper, we demonstrate that in $q$-metric spaces, metric trees can leverage a stronger version of the triangle inequality to reduce comparisons for exact search. Notably, as $q$ appr… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

  2. arXiv:2505.19387  [pdf, ps, other

    cs.LG eess.SY math.OC

    Alignment of large language models with constrained learning

    Authors: Botong Zhang, Shuo Li, Ignacio Hounie, Osbert Bastani, Dongsheng Ding, Alejandro Ribeiro

    Abstract: We study the problem of computing an optimal large language model (LLM) policy for a constrained alignment problem, where the goal is to maximize a primary reward objective while satisfying constraints on secondary utilities. Despite the popularity of Lagrangian-based LLM policy search in constrained alignment, iterative primal-dual methods often fail to converge, and non-iterative dual-based meth… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 48 pages, 7 figures, 7 tables

  3. arXiv:2410.12677  [pdf, other

    stat.ML cs.CR cs.LG math.OC

    Efficient Optimization Algorithms for Linear Adversarial Training

    Authors: Antônio H. RIbeiro, Thomas B. Schön, Dave Zahariah, Francis Bach

    Abstract: Adversarial training can be used to learn models that are robust against perturbations. For linear models, it can be formulated as a convex optimization problem. Compared to methods proposed in the context of deep learning, leveraging the optimization structure allows significantly faster convergence rates. Still, the use of generic convex solvers can be inefficient for large-scale problems. Here,… ▽ More

    Submitted 19 March, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: Paper accepted at AISTATS 2025

  4. arXiv:2410.08331  [pdf, other

    math.OC

    Fejér* monotonicity in optimization algorithms

    Authors: Roger Behling, Yunier Bello-Cruz, Alfredo Noel Iusem, Ademir Alves Ribeiro, Luiz-Rafael Santos

    Abstract: Fejér monotonicity is a well-established property commonly observed in sequences generated by optimization algorithms. In this paper, we introduce an extension of this property, called Fejér* monotonicity, which was initially proposed in [SIAM J. Optim., 34(3), 2535-2556 (2024)]. We discuss and build upon the concept by exploring its behavior within Hilbert spaces, presenting an illustrative examp… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    MSC Class: 49M27; 65K05; 65B99; 90C25

  5. arXiv:2408.15094  [pdf, other

    cs.LG cs.CV math.OC

    Constrained Diffusion Models via Dual Training

    Authors: Shervin Khalafi, Dongsheng Ding, Alejandro Ribeiro

    Abstract: Diffusion models have attained prominence for their ability to synthesize a probability distribution for a given dataset via a diffusion process, enabling the generation of new data points with high fidelity. However, diffusion processes are prone to generating samples that reflect biases in a training dataset. To address this issue, we develop constrained diffusion models by imposing diffusion co… ▽ More

    Submitted 22 November, 2024; v1 submitted 27 August, 2024; originally announced August 2024.

    Comments: 31 pages, 4 figures, 4 tables

  6. arXiv:2408.10015  [pdf, other

    cs.AI math.OC

    Deterministic Policy Gradient Primal-Dual Methods for Continuous-Space Constrained MDPs

    Authors: Sergio Rozada, Dongsheng Ding, Antonio G. Marques, Alejandro Ribeiro

    Abstract: We study the problem of computing deterministic optimal policies for constrained Markov decision processes (MDPs) with continuous state and action spaces, which are widely encountered in constrained dynamical systems. Designing deterministic policy gradient methods in continuous state and action spaces is particularly challenging due to the lack of enumerable state-action pairs and the adoption of… ▽ More

    Submitted 4 April, 2025; v1 submitted 19 August, 2024; originally announced August 2024.

  7. arXiv:2405.10490  [pdf

    stat.ME cs.AI cs.IR cs.LG math.OC

    Neural Optimization with Adaptive Heuristics for Intelligent Marketing System

    Authors: Changshuai Wei, Benjamin Zelditch, Joyce Chen, Andre Assuncao Silva T Ribeiro, Jingyi Kenneth Tay, Borja Ocejo Elizondo, Keerthi Selvaraj, Aman Gupta, Licurgo Benemann De Almeida

    Abstract: Computational marketing has become increasingly important in today's digital world, facing challenges such as massive heterogeneous data, multi-channel customer journeys, and limited marketing budgets. In this paper, we propose a general framework for marketing AI systems, the Neural Optimization with Adaptive Heuristics (NOAH) framework. NOAH is the first general framework for marketing optimizat… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: KDD 2024

    ACM Class: G.3; G.1.6; I.2

  8. arXiv:2403.11844  [pdf, other

    cs.LG eess.SP math.OC

    Near-Optimal Solutions of Constrained Learning Problems

    Authors: Juan Elenter, Luiz F. O. Chamon, Alejandro Ribeiro

    Abstract: With the widespread adoption of machine learning systems, the need to curtail their behavior has become increasingly apparent. This is evidenced by recent advancements towards developing models that satisfy robustness, safety, and fairness requirements. These requirements can be imposed (with generalization guarantees) by formulating constrained learning problems that can then be tackled by dual a… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  9. arXiv:2312.17194  [pdf, other

    math.OC cs.LG eess.SY

    Resilient Constrained Reinforcement Learning

    Authors: Dongsheng Ding, Zhengyan Huan, Alejandro Ribeiro

    Abstract: We study a class of constrained reinforcement learning (RL) problems in which multiple constraint specifications are not identified before training. It is challenging to identify appropriate constraint specifications due to the undefined trade-off between the reward maximization objective and the constraint satisfaction, which is ubiquitous in constrained decision-making. To tackle this issue, we… ▽ More

    Submitted 29 December, 2023; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: 42 pages, 25 figures; HTML converted

  10. arXiv:2310.10807  [pdf, other

    stat.ML cs.CR cs.LG math.OC

    Regularization properties of adversarially-trained linear regression

    Authors: Antônio H. Ribeiro, Dave Zachariah, Francis Bach, Thomas B. Schön

    Abstract: State-of-the-art machine learning models can be vulnerable to very small input perturbations that are adversarially constructed. Adversarial training is an effective approach to defend against it. Formulated as a min-max problem, it searches for the best solution when the training data were corrupted by the worst-case attacks. Linear models are among the simple models where vulnerabilities can be… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted (spotlight) NeurIPS 2023; A preliminary version of this work titled: "Surprises in adversarially-trained linear regression" was made available under a different identifier: arXiv:2205.12695

  11. arXiv:2309.14284  [pdf, other

    math.OC cs.RO

    Navigation with shadow prices to optimize multi-commodity flow rates

    Authors: Ignacio Boero, Igor Spasojevic, Mariana del Castillo, George Pappas, Vijay Kumar, Alejandro Ribeiro

    Abstract: We propose a method for providing communication network infrastructure in autonomous multi-agent teams. In particular, we consider a set of communication agents that are placed alongside regular agents from the system in order to improve the rate of information transfer between the latter. In order to find the optimal positions to place such agents, we define a flexible performance function that a… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: (c) 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  12. arXiv:2309.10190  [pdf, ps, other

    math.AP math.OC

    Revisited convexity notions for $L^\infty$ variational problems

    Authors: Ana Margarida Ribeiro, Elvira Zappale

    Abstract: We address a deep study of the convexity notions that arise in the study of weak* lower semicontinuity of supremal functionals as well as those raised by the power-law approximation of such functionals. Our quest is motivated by the knowledge we have on the analogous integral functionals and aims at establishing a solid groundwork to ease any research in the $L^\infty$ context.

    Submitted 18 September, 2023; originally announced September 2023.

    MSC Class: 26B25; 49J45

  13. arXiv:2306.11700  [pdf, other

    math.OC cs.LG eess.SY

    Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs

    Authors: Dongsheng Ding, Chen-Yu Wei, Kaiqing Zhang, Alejandro Ribeiro

    Abstract: We study the problem of computing an optimal policy of an infinite-horizon discounted constrained Markov decision process (constrained MDP). Despite the popularity of Lagrangian-based policy search methods used in practice, the oscillation of policy iterates in these methods has not been fully understood, bringing out issues such as violation of constraints and sensitivity to hyper-parameters. To… ▽ More

    Submitted 16 January, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: 65 pages, 17 figures, and 1 table; NeurIPS 2023

  14. arXiv:2306.08737  [pdf, other

    cs.RO cs.IT math.OC

    A Networked Multi-Agent System for Mobile Wireless Infrastructure on Demand

    Authors: Miguel Calvo-Fullana, Mikhail Gerasimenko, Daniel Mox, Leopoldo Agorio, Mariana del Castillo, Vijay Kumar, Alejandro Ribeiro, Juan Andres Bazerque

    Abstract: Despite the prevalence of wireless connectivity in urban areas around the globe, there remain numerous and diverse situations where connectivity is insufficient or unavailable. To address this, we introduce mobile wireless infrastructure on demand, a system of UAVs that can be rapidly deployed to establish an ad-hoc wireless network. This network has the capability of reconfiguring itself dynamica… ▽ More

    Submitted 16 September, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

  15. arXiv:2209.06642  [pdf

    math.OC cs.LG stat.ME

    A Robust Scientific Machine Learning for Optimization: A Novel Robustness Theorem

    Authors: Luana P. Queiroz, Carine M. Rebello, Erber A. Costa, Vinicius V. Santana, Alirio E. Rodrigues, Ana M. Ribeiro, Idelfonso B. R. Nogueira

    Abstract: Scientific machine learning (SciML) is a field of increasing interest in several different application fields. In an optimization context, SciML-based tools have enabled the development of more efficient optimization methods. However, implementing SciML tools for optimization must be rigorously evaluated and performed with caution. This work proposes the deductions of a robustness test that guaran… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

  16. arXiv:2205.12695  [pdf, other

    stat.ML cs.CR cs.LG eess.SP math.ST

    Surprises in adversarially-trained linear regression

    Authors: Antônio H. Ribeiro, Dave Zachariah, Thomas B. Schön

    Abstract: State-of-the-art machine learning models can be vulnerable to very small input perturbations that are adversarially constructed. Adversarial training is an effective approach to defend against such examples. It is formulated as a min-max problem, searching for the best solution when the training data was corrupted by the worst-case attacks. For linear regression problems, adversarial training can… ▽ More

    Submitted 20 October, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

  17. arXiv:2204.12524  [pdf, other

    math.OC

    On strong second-order optimality conditions under relaxed constant rank constraint qualification

    Authors: Ademir Alves Ribeiro, Mael Sachine

    Abstract: We discuss the (first- and second-order) optimality conditions for nonlinear programming under the relaxed constant rank constraint qualification. This condition generalizes the so-called linear independence constraint qualification. Although the optimality conditions are well established in the literature, the proofs presented here are based solely on the well-known inverse function theorem. This… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

  18. arXiv:2204.06274  [pdf, other

    stat.ML cs.CR cs.LG eess.SP math.ST

    Overparameterized Linear Regression under Adversarial Attacks

    Authors: Antônio H. Ribeiro, Thomas B. Schön

    Abstract: We study the error of linear regression in the face of adversarial attacks. In this framework, an adversary changes the input to the regression model in order to maximize the prediction error. We provide bounds on the prediction error in the presence of an adversary as a function of the parameter norm and the error in the absence of such an adversary. We show how these bounds make it possible to s… ▽ More

    Submitted 27 January, 2023; v1 submitted 13 April, 2022; originally announced April 2022.

  19. arXiv:2202.04108  [pdf, other

    cs.LG math.OC

    A Lagrangian Duality Approach to Active Learning

    Authors: Juan Elenter, Navid NaderiAlizadeh, Alejandro Ribeiro

    Abstract: We consider the pool-based active learning problem, where only a subset of the training data is labeled, and the goal is to query a batch of unlabeled samples to be labeled so as to maximally improve model performance. We formulate the problem using constrained learning, where a set of constraints bounds the performance of the model on labeled samples. Considering a primal-dual approach, we optimi… ▽ More

    Submitted 29 October, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

  20. arXiv:2112.07564  [pdf, other

    math.OC eess.SY

    Linear Quadratic Control with Risk Constraints

    Authors: Anastasios Tsiamis, Dionysios S. Kalogerias, Alejandro Ribeiro, George J. Pappas

    Abstract: We propose a new risk-constrained formulation of the classical Linear Quadratic (LQ) stochastic control problem for general partially-observed systems. Our framework is motivated by the fact that the risk-neutral LQ controllers, although optimal in expectation, might be ineffective under relatively infrequent, yet statistically significant extreme events. To effectively trade between average and e… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: 32 pages, under review. arXiv admin note: substantial text overlap with arXiv:2004.04685

  21. A note on gradient Ricci soliton warped metrics

    Authors: José N. V. Gomes, Marcus A. M. Marrocos, Adrian V. C. Ribeiro

    Abstract: In this note, we prove triviality and nonexistence results for gradient Ricci soliton warped metrics. The proofs stem from the construction of gradient Ricci solitons that are realized as warped products, from which we know that the base spaces of these products are Ricci-Hessian type manifolds. We study this latter class of manifolds as the most appropriate setting to prove our results.

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: 12 pages

    MSC Class: 53C21; 53C25; 53C15

    Journal ref: Mathematische Nachrichten 294 (2021) 1879-1888

  22. arXiv:2103.05134  [pdf, other

    cs.LG math.ST stat.ML

    Constrained Learning with Non-Convex Losses

    Authors: Luiz F. O. Chamon, Santiago Paternain, Miguel Calvo-Fullana, Alejandro Ribeiro

    Abstract: Though learning has become a core component of modern information processing, there is now ample evidence that it can lead to biased, unsafe, and prejudiced systems. The need to impose requirements on learning is therefore paramount, especially as it reaches critical applications in social, industrial, and medical domains. However, the non-convexity of most modern statistical problems is only exac… ▽ More

    Submitted 19 October, 2022; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: IEEE Transactions on Information Theory

  23. arXiv:2102.11941  [pdf, other

    cs.LG cs.RO math.OC

    State Augmented Constrained Reinforcement Learning: Overcoming the Limitations of Learning with Rewards

    Authors: Miguel Calvo-Fullana, Santiago Paternain, Luiz F. O. Chamon, Alejandro Ribeiro

    Abstract: A common formulation of constrained reinforcement learning involves multiple rewards that must individually accumulate to given thresholds. In this class of problems, we show a simple example in which the desired optimal policy cannot be induced by any weighted linear combination of rewards. Hence, there exist constrained reinforcement learning problems for which neither regularized nor classical… ▽ More

    Submitted 21 September, 2023; v1 submitted 23 February, 2021; originally announced February 2021.

  24. arXiv:2011.12344  [pdf, other

    cs.LG math.OC

    Trust but Verify: Assigning Prediction Credibility by Counterfactual Constrained Learning

    Authors: Luiz F. O. Chamon, Santiago Paternain, Alejandro Ribeiro

    Abstract: Prediction credibility measures, in the form of confidence intervals or probability distributions, are fundamental in statistics and machine learning to characterize model robustness, detect out-of-distribution samples (outliers), and protect against adversarial attacks. To be effective, these measures should (i) account for the wide variety of models used in practice, (ii) be computable for train… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

  25. arXiv:2008.03158  [pdf, ps, other

    math.OC

    A sequential optimality condition for Mathematical Programs with Cardinality Constraints

    Authors: Evelin H. M. Krulikovski, Ademir A. Ribeiro, Mael Sachine

    Abstract: In this paper we propose an Approximate Weak stationarity ($AW$-stationarity) concept designed to deal with {\em Mathematical Programs with Cardinality Constraints} (MPCaC), and we proved that it is a legitimate optimality condition independently of any constraint qualification. Such a sequential optimality condition improves weaker stationarity conditions, presented in a previous work. Many resea… ▽ More

    Submitted 5 August, 2020; originally announced August 2020.

    Comments: 23 pages. arXiv admin note: text overlap with arXiv:2008.00019

    MSC Class: 90C30; 90C33; 90C46

  26. arXiv:2008.00019  [pdf, ps, other

    math.OC

    On the weak stationarity conditions for Mathematical Programs with Cardinality Constraints: a unified approach

    Authors: Evelin H. M. Krulikovski, Ademir A. Ribeiro, Mael Sachine

    Abstract: In this paper, we study a class of optimization problems, called Mathematical Programs with Cardinality Constraints (MPCaC). This kind of problem is generally difficult to deal with, because it involves a constraint that is not continuous neither convex, but provides sparse solutions. Thereby we reformulate MPCaC in a suitable way, by modeling it as mixed-integer problem and then addressing its co… ▽ More

    Submitted 31 July, 2020; originally announced August 2020.

    MSC Class: 90C30; 90C33; 90C46

  27. arXiv:2006.07314  [pdf, other

    cs.LG math.OC stat.ML

    Zeroth-order Deterministic Policy Gradient

    Authors: Harshat Kumar, Dionysios S. Kalogerias, George J. Pappas, Alejandro Ribeiro

    Abstract: Deterministic Policy Gradient (DPG) removes a level of randomness from standard randomized-action Policy Gradient (PG), and demonstrates substantial empirical success for tackling complex dynamic problems involving Markov decision processes. At the same time, though, DPG loses its ability to learn in a model-free (i.e., actor-only) fashion, frequently necessitating the use of critics in order to o… ▽ More

    Submitted 11 July, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: 18 pages, 5 figures. Fixed some minor oversights in the theoretical development present in the previous version of the manuscript and significantly revised and expanded the simulations sections, both in the main body and supplementary material

  28. arXiv:2006.05487  [pdf, other

    cs.LG math.ST stat.ML

    Probably Approximately Correct Constrained Learning

    Authors: Luiz F. O. Chamon, Alejandro Ribeiro

    Abstract: As learning solutions reach critical applications in social, industrial, and medical domains, the need to curtail their behavior has become paramount. There is now ample evidence that without explicit tailoring, learning can lead to biased, unsafe, and prejudiced solutions. To tackle these problems, we develop a generalization theory of constrained learning based on the probably approximately corr… ▽ More

    Submitted 17 February, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

  29. arXiv:2004.04685  [pdf, other

    eess.SY math.OC

    Risk-Constrained Linear-Quadratic Regulators

    Authors: Anastasios Tsiamis, Dionysios S. Kalogerias, Luiz F. O. Chamon, Alejandro Ribeiro, George J. Pappas

    Abstract: We propose a new risk-constrained reformulation of the standard Linear Quadratic Regulator (LQR) problem. Our framework is motivated by the fact that the classical (risk-neutral) LQR controller, although optimal in expectation, might be ineffective under relatively infrequent, yet statistically significant (risky) events. To effectively trade between average and extreme event performance, we intro… ▽ More

    Submitted 28 October, 2020; v1 submitted 9 April, 2020; originally announced April 2020.

    Comments: In the first version there was a typo in the reported A, B in the Simulations, equation (20). The second version reports the correct A, B matrices

  30. arXiv:2004.03726  [pdf, other

    math.OC

    Resilient Control: Compromising to Adapt

    Authors: Luiz F. O. Chamon, Alexandre Amice, Santiago Paternain, Alejandro Ribeiro

    Abstract: In optimal control problems, disturbances are typically dealt with using robust solutions, such as H-infinity or tube model predictive control, that plan control actions feasible for the worst-case disturbance. Yet, planning for every contingency can lead to over-conservative, poorly performing solutions or even, in extreme cases, to infeasibility. Resilience addresses these shortcomings by adapti… ▽ More

    Submitted 25 August, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

  31. arXiv:2003.08841  [pdf, other

    math.OC eess.SY

    Approximately Supermodular Scheduling Subject to Matroid Constraints

    Authors: Luiz F. O. Chamon, Alexandre Amice, Alejandro Ribeiro

    Abstract: Control scheduling refers to the problem of assigning agents or actuators to act upon a dynamical system at specific times so as to minimize a quadratic control cost, such as the objective of the Linear-quadratic-Gaussian (LQG) or the Linear Quadratic Regulator (LQR). When budget or operational constraints are imposed on the schedule, this problem is in general NP-hard and its solution can therefo… ▽ More

    Submitted 29 March, 2021; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: Accepted for publication in the IEEE Transactions of Automatic Control

  32. arXiv:2002.05183  [pdf, other

    cs.LG math.OC stat.ML

    The empirical duality gap of constrained statistical learning

    Authors: Luiz F. O. Chamon, Santiago Paternain, Miguel Calvo-Fullana, Alejandro Ribeiro

    Abstract: This paper is concerned with the study of constrained statistical learning problems, the unconstrained version of which are at the core of virtually all of modern information processing. Accounting for constraints, however, is paramount to incorporate prior knowledge and impose desired structural and statistical properties on the solutions. Still, solving constrained statistical problems remains c… ▽ More

    Submitted 12 February, 2020; originally announced February 2020.

  33. arXiv:2001.11116  [pdf, other

    math.OC

    Counterfactual Programming for Optimal Control

    Authors: Luiz F. O. Chamon, Santiago Paternain, Alejandro Ribeiro

    Abstract: In recent years, considerable work has been done to tackle the issue of designing control laws based on observations to allow unknown dynamical systems to perform pre-specified tasks. At least as important for autonomy, however, is the issue of learning which tasks can be performed in the first place. This is particularly critical in situations where multiple (possibly conflicting) tasks and requi… ▽ More

    Submitted 5 May, 2020; v1 submitted 29 January, 2020; originally announced January 2020.

  34. Approximate Supermodularity of Kalman Filter Sensor Selection

    Authors: Luiz F. O. Chamon, George J. Pappas, Alejandro Ribeiro

    Abstract: This work considers the problem of selecting sensors in a large scale system to minimize the error in estimating its states. More specifically, the state estimation mean-square error(MSE) and worst-case error for Kalman filtering and smoothing. Such selection problems are in general NP-hard, i.e., their solution can only be approximated in practice even for moderately large problems. Due to its lo… ▽ More

    Submitted 21 February, 2020; v1 submitted 8 December, 2019; originally announced December 2019.

    Comments: Accepted to Transactions on Automatic Control

  35. arXiv:1912.02933  [pdf, other

    math.OC cs.IT eess.SP eess.SY stat.ML

    Risk-Aware MMSE Estimation

    Authors: Dionysios S. Kalogerias, Luiz F. O. Chamon, George J. Pappas, Alejandro Ribeiro

    Abstract: Despite the simplicity and intuitive interpretation of Minimum Mean Squared Error (MMSE) estimators, their effectiveness in certain scenarios is questionable. Indeed, minimizing squared errors on average does not provide any form of stability, as the volatility of the estimation error is left unconstrained. When this volatility is statistically significant, the difference between the average and r… ▽ More

    Submitted 5 December, 2019; originally announced December 2019.

    Comments: 18 pages, 4 figures

  36. arXiv:1911.09101  [pdf, other

    eess.SY cs.LG math.OC

    Safe Policies for Reinforcement Learning via Primal-Dual Methods

    Authors: Santiago Paternain, Miguel Calvo-Fullana, Luiz F. O. Chamon, Alejandro Ribeiro

    Abstract: In this paper, we study the learning of safe policies in the setting of reinforcement learning problems. This is, we aim to control a Markov Decision Process (MDP) of which we do not know the transition probabilities, but we have access to sample trajectories through experience. We define safety as the agent remaining in a desired safe set with high probability during the operation time. We theref… ▽ More

    Submitted 12 January, 2022; v1 submitted 20 November, 2019; originally announced November 2019.

    Comments: arXiv admin note: text overlap with arXiv:1910.13393

  37. arXiv:1911.03988  [pdf, ps, other

    eess.SY cs.LG eess.SP math.OC stat.ML

    Model-Free Learning of Optimal Ergodic Policies in Wireless Systems

    Authors: Dionysios S. Kalogerias, Mark Eisen, George J. Pappas, Alejandro Ribeiro

    Abstract: Learning optimal resource allocation policies in wireless systems can be effectively achieved by formulating finite dimensional constrained programs which depend on system configuration, as well as the adopted learning parameterization. The interest here is in cases where system models are unavailable, prompting methods that probe the wireless system with candidate policies, and then use observed… ▽ More

    Submitted 10 November, 2019; originally announced November 2019.

    Comments: 13 pages, 4 figures

  38. arXiv:1911.00164  [pdf, other

    cs.SI math.MG

    Metric Representations of Networks: A Uniqueness Result

    Authors: Santiago Segarra, T. Mitchell Roddenberry, Facundo Memoli, Alejandro Ribeiro

    Abstract: In this paper, we consider the problem of projecting networks onto metric spaces. Networks are structures that encode relationships between pairs of elements or nodes. However, these relationships can be independent of each other, and need not be defined for every pair of nodes. This is in contrast to a metric space, which requires that a distance between every pair of elements in the space be def… ▽ More

    Submitted 31 October, 2019; originally announced November 2019.

    Comments: 7 pages, 5 figures

  39. arXiv:1910.13393  [pdf, other

    cs.LG math.OC stat.ML

    Constrained Reinforcement Learning Has Zero Duality Gap

    Authors: Santiago Paternain, Luiz F. O. Chamon, Miguel Calvo-Fullana, Alejandro Ribeiro

    Abstract: Autonomous agents must often deal with conflicting requirements, such as completing tasks using the least amount of time/energy, learning multiple tasks, or dealing with multiple opponents. In the context of reinforcement learning~(RL), these problems are addressed by (i)~designing a reward function that simultaneously describes all requirements or (ii)~combining modular value functions that encod… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

  40. arXiv:1910.08412  [pdf, other

    cs.LG math.OC stat.ML

    On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation

    Authors: Harshat Kumar, Alec Koppel, Alejandro Ribeiro

    Abstract: Reinforcement learning, mathematically described by Markov Decision Problems, may be approached either through dynamic programming or policy search. Actor-critic algorithms combine the merits of both approaches by alternating between steps to estimate the value function and policy gradient updates. Due to the fact that the updates exhibit correlated noise and biased gradient updates, only the asym… ▽ More

    Submitted 27 January, 2023; v1 submitted 18 October, 2019; originally announced October 2019.

  41. arXiv:1909.10704  [pdf, other

    cs.RO math.CO

    Graph Policy Gradients for Large Scale Unlabeled Motion Planning with Constraints

    Authors: Arbaaz Khan, Vijay Kumar, Alejandro Ribeiro

    Abstract: In this paper, we present a learning method to solve the unlabelled motion problem with motion constraints and space constraints in 2D space for a large number of robots. To solve the problem of arbitrary dynamics and constraints we propose formulating the problem as a multi-agent problem. In contrast to previous works that propose using learning solutions for unlabelled motion planning with const… ▽ More

    Submitted 24 September, 2019; originally announced September 2019.

  42. arXiv:1909.07496  [pdf, other

    math.OC cs.RO

    Source Seeking in Unknown Environments with Convex Obstacles

    Authors: Bruno A. Angélico, Luiz F. O. Chamon, Santiago Paternain, Alejandro Ribeiro, George J. Pappas

    Abstract: Navigation tasks often cannot be defined in terms of a target, either because global position information is unavailable or unreliable or because target location is not explicitly known a priori. This task is then often defined indirectly as a source seeking problem in which the autonomous agent navigates so as to minimize the convex potential induced by a source while avoiding obstacles. This wor… ▽ More

    Submitted 16 September, 2019; originally announced September 2019.

    Comments: 8 pages, 13 figures, submitted to ICRA 2020

  43. arXiv:1908.08509  [pdf, other

    math.OC

    Navigation of a Quadratic Potential with Ellipsoidal Obstacles

    Authors: Harshat Kumar, Santiago Paternain, Alejandro Ribeiro

    Abstract: Given a convex quadratic potential of which its minimum is the agent's goal and a Euclidean space populated with ellipsoidal obstacles, one can construct a Rimon-Koditschek (RK) artificial potential to navigate. Its negative gradient attracts the agent toward the goal and repels the agent away from the boundary of the obstacles. This is a popular approach to navigation problems since it can be imp… ▽ More

    Submitted 12 September, 2022; v1 submitted 22 August, 2019; originally announced August 2019.

  44. arXiv:1907.01681  [pdf, other

    q-bio.PE math.AP math.PR

    Gradient flow formulations of discrete and continuous evolutionary models: a unifying perspective

    Authors: Fabio A. C. C. Chalub, Léonard Monsaingeon, Ana Margarida Ribeiro, Max O. Souza

    Abstract: We consider three classical models of biological evolution: (i) the Moran process, an example of a reducible Markov Chain; (ii) the Kimura Equation, a particular case of a degenerated Fokker-Planck Diffusion; (iii) the Replicator Equation, a paradigm in Evolutionary Game Theory. While these approaches are not completely equivalent, they are intimately connected, since (ii) is the diffusion approxi… ▽ More

    Submitted 8 October, 2020; v1 submitted 2 July, 2019; originally announced July 2019.

    MSC Class: 35Q92; 60J25; 92D15; 58E30

  45. arXiv:1906.08482  [pdf, other

    cs.LG cs.NE math.DS stat.ML

    Beyond exploding and vanishing gradients: analysing RNN training using attractors and smoothness

    Authors: Antônio H. Ribeiro, Koen Tiels, Luis A. Aguirre, Thomas B. Schön

    Abstract: The exploding and vanishing gradient problem has been the major conceptual principle behind most architecture and training improvements in recurrent neural networks (RNNs) during the last decade. In this paper, we argue that this principle, while powerful, might need some refinement to explain recent developments. We refine the concept of exploding gradients by reformulating the problem in terms o… ▽ More

    Submitted 5 March, 2020; v1 submitted 20 June, 2019; originally announced June 2019.

    Comments: To appear in the Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS), 2020. PMLR: Volume 108. This paper was previously titled "The trade-off between long-term memory and smoothness for recurrent networks". The current version subsumes all previous versions

  46. On the smoothness of nonlinear system identification

    Authors: Antônio H. Ribeiro, Koen Tiels, Jack Umenberger, Thomas B. Schön, Luis A. Aguirre

    Abstract: We shed new light on the \textit{smoothness} of optimization problems arising in prediction error parameter estimation of linear and nonlinear systems. We show that for regions of the parameter space where the model is not contractive, the Lipschitz constant and $β$-smoothness of the objective function might blow up exponentially with the simulation length, making it hard to numerically find minim… ▽ More

    Submitted 7 August, 2020; v1 submitted 2 May, 2019; originally announced May 2019.

    Journal ref: Automatica, vol. 121, 109158, Nov. 2020

  47. arXiv:1904.05222  [pdf, other

    math.HO

    Calculus, constrained minimization and Lagrange multipliers: Is the optimal critical point a local minimizer?

    Authors: Ademir Alves Ribeiro, Jose Renato Ramos Barbosa

    Abstract: In this short note, we discuss how the optimality conditions for the problem of minimizing a multivariate function subject to equality constraints have been dealt with in undergraduate Calculus. We are particularly interested in the 2 or 3-dimensional cases, which are the most common cases in Calculus courses. Besides giving sufficient conditions to a critical point to be a local minimizer, we als… ▽ More

    Submitted 10 April, 2019; originally announced April 2019.

    Comments: 12 pages, 7 figures, the paper is for those who study or teach calculus

    MSC Class: 97Axx

  48. Distributed Constrained Online Learning

    Authors: Santiago Paternain, Soomin Lee, Michael M. Zavlanos, Alejandro Ribeiro

    Abstract: In this paper, we consider groups of agents in a network that select actions in order to satisfy a set of constraints that vary arbitrarily over time and minimize a time-varying function of which they have only local observations. The selection of actions, also called a strategy, is causal and decentralized, i.e., the dynamical system that determines the actions of a given agent depends only on th… ▽ More

    Submitted 14 March, 2019; originally announced March 2019.

  49. arXiv:1903.01540  [pdf, other

    math.OC stat.ML

    A Stochastic Trust Region Method for Non-convex Minimization

    Authors: Zebang Shen, Pan Zhou, Cong Fang, Alejandro Ribeiro

    Abstract: We target the problem of finding a local minimum in non-convex finite-sum minimization. Towards this goal, we first prove that the trust region method with inexact gradient and Hessian estimation can achieve a convergence rate of order $\mathcal{O}(1/{k^{2/3}})$ as long as those differential estimations are sufficiently accurate. Combining such result with a novel Hessian estimator, we propose the… ▽ More

    Submitted 4 March, 2019; originally announced March 2019.

  50. arXiv:1811.00577  [pdf, other

    cs.LG eess.SP math.OC stat.ML

    Functional Nonlinear Sparse Models

    Authors: Luiz F. O. Chamon, Yonina C. Eldar, Alejandro Ribeiro

    Abstract: Signal processing is rich in inherently continuous and often nonlinear applications, such as spectral estimation, optical imaging, and super-resolution microscopy, in which sparsity plays a key role in obtaining state-of-the-art results. Coping with the infinite dimensionality and non-convexity of these problems typically involves discretization and convex relaxations, e.g., using atomic norms. Ne… ▽ More

    Submitted 20 March, 2020; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: Accepted for publication on the IEEE Transactions on Signal Processing