Skip to main content

Showing 1–15 of 15 results for author: Barakat, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.21242  [pdf

    cs.HC cs.LG

    Passive Measurement of Autonomic Arousal in Real-World Settings

    Authors: Samy Abdel-Ghaffar, Isaac Galatzer-Levy, Conor Heneghan, Xin Liu, Sarah Kernasovskiy, Brennan Garrett, Andrew Barakat, Daniel McDuff

    Abstract: The autonomic nervous system (ANS) is activated during stress, which can have negative effects on cardiovascular health, sleep, the immune system, and mental health. While there are ways to quantify ANS activity in laboratories, there is a paucity of methods that have been validated in real-world contexts. We present the Fitbit Body Response Algorithm, an approach to continuous remote measurement… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

  2. arXiv:2504.03592  [pdf, ps, other

    math.OC cs.GT cs.LG

    Optimistic Online Learning in Symmetric Cone Games

    Authors: Anas Barakat, Wayne Lin, John Lazarsfeld, Antonios Varvitsiotis

    Abstract: Optimistic online learning algorithms have led to significant advances in equilibrium computation, particularly for two-player zero-sum games, achieving an iteration complexity of $\mathcal{O}(1/ε)$ to reach an $ε$-saddle point. These advances have been established in normal-form games, where strategies are simplex vectors, and quantum games, where strategies are trace-one positive semidefinite ma… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

  3. arXiv:2410.04108  [pdf, other

    cs.LG cs.AI

    Towards Scalable General Utility Reinforcement Learning: Occupancy Approximation, Sample Complexity and Global Optimality

    Authors: Anas Barakat, Souradip Chakraborty, Peihong Yu, Pratap Tokekar, Amrit Singh Bedi

    Abstract: Reinforcement learning with general utilities has recently gained attention thanks to its ability to unify several problems, including imitation learning, pure exploration, and safe reinforcement learning. However, prior work for solving this general problem in a unified way has only focused on the tabular setting. This is restrictive when considering larger state-action spaces because of the need… ▽ More

    Submitted 26 February, 2025; v1 submitted 5 October, 2024; originally announced October 2024.

    Comments: revised version

  4. arXiv:2410.02605  [pdf, other

    cs.LG cs.AI

    A Prospect-Theoretic Policy Gradient Algorithm for Behavioral Alignment in Reinforcement Learning

    Authors: Olivier Lepel, Anas Barakat

    Abstract: Classical reinforcement learning (RL) typically assumes rational decision-making based on expected utility theory. However, this model has been shown to be empirically inconsistent with actual human preferences, as evidenced in psychology and behavioral economics. Cumulative Prospect Theory (CPT) provides a more nuanced model for human-based decision-making, capturing diverse attitudes and percept… ▽ More

    Submitted 26 February, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: revised version

  5. arXiv:2408.08075  [pdf, ps, other

    cs.LG cs.GT cs.MA

    Independent Policy Mirror Descent for Markov Potential Games: Scaling to Large Number of Players

    Authors: Pragnya Alatur, Anas Barakat, Niao He

    Abstract: Markov Potential Games (MPGs) form an important sub-class of Markov games, which are a common framework to model multi-agent reinforcement learning problems. In particular, MPGs include as a special case the identical-interest setting where all the agents share the same reward function. Scaling the performance of Nash equilibrium learning algorithms to a large number of agents is crucial for multi… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 16 pages, CDC 2024

    Journal ref: CDC 2024 - Proceedings of the 63rd IEEE Conference on Decision and Control

  6. arXiv:2404.16244  [pdf, other

    cs.CY

    The Ethics of Advanced AI Assistants

    Authors: Iason Gabriel, Arianna Manzini, Geoff Keeling, Lisa Anne Hendricks, Verena Rieser, Hasan Iqbal, Nenad Tomašev, Ira Ktena, Zachary Kenton, Mikel Rodriguez, Seliem El-Sayed, Sasha Brown, Canfer Akbulut, Andrew Trask, Edward Hughes, A. Stevie Bergman, Renee Shelby, Nahema Marchal, Conor Griffin, Juan Mateos-Garcia, Laura Weidinger, Winnie Street, Benjamin Lange, Alex Ingerman, Alison Lentz , et al. (32 additional authors not shown)

    Abstract: This paper focuses on the opportunities and the ethical and societal risks posed by advanced AI assistants. We define advanced AI assistants as artificial agents with natural language interfaces, whose function is to plan and execute sequences of actions on behalf of a user, across one or more domains, in line with the user's expectations. The paper starts by considering the technology itself, pro… ▽ More

    Submitted 28 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

  7. arXiv:2403.14156  [pdf, other

    cs.LG cs.AI stat.ML

    Policy Mirror Descent with Lookahead

    Authors: Kimon Protopapas, Anas Barakat

    Abstract: Policy Mirror Descent (PMD) stands as a versatile algorithmic framework encompassing several seminal policy gradient algorithms such as natural policy gradient, with connections with state-of-the-art reinforcement learning (RL) algorithms such as TRPO and PPO. PMD can be seen as a soft Policy Iteration algorithm implementing regularized 1-step greedy policy improvement. However, 1-step greedy poli… ▽ More

    Submitted 6 November, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

  8. arXiv:2402.17885  [pdf, other

    cs.LG cs.GT cs.MA

    Independent Learning in Constrained Markov Potential Games

    Authors: Philip Jordan, Anas Barakat, Niao He

    Abstract: Constrained Markov games offer a formal mathematical framework for modeling multi-agent reinforcement learning problems where the behavior of the agents is subject to constraints. In this work, we focus on the recently introduced class of constrained Markov Potential Games. While centralized algorithms have been proposed for solving such constrained games, the design of converging independent lear… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: AISTATS 2024

  9. arXiv:2309.04272  [pdf, other

    eess.SY cs.GT cs.LG

    Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity and Last-Iterate Convergence

    Authors: Jiduan Wu, Anas Barakat, Ilyas Fatkhullin, Niao He

    Abstract: Zero-sum Linear Quadratic (LQ) games are fundamental in optimal control and can be used (i)~as a dynamic game formulation for risk-sensitive or robust control and (ii)~as a benchmark setting for multi-agent reinforcement learning with two competing agents in continuous state-control spaces. In contrast to the well-studied single-agent linear quadratic regulator problem, zero-sum LQ games entail so… ▽ More

    Submitted 31 October, 2023; v1 submitted 8 September, 2023; originally announced September 2023.

  10. arXiv:2307.05795  [pdf

    cs.HC

    Research Protocol for the Google Health Digital Well-being Study

    Authors: Daniel McDuff, Andrew Barakat, Ari Winbush, Allen Jiang, Felicia Cordeiro, Ryann Crowley, Lauren E. Kahn, John Hernandez, Nicholas B. Allen

    Abstract: The impact of digital device use on health and well-being is a pressing question to which individuals, families, schools, policy makers, legislators, and digital designers are all demanding answers. However, the scientific literature on this topic to date is marred by small and/or unrepresentative samples, poor measurement of core constructs (e.g., device use, smartphone addiction), and a limited… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  11. arXiv:2306.01854  [pdf, other

    cs.LG math.OC

    Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space

    Authors: Anas Barakat, Ilyas Fatkhullin, Niao He

    Abstract: We consider the reinforcement learning (RL) problem with general utilities which consists in maximizing a function of the state-action occupancy measure. Beyond the standard cumulative reward RL setting, this problem includes as particular cases constrained RL, pure exploration and learning from demonstrations among others. For this problem, we propose a simpler single-loop parameter-free normaliz… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: 48 pages, 2 figures, ICML 2023, this paper was initially submitted in January 26th 2023

    Journal ref: Proceedings of the Fortieth International Conference on Machine Learning (ICML 2023)

  12. arXiv:2302.01734  [pdf, other

    cs.LG math.OC

    Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies

    Authors: Ilyas Fatkhullin, Anas Barakat, Anastasia Kireeva, Niao He

    Abstract: Recently, the impressive empirical success of policy gradient (PG) methods has catalyzed the development of their theoretical foundations. Despite the huge efforts directed at the design of efficient stochastic PG-type algorithms, the understanding of their convergence to a globally optimal policy is still limited. In this work, we develop improved global convergence guarantees for a general class… ▽ More

    Submitted 8 November, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: This work was initially submitted in October 2022

    MSC Class: 90C26; 90C15 ACM Class: G.1.6

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:9827-9869, 2023

  13. arXiv:2106.07472  [pdf, ps, other

    cs.LG math.OC stat.ML

    Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation

    Authors: Anas Barakat, Pascal Bianchi, Julien Lehmann

    Abstract: Actor-critic methods integrating target networks have exhibited a stupendous empirical success in deep reinforcement learning. However, a theoretical understanding of the use of target networks in actor-critic methods is largely missing in the literature. In this paper, we reduce this gap between theory and practice by proposing the first theoretical analysis of an online target-based actor-critic… ▽ More

    Submitted 22 February, 2022; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: 50 pages

    Journal ref: AISTATS 2022

  14. arXiv:1911.07596  [pdf, other

    math.OC cs.LG stat.ML

    Convergence Analysis of a Momentum Algorithm with Adaptive Step Size for Non Convex Optimization

    Authors: Anas Barakat, Pascal Bianchi

    Abstract: Although ADAM is a very popular algorithm for optimizing the weights of neural networks, it has been recently shown that it can diverge even in simple convex optimization examples. Several variants of ADAM have been proposed to circumvent this convergence issue. In this work, we study the ADAM algorithm for smooth nonconvex optimization under a boundedness assumption on the adaptive learning rate.… ▽ More

    Submitted 24 September, 2020; v1 submitted 18 November, 2019; originally announced November 2019.

    Comments: 28 pages, 1 figure, published in ACML2020

  15. arXiv:1810.02263  [pdf, ps, other

    stat.ML cs.LG math.CA math.DS math.OC

    Convergence and Dynamical Behavior of the ADAM Algorithm for Non-Convex Stochastic Optimization

    Authors: Anas Barakat, Pascal Bianchi

    Abstract: Adam is a popular variant of stochastic gradient descent for finding a local minimizer of a function. In the constant stepsize regime, assuming that the objective function is differentiable and non-convex, we establish the convergence in the long run of the iterates to a stationary point under a stability condition. The key ingredient is the introduction of a continuous-time version of Adam, under… ▽ More

    Submitted 13 May, 2020; v1 submitted 4 October, 2018; originally announced October 2018.

    Comments: 30 pages