Skip to main content

Showing 1–33 of 33 results for author: Doan, T T

Searching in archive math. Search in all archives.
.
  1. arXiv:2409.07767  [pdf, other

    math.OC

    Accelerated Multi-Time-Scale Stochastic Approximation: Optimal Complexity and Applications in Reinforcement Learning and Multi-Agent Games

    Authors: Sihan Zeng, Thinh T. Doan

    Abstract: Multi-time-scale stochastic approximation is an iterative algorithm for finding the fixed point of a set of $N$ coupled operators given their noisy samples. It has been observed that due to the coupling between the decision variables and noisy samples of the operators, the performance of this method decays as $N$ increases. In this work, we develop a new accelerated variant of multi-time-scale sto… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

  2. arXiv:2409.03092  [pdf, other

    math.OC

    Resilient Two-Time-Scale Local Stochastic Gradient Descent for Byzantine Federated Learning

    Authors: Amit Dutta, Thinh T. Doan

    Abstract: We study local stochastic gradient descent methods for solving federated optimization over a network of agents communicating indirectly through a centralized coordinator. We are interested in the Byzantine setting where there is a subset of $f$ malicious agents that could observe the entire network and send arbitrary values to the coordinator to disrupt the performance of other non-faulty agents.… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

  3. arXiv:2405.09660  [pdf, other

    math.OC cs.LG

    Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning

    Authors: Sihan Zeng, Thinh T. Doan

    Abstract: Two-time-scale optimization is a framework introduced in Zeng et al. (2024) that abstracts a range of policy evaluation and policy optimization problems in reinforcement learning (RL). Akin to bi-level optimization under a particular type of stochastic oracle, the two-time-scale optimization framework has an upper level objective whose gradient evaluation depends on the solution of a lower level p… ▽ More

    Submitted 2 March, 2025; v1 submitted 15 May, 2024; originally announced May 2024.

  4. arXiv:2405.02456  [pdf, ps, other

    math.OC cs.LG

    Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning

    Authors: Sihan Zeng, Thinh T. Doan, Justin Romberg

    Abstract: Multi-task reinforcement learning (RL) aims to find a single policy that effectively solves multiple tasks at the same time. This paper presents a constrained formulation for multi-task RL where the goal is to maximize the average performance of the policy across tasks subject to bounds on the performance in each task. We consider solving this problem both in the centralized setting, where informa… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  5. arXiv:2401.12764  [pdf, other

    math.OC cs.LG

    Fast Nonlinear Two-Time-Scale Stochastic Approximation: Achieving $O(1/k)$ Finite-Sample Complexity

    Authors: Thinh T. Doan

    Abstract: This paper proposes to develop a new variant of the two-time-scale stochastic approximation to find the roots of two coupled nonlinear operators, assuming only noisy samples of these operators can be observed. Our key idea is to leverage the classic Ruppert-Polyak averaging technique to dynamically estimate the operators through their samples. The estimated values of these averaging steps will the… ▽ More

    Submitted 22 March, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  6. arXiv:2312.10189  [pdf, other

    math.OC

    Resilient Federated Learning under Byzantine Attack in Distributed Nonconvex Optimization with 2-f Redundancy

    Authors: Amit Dutta, Thinh T. Doan, Jeffrey H. Reed

    Abstract: We study the problem of Byzantine fault tolerance in a distributed optimization setting, where there is a group of $N$ agents communicating with a trusted centralized coordinator. Among these agents, there is a subset of $f$ agents that may not follow a prescribed algorithm and may share arbitrarily incorrect information with the coordinator. The goal is to find the optimizer of the aggregate cost… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  7. arXiv:2303.12981  [pdf, other

    cs.LG math.OC

    Connected Superlevel Set in (Deep) Reinforcement Learning and its Application to Minimax Theorems

    Authors: Sihan Zeng, Thinh T. Doan, Justin Romberg

    Abstract: The aim of this paper is to improve the understanding of the optimization landscape for policy optimization problems in reinforcement learning. Specifically, we show that the superlevel set of the objective function with respect to the policy parameter is always a connected set both in the tabular setting and under policies represented by a class of neural networks. In addition, we show that the o… ▽ More

    Submitted 30 September, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  8. arXiv:2205.13746  [pdf, other

    math.OC cs.LG

    Regularized Gradient Descent Ascent for Two-Player Zero-Sum Markov Games

    Authors: Sihan Zeng, Thinh T. Doan, Justin Romberg

    Abstract: We study the problem of finding the Nash equilibrium in a two-player zero-sum Markov game. Due to its formulation as a minimax optimization program, a natural approach to solve the problem is to perform gradient descent/ascent with respect to each player in an alternating fashion. However, due to the non-convexity/non-concavity of the underlying objective function, theoretical understandings of th… ▽ More

    Submitted 12 October, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

  9. arXiv:2112.09579  [pdf, ps, other

    math.OC cs.GT cs.LG

    Convergence Rates of Two-Time-Scale Gradient Descent-Ascent Dynamics for Solving Nonconvex Min-Max Problems

    Authors: Thinh T. Doan

    Abstract: There are much recent interests in solving noncovnex min-max optimization problems due to its broad applications in many areas including machine learning, networked resource allocations, and distributed optimization. Perhaps, the most popular first-order method in solving min-max optimization is the so-called simultaneous (or single-loop) gradient descent-ascent algorithm due to its simplicity in… ▽ More

    Submitted 17 December, 2021; originally announced December 2021.

  10. arXiv:2110.11383  [pdf, other

    math.OC cs.LG

    Finite-Time Complexity of Online Primal-Dual Natural Actor-Critic Algorithm for Constrained Markov Decision Processes

    Authors: Sihan Zeng, Thinh T. Doan, Justin Romberg

    Abstract: We consider a discounted cost constrained Markov decision process (CMDP) policy optimization problem, in which an agent seeks to maximize a discounted cumulative reward subject to a number of constraints on discounted cumulative utilities. To solve this constrained optimization program, we study an online actor-critic variant of a classic primal-dual method where the gradients of both the primal a… ▽ More

    Submitted 19 November, 2024; v1 submitted 21 October, 2021; originally announced October 2021.

  11. arXiv:2110.06992  [pdf, other

    math.OC

    Convergence Rates of Decentralized Gradient Methods over Cluster Networks

    Authors: Amit Dutta, Nila Masrourisaadat, Thinh T. Doan

    Abstract: We present an analysis for the performance of decentralized consensus-based gradient (DCG) methods for solving optimization problems over a cluster network of nodes. This type of network is composed of a number of densely connected clusters with a sparse connection between them. Decentralized algorithms over cluster networks have been observed to constitute two-time-scale dynamics, where informati… ▽ More

    Submitted 13 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: text overlap with arXiv:2104.07781

  12. arXiv:2109.14756  [pdf, other

    math.OC cs.LG

    A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning

    Authors: Sihan Zeng, Thinh T. Doan, Justin Romberg

    Abstract: We study a new two-time-scale stochastic gradient method for solving optimization problems, where the gradients are computed with the aid of an auxiliary variable under samples generated by time-varying MDPs controlled by the underlying optimization variable. These time-varying samples make gradient directions in our update biased and dependent, which can potentially lead to the divergence of the… ▽ More

    Submitted 23 August, 2024; v1 submitted 29 September, 2021; originally announced September 2021.

  13. arXiv:2107.07061  [pdf, other

    math.OC eess.SY

    Distributed Dual Subgradient Methods with Averaging and Applications to Grid Optimization

    Authors: Subhonmesh Bose, Hoa Dinh Nguyen, Haitian Liu, Ye Guo, Thinh T. Doan, Carolyn L. Beck

    Abstract: We study finite-time performance of a recently proposed distributed dual subgradient (DDSG) method for convex constrained multi-agent optimization problems. The algorithm enjoys performance guarantees on the last primal iterate, as opposed to those derived for ergodic means for vanilla DDSG algorithms. Our work improves the recently published convergence rate of $\Ocal(\log T/\sqrt{T})$ with decay… ▽ More

    Submitted 26 July, 2023; v1 submitted 14 July, 2021; originally announced July 2021.

  14. arXiv:2104.07781  [pdf, other

    math.OC

    Convergence Rates of Distributed Consensus over Cluster Networks: A Two-Time-Scale Approach

    Authors: Amit Dutta, Almuatazbellah M. Boker, Thinh T. Doan

    Abstract: We study the popular distributed consensus method over networks composed of a number of densely connected clusters with a sparse connection between them. In these cluster networks, the method often constitutes two-time-scale dynamics, where the internal nodes within each cluster reach consensus quickly relative to the aggregate nodes across clusters. Our main contribution is to provide the rate of… ▽ More

    Submitted 12 September, 2022; v1 submitted 15 April, 2021; originally announced April 2021.

  15. arXiv:2104.01627  [pdf, ps, other

    math.OC cs.LG

    Finite-Time Convergence Rates of Nonlinear Two-Time-Scale Stochastic Approximation under Markovian Noise

    Authors: Thinh T. Doan

    Abstract: We study the so-called two-time-scale stochastic approximation, a simulation-based approach for finding the roots of two coupled nonlinear operators. Our focus is to characterize its finite-time performance in a Markov setting, which often arises in stochastic control and reinforcement learning problems. In particular, we consider the scenario where the data in the method are generated by Markov p… ▽ More

    Submitted 4 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: text overlap with arXiv:2011.01868

  16. arXiv:2011.01868  [pdf, ps, other

    math.OC cs.LG eess.SY

    Nonlinear Two-Time-Scale Stochastic Approximation: Convergence and Finite-Time Performance

    Authors: Thinh T. Doan

    Abstract: Two-time-scale stochastic approximation, a generalized version of the popular stochastic approximation, has found broad applications in many areas including stochastic control, optimization, and machine learning. Despite its popularity, theoretical guarantees of this method, especially its finite-time performance, are mostly achieved for the linear case while the results for the nonlinear counterp… ▽ More

    Submitted 23 March, 2021; v1 submitted 3 November, 2020; originally announced November 2020.

  17. arXiv:2010.15088  [pdf, other

    cs.LG math.OC

    Finite-Time Convergence Rates of Decentralized Stochastic Approximation with Applications in Multi-Agent and Multi-Task Learning

    Authors: Sihan Zeng, Thinh T. Doan, Justin Romberg

    Abstract: We study a decentralized variant of stochastic approximation, a data-driven approach for finding the root of an operator under noisy measurements. A network of agents, each with its own operator and data observations, cooperatively find the fixed point of the aggregate operator over a decentralized communication graph. Our main contribution is to provide a finite-time analysis of this decentralize… ▽ More

    Submitted 16 June, 2022; v1 submitted 28 October, 2020; originally announced October 2020.

  18. arXiv:2006.13460  [pdf, ps, other

    cs.LG math.OC stat.ML

    Local Stochastic Approximation: A Unified View of Federated Learning and Distributed Multi-Task Reinforcement Learning Algorithms

    Authors: Thinh T. Doan

    Abstract: Motivated by broad applications in reinforcement learning and federated learning, we study local stochastic approximation over a network of agents, where their goal is to find the root of an operator composed of the local operators at the agents. Our focus is to characterize the finite-time performance of this method when the data at each agent are generated from Markov processes, and hence they a… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

  19. arXiv:2003.10973  [pdf, ps, other

    math.OC cs.LG

    Finite-Time Analysis of Stochastic Gradient Descent under Markov Randomness

    Authors: Thinh T. Doan, Lam M. Nguyen, Nhan H. Pham, Justin Romberg

    Abstract: Motivated by broad applications in reinforcement learning and machine learning, this paper considers the popular stochastic gradient descent (SGD) when the gradients of the underlying objective function are sampled from Markov processes. This Markov sampling leads to the gradient samples being biased and not independent. The existing results for the convergence of SGD under Markov randomness are o… ▽ More

    Submitted 1 April, 2020; v1 submitted 24 March, 2020; originally announced March 2020.

  20. arXiv:2002.02873  [pdf, other

    math.OC

    Convergence Rates of Accelerated Markov Gradient Descent with Applications in Reinforcement Learning

    Authors: Thinh T. Doan, Lam M. Nguyen, Nhan H. Pham, Justin Romberg

    Abstract: Motivated by broad applications in machine learning, we study the popular accelerated stochastic gradient descent (ASGD) algorithm for solving (possibly nonconvex) optimization problems. We characterize the finite-time performance of this method when the gradients are sampled from Markov processes, and hence biased and dependent from time step to time step; in contrast, the analysis in existing wo… ▽ More

    Submitted 19 October, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

  21. arXiv:1912.10583  [pdf, ps, other

    cs.LG math.OC stat.ML

    Finite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale Stochastic Approximation

    Authors: Thinh T. Doan

    Abstract: Motivated by their broad applications in reinforcement learning, we study the linear two-time-scale stochastic approximation, an iterative method using two different step sizes for finding the solutions of a system of two equations. Our main focus is to characterize the finite-time complexity of this method under time-varying step sizes and Markovian noise. In particular, we show that the mean squ… ▽ More

    Submitted 9 January, 2020; v1 submitted 22 December, 2019; originally announced December 2019.

  22. arXiv:1912.10155  [pdf, ps, other

    math.OC

    Finite-Time Performance of Distributed Two-Time-Scale Stochastic Approximation

    Authors: Thinh T. Doan, Justin Romberg

    Abstract: Two-time-scale stochastic approximation is a popular iterative method for finding the solution of a system of two equations. Such methods have found broad applications in many areas, especially in machine learning and reinforcement learning. In this paper, we propose a distributed variant of this method over a network of agents, where the agents use two graphs representing their communication at d… ▽ More

    Submitted 20 December, 2019; originally announced December 2019.

  23. arXiv:1907.12530  [pdf, ps, other

    math.OC cs.LG

    Finite-Time Performance of Distributed Temporal Difference Learning with Linear Function Approximation

    Authors: Thinh T. Doan, Siva Theja Maguluri, Justin Romberg

    Abstract: We study the policy evaluation problem in multi-agent reinforcement learning, modeled by a Markov decision process. In this problem, the agents operate in a common environment under a fixed control policy, working together to discover the value (global discounted accumulative reward) associated with each environmental state. Over a series of time steps, the agents act, get rewarded, update their l… ▽ More

    Submitted 9 January, 2020; v1 submitted 25 July, 2019; originally announced July 2019.

    Comments: arXiv admin note: text overlap with arXiv:1902.07393

  24. arXiv:1905.11425  [pdf, other

    math.OC cs.LG

    Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning

    Authors: Zaiwei Chen, Sheng Zhang, Thinh T. Doan, John-Paul Clarke, Siva Theja Maguluri

    Abstract: Motivated by applications in reinforcement learning (RL), we study a nonlinear stochastic approximation (SA) algorithm under Markovian noise, and establish its finite-sample convergence bounds under various stepsizes. Specifically, we show that when using constant stepsize (i.e., $α_k\equiv α$), the algorithm achieves exponential fast convergence to a neighborhood (with radius $O(α\log(1/α))$) aro… ▽ More

    Submitted 26 January, 2022; v1 submitted 27 May, 2019; originally announced May 2019.

  25. arXiv:1902.07393  [pdf, ps, other

    math.OC

    Finite-Time Analysis of Distributed TD(0) with Linear Function Approximation for Multi-Agent Reinforcement Learning

    Authors: Thinh T. Doan, Siva Theja Maguluri, Justin Romberg

    Abstract: We study the policy evaluation problem in multi-agent reinforcement learning. In this problem, a group of agents works cooperatively to evaluate the value function for the global discounted accumulative reward problem, which is composed of local rewards observed by the agents. Over a series of time steps, the agents act, get rewarded, update their local estimate of the value function, then communi… ▽ More

    Submitted 1 June, 2019; v1 submitted 19 February, 2019; originally announced February 2019.

  26. arXiv:1810.13245  [pdf, other

    math.OC

    Fast Convergence Rates of Distributed Subgradient Methods with Adaptive Quantization

    Authors: Thinh T. Doan, Siva Theja Maguluri, Justin Romberg

    Abstract: We study distributed optimization problems over a network when the communication between the nodes is constrained, and so information that is exchanged between the nodes must be quantized. Recent advances using the distributed gradient algorithm with a quantization scheme at a fixed resolution have established convergence, but at rates significantly slower than when the communications are unquanti… ▽ More

    Submitted 10 May, 2019; v1 submitted 30 October, 2018; originally announced October 2018.

    Comments: arXiv admin note: text overlap with arXiv:1810.11568

  27. arXiv:1810.11568  [pdf, other

    math.OC

    Distributed Stochastic Approximation for Solving Network Optimization Problems Under Random Quantization

    Authors: Thinh T. Doan, Siva Theja Maguluri, Justin Romberg

    Abstract: We study distributed optimization problems over a network when the communication between the nodes is constrained, and so information that is exchanged between the nodes must be quantized. This imperfect communication poses a fundamental challenge, and this imperfect communication, if not properly accounted for, prevents the convergence of these algorithms. Our first contribution in this paper is… ▽ More

    Submitted 26 October, 2018; originally announced October 2018.

  28. arXiv:1805.01526  [pdf, other

    math.OC

    Convergence of the Iterates in Mirror Descent Methods

    Authors: Thinh T. Doan, Subhonmesh Bose, D. Hoa Nguyen, Carolyn L. Beck

    Abstract: We consider centralized and distributed mirror descent algorithms over a finite-dimensional Hilbert space, and prove that the problem variables converge to an optimizer of a possibly nonsmooth function when the step sizes are square summable but not summable. Prior literature has focused on the convergence of the function value to its optimum. However, applications from distributed optimization an… ▽ More

    Submitted 3 May, 2018; originally announced May 2018.

  29. arXiv:1708.03543  [pdf, ps, other

    math.OC

    Distributed Resource Allocation Over Dynamic Networks with Uncertainty

    Authors: Thinh T. Doan, Carolyn L. Beck

    Abstract: Motivated by broad applications in various fields of engineering, we study a network resource allocation problem where the goal is to optimally allocate a fixed quantity of resources over a network of nodes. We consider large scale networks with complex interconnection structures, thus any solution must be implemented in parallel and based only on local data resulting in a need for distributed alg… ▽ More

    Submitted 2 August, 2018; v1 submitted 11 August, 2017; originally announced August 2017.

  30. arXiv:1708.03277  [pdf, other

    math.OC

    On the convergence rate of distributed gradient methods for finite-sum optimization under communication delays

    Authors: Thinh T. Doan, Carolyn L. Beck, R. Srikant

    Abstract: Motivated by applications in machine learning and statistics, we study distributed optimization problems over a network of processors, where the goal is to optimize a global objective composed of a sum of local functions. In these problems, due to the large scale of the data sets, the data and computation must be distributed over processors resulting in the need for distributed algorithms. In this… ▽ More

    Submitted 11 May, 2019; v1 submitted 10 August, 2017; originally announced August 2017.

  31. arXiv:1609.06660   

    math.OC

    On the geometric convergence rate of distributed economic dispatch/demand response in power networks

    Authors: Thinh T. Doan, Alex Olshevsky

    Abstract: Motivated by potential applications in power systems, we study a problem of optimizing a sum of $n$ convex functions on dynamic networks of $n$ nodes when each function is known to only a single node. The nodes' variables, while satisfy their local constraints, are coupled through a linear constraint. Our main contribution is to design a fully distributed primal-dual method for this problem. Under… ▽ More

    Submitted 30 September, 2016; v1 submitted 21 September, 2016; originally announced September 2016.

    Comments: Paper was uploaded without the consent of the second author

  32. arXiv:1609.06287  [pdf, other

    math.OC

    Distributed Lagrangian Methods for Network Resource Allocation

    Authors: Thinh T. Doan, Carolyn L. Beck

    Abstract: Motivated by a variety of applications in control engineering and information sciences, we study network resource allocation problems where the goal is to optimally allocate a fixed amount of resource over a network of nodes. In these problems, due to the large scale of the network and complicated inter-connections between nodes, any solution must be implemented in parallel and based only on local… ▽ More

    Submitted 24 August, 2017; v1 submitted 20 September, 2016; originally announced September 2016.

  33. arXiv:1507.07850  [pdf, ps, other

    math.OC

    Distributed Resource Allocation on Dynamic Networks in Quadratic Time

    Authors: Thinh T. Doan, Alex Olshevsky

    Abstract: We consider the problem of allocating a fixed amount of resource among nodes in a network when each node suffers a cost which is a convex function of the amount of resource allocated to it. We propose a new deterministic and distributed protocol for this problem. Our main result is that the associated convergence time for the global objective scales quadratically in the number of nodes on any sequ… ▽ More

    Submitted 12 June, 2016; v1 submitted 28 July, 2015; originally announced July 2015.