Skip to main content

Showing 1–50 of 217 results for author: Başar, T

.
  1. arXiv:2505.06691  [pdf, other

    math.OC eess.SY

    Distributed Event-Triggered Nash Equilibrium Seeking for Noncooperative Games

    Authors: Victor Hugo Pereira Rodrigues, Tiago Roux Oliveira, Miroslav Krstic, Tamer Basar

    Abstract: We propose locally convergent Nash equilibrium seeking algorithms for $N$-player noncooperative games, which use distributed event-triggered pseudo-gradient estimates. The proposed approach employs sinusoidal perturbations to estimate the pseudo-gradients of unknown quadratic payoff functions. This is the first instance of noncooperative games being tackled in a model-free fashion with event-trigg… ▽ More

    Submitted 10 May, 2025; originally announced May 2025.

  2. arXiv:2504.09638  [pdf, other

    math.OC eess.SY

    Data-Driven Two-Stage Distributionally Robust Dispatch of Multi-Energy Microgrid

    Authors: Xunhang Sun, Xiaoyu Cao, Bo Zeng, Miaomiao Li, Xiaohong Guan, Tamer Başar

    Abstract: This paper studies adaptive distributionally robust dispatch (DRD) of the multi-energy microgrid under supply and demand uncertainties. A Wasserstein ambiguity set is constructed to support data-driven decision-making. By fully leveraging the special structure of worst-case expectation from the primal perspective, a novel and high-efficient decomposition algorithm under the framework of column-and… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

  3. arXiv:2504.09035  [pdf, ps, other

    math.OC cs.LG eess.SY

    InterQ: A DQN Framework for Optimal Intermittent Control

    Authors: Shubham Aggarwal, Dipankar Maity, Tamer Başar

    Abstract: In this letter, we explore the communication-control co-design of discrete-time stochastic linear systems through reinforcement learning. Specifically, we examine a closed-loop system involving two sequential decision-makers: a scheduler and a controller. The scheduler continuously monitors the system's state but transmits it to the controller intermittently to balance the communication cost and c… ▽ More

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: Submitted to IEEE for possible publication

  4. arXiv:2503.00313  [pdf, other

    eess.SY cs.GT math.OC

    Communication and Control Co-design in Non-cooperative Games

    Authors: Shubham Aggarwal, Tamer Başar, Dipankar Maity

    Abstract: In this article, we revisit a communication-control co-design problem for a class of two-player stochastic differential games on an infinite horizon. Each 'player' represents two active decision makers, namely a scheduler and a remote controller, which cooperate to optimize over a global objective while competing with the other player. Each player's scheduler can only intermittently relay state in… ▽ More

    Submitted 28 February, 2025; originally announced March 2025.

    Comments: Submitted to IEEE for possible publication

  5. arXiv:2501.18718  [pdf, other

    cs.IT cs.MA eess.SY math.OC

    Distributed Offloading in Multi-Access Edge Computing Systems: A Mean-Field Perspective

    Authors: Shubham Aggarwal, Muhammad Aneeq uz Zaman, Melih Bastopcu, Sennur Ulukus, Tamer Başar

    Abstract: Multi-access edge computing (MEC) technology is a promising solution to assist power-constrained IoT devices by providing additional computing resources for time-sensitive tasks. In this paper, we consider the problem of optimal task offloading in MEC systems with due consideration of the timeliness and scalability issues under two scenarios of equitable and priority access to the edge server (ES)… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

    Comments: Submitted to IEEE for possible publication

  6. arXiv:2501.12256  [pdf, other

    math.OC eess.SY

    Lie-Bracket Nash Equilibrium Seeking with Bounded Update Rates for Noncooperative Games

    Authors: Victor Hugo Pereira Rodrigues, Tiago Roux Oliveira, Miroslav Krstic, Tamer Basar

    Abstract: This paper proposes a novel approach for local convergence to Nash equilibrium in quadratic noncooperative games based on a distributed Lie-bracket extremum seeking control scheme. This is the first instance of noncooperative games being tackled in a model-free fashion integrated with the extremum seeking method of bounded update rates. In particular, the stability analysis is carried out using Li… ▽ More

    Submitted 21 January, 2025; originally announced January 2025.

  7. arXiv:2501.05660  [pdf, ps, other

    cs.IT cs.GT eess.SY

    Fully Decentralized Computation Offloading in Priority-Driven Edge Computing Systems

    Authors: Shubham Aggarwal, Melih Bastopcu, Muhammad Aneeq uz Zaman, Tamer Başar, Sennur Ulukus, Nail Akar

    Abstract: We develop a novel framework for fully decentralized offloading policy design in multi-access edge computing (MEC) systems. The system comprises $N$ power-constrained user equipments (UEs) assisted by an edge server (ES) to process incoming tasks. Tasks are labeled with urgency flags, and in this paper, we classify them under three urgency levels, namely, high, moderate, and low urgency. We formul… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

    Comments: Submitted to IEEE for possible publication

  8. arXiv:2412.00679  [pdf, ps, other

    cs.IT cs.GT eess.SP eess.SY

    Remote Estimation Games with Random Walk Processes: Stackelberg Equilibrium

    Authors: Atahan Dokme, Raj Kiriti Velicheti, Melih Bastopcu, Tamer Başar

    Abstract: Remote estimation is a crucial element of real time monitoring of a stochastic process. While most of the existing works have concentrated on obtaining optimal sampling strategies, motivated by malicious attacks on cyber-physical systems, we model sensing under surveillance as a game between an attacker and a defender. This introduces strategic elements to conventional remote estimation problems.… ▽ More

    Submitted 1 December, 2024; originally announced December 2024.

  9. arXiv:2411.13234  [pdf, other

    math.OC eess.SY

    Extremum and Nash Equilibrium Seeking with Delays and PDEs: Designs & Applications

    Authors: Tiago Roux Oliveira, Miroslav Krstić, Tamer Başar

    Abstract: The development of extremum seeking (ES) has progressed, over the past hundred years, from static maps, to finite-dimensional dynamic systems, to networks of static and dynamic agents. Extensions from ODE dynamics to maps and agents that incorporate delays or even partial differential equations (PDEs) is the next natural step in that progression through ascending research challenges. This paper re… ▽ More

    Submitted 20 November, 2024; originally announced November 2024.

    Comments: Preprint submitted to IEEE Control Systems Magazine (Special Issue: Into the Second Century of Extremum Seeking Control, 38 pages and 34 figures)

  10. arXiv:2411.04913  [pdf, other

    cs.LG math.OC math.PR

    Structure Matters: Dynamic Policy Gradient

    Authors: Sara Klein, Xiangyuan Zhang, Tamer Başar, Simon Weissmann, Leif Döring

    Abstract: In this work, we study $γ$-discounted infinite-horizon tabular Markov decision processes (MDPs) and introduce a framework called dynamic policy gradient (DynPG). The framework directly integrates dynamic programming with (any) policy gradient method, explicitly leveraging the Markovian property of the environment. DynPG dynamically adjusts the problem horizon during training, decomposing the origi… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: 46 pages, 4 figures

  11. arXiv:2411.01794  [pdf, other

    eess.SY cs.GT cs.MA

    Revisiting Game-Theoretic Control in Socio-Technical Networks: Emerging Design Frameworks and Contemporary Applications

    Authors: Quanyan Zhu, Tamer Başar

    Abstract: Socio-technical networks represent emerging cyber-physical infrastructures that are tightly interwoven with human networks. The coupling between human and technical networks presents significant challenges in managing, controlling, and securing these complex, interdependent systems. This paper investigates game-theoretic frameworks for the design and control of socio-technical networks, with a foc… ▽ More

    Submitted 5 November, 2024; v1 submitted 3 November, 2024; originally announced November 2024.

  12. arXiv:2410.19696  [pdf, other

    cs.IT eess.SY

    Age of Coded Updates In Gossip Networks Under Memory and Memoryless Schemes

    Authors: Erkan Bayram, Melih Bastopcu, Mohamed-Ali Belabbas, Tamer Başar

    Abstract: We consider an information update system on a gossip network, where a source node encodes information into $n$ total keys such that any subset of at least $k+1$ keys can fully reconstruct the original information. This encoding process follows the principles of a $k$-out-of-$n$ threshold system. The encoded updates are then disseminated across the network through peer-to-peer communication. We hav… ▽ More

    Submitted 25 October, 2024; originally announced October 2024.

    Comments: A part of this work is presented at the ACSSC24. This work has been submitted to IEEE for possible publication. arXiv admin note: text overlap with arXiv:2402.11462

  13. arXiv:2408.01327  [pdf, other

    cs.IT cs.NI cs.PF

    Modeling Interfering Sources in Shared Queues for Timely Computations in Edge Computing Systems

    Authors: Nail Akar, Melih Bastopcu, Sennur Ulukus, Tamer Başar

    Abstract: Most existing stochastic models on age of information (AoI) focus on a single shared server serving status update packets from $N>1$ sources where each packet update stream is Poisson, i.e., single-hop scenario. In the current work, we study a two-hop edge computing system for which status updates from the information sources are still Poisson but they are not immediately available at the shared e… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: 9 pages, 4 figures

  14. arXiv:2407.06528  [pdf, ps, other

    math.OC cs.IT eess.SY

    Semantic Communication in Multi-team Dynamic Games: A Mean Field Perspective

    Authors: Shubham Aggarwal, Muhammad Aneeq uz Zaman, Melih Bastopcu, Tamer Başar

    Abstract: Coordinating communication and control is a key component in the stability and performance of networked multi-agent systems. While single user networked control systems have gained a lot of attention within this domain, in this work, we address the more challenging problem of large population multi-team dynamic games. In particular, each team constitutes two decision makers (namely, the sensor and… ▽ More

    Submitted 24 June, 2025; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: Submitted to IEEE for possible publication

  15. arXiv:2406.13992  [pdf, ps, other

    cs.MA eess.SY

    Robust Cooperative Multi-Agent Reinforcement Learning:A Mean-Field Type Game Perspective

    Authors: Muhammad Aneeq uz Zaman, Mathieu Laurière, Alec Koppel, Tamer Başar

    Abstract: In this paper, we study the problem of robust cooperative multi-agent reinforcement learning (RL) where a large number of cooperative agents with distributed information aim to learn policies in the presence of \emph{stochastic} and \emph{non-stochastic} uncertainties whose distributions are respectively known and unknown. Focusing on policy optimization that accounts for both types of uncertainti… ▽ More

    Submitted 12 June, 2025; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted for publication in L4DC 2024. Moved Disclaimer from footnote to unnumbered section

  16. arXiv:2406.05632  [pdf, ps, other

    math.OC eess.SY

    Best Response Strategies for Asymmetric Sensing in Linear-Quadratic Differential Games

    Authors: Shubham Aggarwal, Tamer Başar, Dipankar Maity

    Abstract: In this paper, we revisit the two-player continuous-time infinite-horizon linear quadratic differential game problem, where one of the players can sample the state of the system only intermittently due to a sensing constraint while the other player can do so continuously. Under these asymmetric sensing limitations between the players, we analyze the optimal sensing and control strategies for the p… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: Accepted to IEEE L-CSS

  17. arXiv:2405.15762  [pdf, other

    math.OC eess.SY

    Sliding-Mode Nash Equilibrium Seeking for a Quadratic Duopoly Game

    Authors: Victor Hugo Pereira Rodrigues, Tiago Roux Oliveira, Miroslav Krstić, Tamer Başar

    Abstract: This paper introduces a new method to achieve stable convergence to Nash equilibrium in duopoly noncooperative games. Inspired by the recent fixed-time Nash Equilibrium seeking (NES) as well as prescribed-time extremum seeking (ES) and source seeking schemes, our approach employs a distributed sliding mode control (SMC) scheme, integrating extremum seeking with sinusoidal perturbation signals to e… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 8 pages and 2 figures. arXiv admin note: substantial text overlap with arXiv:2404.07287

    MSC Class: 91Axx; 91A05; 91A10; 93-XX; 93B52; 93C40; 93D30

  18. arXiv:2405.00665  [pdf, other

    cs.IT cs.NI eess.SP

    Optimizing Profitability in Timely Gossip Networks

    Authors: Priyanka Kaswan, Melih Bastopcu, Sennur Ulukus, S. Rasoul Etesami, Tamer Başar

    Abstract: We consider a communication system where a group of users, interconnected in a bidirectional gossip network, wishes to follow a time-varying source, e.g., updates on an event, in real-time. The users wish to maintain their expected version ages below a threshold, and can either rely on gossip from their neighbors or directly subscribe to a server publishing about the event, if the former option do… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  19. arXiv:2404.16009  [pdf, other

    cs.IT cs.NI eess.SP

    How to Make Money From Fresh Data: Subscription Strategies in Age-Based Systems

    Authors: Priyanka Kaswan, Melih Bastopcu, Sennur Ulukus, S. Rasoul Etesami, Tamer Başar

    Abstract: We consider a communication system consisting of a server that tracks and publishes updates about a time-varying data source or event, and a gossip network of users interested in closely tracking the event. The timeliness of the information is measured through the version age of information. The users wish to have their expected version ages remain below a threshold, and have the option to either… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  20. arXiv:2404.11013  [pdf, other

    cs.LG math.OC

    Control Theoretic Approach to Fine-Tuning and Transfer Learning

    Authors: Erkan Bayram, Shenyu Liu, Mohamed-Ali Belabbas, Tamer Başar

    Abstract: Given a training set in the form of a paired $(\mathcal{X},\mathcal{Y})$, we say that the control system $\dot x = f(x,u)$ has learned the paired set via the control $u^*$ if the system steers each point of $\mathcal{X}$ to its corresponding target in $\mathcal{Y}$. If the training set is expanded, most existing methods for finding a new control $u^*$ require starting from scratch, resulting in a… ▽ More

    Submitted 19 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  21. arXiv:2404.08509  [pdf, other

    cs.DC cs.CL cs.LG

    Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction

    Authors: Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Saurabh Jha, Chen Wang, Hubertus Franke, Zbigniew T. Kalbarczyk, Tamer Başar, Ravishankar K. Iyer

    Abstract: Large language models (LLMs) have been driving a new wave of interactive AI applications across numerous domains. However, efficiently serving LLM inference requests is challenging due to their unpredictable execution times originating from the autoregressive nature of generative models. Existing LLM serving systems exploit first-come-first-serve (FCFS) scheduling, suffering from head-of-line bloc… ▽ More

    Submitted 25 November, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted at AIOps'24

  22. arXiv:2404.07287  [pdf, other

    math.OC eess.SY

    Nash Equilibrium Seeking for Noncooperative Duopoly Games via Event-Triggered Control

    Authors: Victor Hugo Pereira Rodrigues, Tiago Roux Oliveira, Miroslav Krstić, Tamer Başar

    Abstract: This paper proposes a novel approach for locally stable convergence to Nash equilibrium in duopoly noncooperative games based on a distributed event-triggered control scheme. The proposed approach employs extremum seeking, with sinusoidal perturbation signals applied to estimate the Gradient (first derivative) of unknown quadratic payoff functions. This is the first instance of noncooperative game… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  23. arXiv:2404.02898  [pdf, ps, other

    cs.IT cs.GT cs.NI eess.SY

    Fully Decentralized Task Offloading in Multi-Access Edge Computing Systems

    Authors: Shubham Aggarwal, Muhammad Aneeq uz Zaman, Melih Bastopcu, Sennur Ulukus, Tamer Başar

    Abstract: We consider the problem of task offloading in multi-access edge computing (MEC) systems constituting $N$ devices assisted by an edge server (ES), where the devices can split task execution between a local processor and the ES. Since the local task execution and communication with the ES both consume power, each device must judiciously choose between the two. We model the problem as a large populat… ▽ More

    Submitted 28 October, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted to IEEE Globecom Workshops 2024

  24. arXiv:2404.02407  [pdf, other

    eess.SY cs.AI cs.LG cs.RO

    Decision Transformer as a Foundation Model for Partially Observable Continuous Control

    Authors: Xiangyuan Zhang, Weichao Mao, Haoran Qiu, Tamer Başar

    Abstract: Closed-loop control of nonlinear dynamical systems with partial-state observability demands expert knowledge of a diverse, less standardized set of theoretical tools. Moreover, it requires a delicate integration of controller and estimator designs to achieve the desired system behavior. To establish a general controller synthesis framework, we explore the Decision Transformer (DT) architecture. Sp… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Submitted to CDC 2024

  25. Stochastic-Robust Planning of Networked Hydrogen-Electrical Microgrids: A Study on Induced Refueling Demand

    Authors: Xunhang Sun, Xiaoyu Cao, Bo Zeng, Qiaozhu Zhai, Tamer Başar, Xiaohong Guan

    Abstract: Hydrogen-electrical microgrids are increasingly assuming an important role on the pathway toward decarbonization of energy and transportation systems. This paper studies networked hydrogen-electrical microgrids planning (NHEMP), considering a critical but often-overlooked issue, i.e., the demand-inducing effect (DIE) associated with infrastructure development decisions. Specifically, higher refuel… ▽ More

    Submitted 27 August, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Journal ref: IEEE Transactions on Smart Grid, 16(1), 115-130, 2025

  26. arXiv:2404.00045  [pdf, ps, other

    cs.GT cs.AI cs.LG cs.MA

    Policy Optimization finds Nash Equilibrium in Regularized General-Sum LQ Games

    Authors: Muhammad Aneeq uz Zaman, Shubham Aggarwal, Melih Bastopcu, Tamer Başar

    Abstract: In this paper, we investigate the impact of introducing relative entropy regularization on the Nash Equilibria (NE) of General-Sum $N$-agent games, revealing the fact that the NE of such games conform to linear Gaussian policies. Moreover, it delineates sufficient conditions, contingent upon the adequacy of entropy regularization, for the uniqueness of the NE within the game. As Policy Optimizatio… ▽ More

    Submitted 13 September, 2024; v1 submitted 25 March, 2024; originally announced April 2024.

    Comments: Accepted for Conference on Decision and Control 2024

  27. arXiv:2403.11345  [pdf, other

    cs.LG cs.AI cs.GT cs.MA

    Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective

    Authors: Muhammad Aneeq uz Zaman, Alec Koppel, Mathieu Laurière, Tamer Başar

    Abstract: We address in this paper Reinforcement Learning (RL) among agents that are grouped into teams such that there is cooperation within each team but general-sum (non-zero sum) competition across different teams. To develop an RL method that provably achieves a Nash equilibrium, we focus on a linear-quadratic structure. Moreover, to tackle the non-stationarity induced by multi-agent interactions in th… ▽ More

    Submitted 8 February, 2025; v1 submitted 17 March, 2024; originally announced March 2024.

  28. arXiv:2403.08741  [pdf, ps, other

    cs.GT cs.IT cs.LG eess.SY math.OC

    Learning How to Strategically Disclose Information

    Authors: Raj Kiriti Velicheti, Melih Bastopcu, S. Rasoul Etesami, Tamer Başar

    Abstract: Strategic information disclosure, in its simplest form, considers a game between an information provider (sender) who has access to some private information that an information receiver is interested in. While the receiver takes an action that affects the utilities of both players, the sender can design information (or modify beliefs) of the receiver through signal commitment, hence posing a Stack… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  29. arXiv:2403.07890  [pdf, other

    cs.GT cs.AI cs.LG

    $\widetilde{O}(T^{-1})$ Convergence to (Coarse) Correlated Equilibria in Full-Information General-Sum Markov Games

    Authors: Weichao Mao, Haoran Qiu, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Tamer Başar

    Abstract: No-regret learning has a long history of being closely connected to game theory. Recent works have devised uncoupled no-regret learning dynamics that, when adopted by all the players in normal-form games, converge to various equilibrium solutions at a near-optimal rate of $\widetilde{O}(T^{-1})$, a significant improvement over the $O(1/\sqrt{T})$ rate of classic no-regret learners. However, analog… ▽ More

    Submitted 23 April, 2024; v1 submitted 2 February, 2024; originally announced March 2024.

  30. arXiv:2403.06299  [pdf, other

    eess.SY cs.GT math.OC

    Disentangling Resilience from Robustness: Contextual Dualism, Interactionism, and Game-Theoretic Paradigms

    Authors: Quanyan Zhu, Tamer Basar

    Abstract: This article explains the distinctions between robustness and resilience in control systems. Resilience confronts a distinct set of challenges, posing new ones for designing controllers for feedback systems, networks, and machines that prioritize resilience over robustness. The concept of resilience is explored through a three-stage model, emphasizing the need for a proactive preparation and autom… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  31. arXiv:2403.01005  [pdf, other

    eess.SY cs.AI math.OC

    Policy Optimization for PDE Control with a Warm Start

    Authors: Xiangyuan Zhang, Saviz Mowlavi, Mouhacine Benosman, Tamer Başar

    Abstract: Dimensionality reduction is crucial for controlling nonlinear partial differential equations (PDE) through a "reduce-then-design" strategy, which identifies a reduced-order model and then implements model-based control solutions. However, inaccuracies in the reduced-order modeling can substantially degrade controller performance, especially in PDEs with chaotic behavior. To address this issue, we… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  32. arXiv:2402.11462  [pdf, ps, other

    cs.IT cs.NI eess.SY

    Age of $k$-out-of-$n$ Systems on a Gossip Network

    Authors: Erkan Bayram, Melih Bastopcu, Mohamed-Ali Belabbas, Tamer Başar

    Abstract: We consider information update systems on a gossip network, which consists of a single source and $n$ receiver nodes. The source encrypts the information into $n$ distinct keys with version stamps, sending a unique key to each node. For decoding the information in a $k$-out-of-$n$ system, each receiver node requires at least $k+1$ different keys with the same version, shared over peer-to-peer conn… ▽ More

    Submitted 17 September, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted for publication in ACSSC24

  33. arXiv:2311.18736  [pdf, other

    eess.SY cs.AI cs.CE cs.LG math.OC

    Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning Algorithms

    Authors: Xiangyuan Zhang, Weichao Mao, Saviz Mowlavi, Mouhacine Benosman, Tamer Başar

    Abstract: We introduce controlgym, a library of thirty-six industrial control settings, and ten infinite-dimensional partial differential equation (PDE)-based control problems. Integrated within the OpenAI Gym/Gymnasium (Gym) framework, controlgym allows direct applications of standard reinforcement learning (RL) algorithms like stable-baselines3. Our control environments complement those in Gym with contin… ▽ More

    Submitted 23 April, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: 25 pages, 16 figures

  34. arXiv:2311.04455  [pdf, ps, other

    math.OC cs.MA

    Vector-Valued Gossip over $w$-Holonomic Networks

    Authors: Erkan Bayram, Mohamed-Ali Belabbas, Tamer Başar

    Abstract: We study the weighted average consensus problem for a gossip network of agents with vector-valued states. For a given matrix-weighted graph, the gossip process is described by a sequence of pairs of adjacent agents communicating and updating their states based on the edge matrix weight. Our key contribution is providing conditions for the convergence of this non-homogeneous Markov process as well… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  35. arXiv:2310.13853  [pdf, other

    eess.SY math.DS physics.soc-ph

    A Discrete-time Networked Competitive Bivirus SIS Model

    Authors: Sebin Gracy, Ji Liu, Tamer Basar, Cesar A. Uribe

    Abstract: The paper deals with the analysis of a discrete-time networked competitive bivirus susceptible-infected-susceptible (SIS) model. More specifically, we suppose that virus 1 and virus 2 are circulating in the population and are in competition with each other. We show that the model is strongly monotone, and that, under certain assumptions, it does not admit any periodic orbit. We identify a sufficie… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  36. arXiv:2309.15423  [pdf, other

    cs.GT eess.SY

    Prosumers Participation in Markets: A Scalar-Parameterized Function Bidding Approach

    Authors: Abdullah Alawad, Muhammad Aneeq uz Zaman, Khaled Alshehri, Tamer Başar

    Abstract: In uniform-price markets, suppliers compete to supply a resource to consumers, resulting in a single market price determined by their competition. For sufficient flexibility, producers and consumers prefer to commit to a function as their strategies, indicating their preferred quantity at any given market price. Producers and consumers may wish to act as both, i.e., prosumers. In this paper, we ex… ▽ More

    Submitted 14 March, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: Corrected typos in the figures

  37. arXiv:2309.14317  [pdf, ps, other

    cs.GT cs.MA eess.SY math.OC

    Online and Offline Dynamic Influence Maximization Games Over Social Networks

    Authors: Melih Bastopcu, S. Rasoul Etesami, Tamer Başar

    Abstract: In this work, we consider dynamic influence maximization games over social networks with multiple players (influencers). The goal of each influencer is to maximize their own reward subject to their limited total budget rate constraints. Thus, influencers need to carefully design their investment policies considering individuals' opinion dynamics and other influencers' investment strategies, leadin… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: This work has been submitted to IEEE for possible publication

  38. arXiv:2309.04831  [pdf, other

    math.OC cs.AI cs.LG eess.SY math.DS

    Global Convergence of Receding-Horizon Policy Search in Learning Estimator Designs

    Authors: Xiangyuan Zhang, Saviz Mowlavi, Mouhacine Benosman, Tamer Başar

    Abstract: We introduce the receding-horizon policy gradient (RHPG) algorithm, the first PG algorithm with provable global convergence in learning the optimal linear estimator designs, i.e., the Kalman filter (KF). Notably, the RHPG algorithm does not require any prior knowledge of the system for initialization and does not require the target system to be open-loop stable. The key of RHPG is that we integrat… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: text overlap with arXiv:2301.12624

  39. arXiv:2306.14886  [pdf, ps, other

    cs.GT cs.IT eess.SY math.OC

    Value of Information in Games with Multiple Strategic Information Providers

    Authors: Raj Kiriti Velicheti, Melih Bastopcu, Tamer Başar

    Abstract: In the classical communication setting multiple senders having access to the same source of information and transmitting it over channel(s) to a receiver in general leads to a decrease in estimation error at the receiver as compared with the single sender case. However, if the objectives of the information providers are different from that of the estimator, this might result in interesting strateg… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: This work has been submitted for possible journal publication

  40. arXiv:2305.09068  [pdf, other

    eess.SY

    Analysis, Control, and State Estimation for the Networked Competitive Multi-Virus SIR Model

    Authors: Ciyuan Zhang, Sebin Gracy, Tamer Basar, Philip E. Pare

    Abstract: This paper proposes a novel discrete-time multi-virus susceptible-infected-recovered (SIR) model that captures the spread of competing epidemics over a population network. First, we provide sufficient conditions for the infection level of all the viruses over the networked model to converge to zero in exponential time. Second, we propose an observation model which captures the summation of all the… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:2204.00708

  41. arXiv:2303.09515  [pdf, ps, other

    eess.SY cs.GT cs.SI math.OC

    Large Population Games on Constrained Unreliable Networks

    Authors: Shubham Aggarwal, Muhammad Aneeq uz Zaman, Melih Bastopcu, Tamer Başar

    Abstract: This paper studies an $N$--agent cost-coupled game where the agents are connected via an unreliable capacity constrained network. Each agent receives state information over that network which loses packets with probability $p$. A Base station (BS) actively schedules agent communications over the network by minimizing a weighted Age of Information (WAoI) based cost function under a capacity limit… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: Submitted to IEEE for possible publication

  42. arXiv:2302.13144  [pdf, other

    math.OC cs.AI cs.LG eess.SY

    Revisiting LQR Control from the Perspective of Receding-Horizon Policy Gradient

    Authors: Xiangyuan Zhang, Tamer Başar

    Abstract: We revisit in this paper the discrete-time linear quadratic regulator (LQR) problem from the perspective of receding-horizon policy gradient (RHPG), a newly developed model-free learning framework for control applications. We provide a fine-grained sample complexity analysis for RHPG to learn a control policy that is both stabilizing and $ε$-close to the optimal LQR solution, and our algorithm doe… ▽ More

    Submitted 31 January, 2024; v1 submitted 25 February, 2023; originally announced February 2023.

  43. arXiv:2301.12624  [pdf, other

    math.OC cs.AI cs.LG eess.SY

    Learning the Kalman Filter with Fine-Grained Sample Complexity

    Authors: Xiangyuan Zhang, Bin Hu, Tamer Başar

    Abstract: We develop the first end-to-end sample complexity of model-free policy gradient (PG) methods in discrete-time infinite-horizon Kalman filtering. Specifically, we introduce the receding-horizon policy gradient (RHPG-KF) framework and demonstrate $\tilde{\mathcal{O}}(ε^{-2})$ sample complexity for RHPG-KF in learning a stabilizing filter that is $ε$-close to the optimal Kalman filter. Notably, the p… ▽ More

    Submitted 27 February, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: To appear in ACC 2023

  44. arXiv:2212.07534  [pdf, ps, other

    math.OC cs.LG

    Decentralized Nonconvex Optimization with Guaranteed Privacy and Accuracy

    Authors: Yongqiang Wang, Tamer Basar

    Abstract: Privacy protection and nonconvexity are two challenging problems in decentralized optimization and learning involving sensitive data. Despite some recent advances addressing each of the two problems separately, no results have been reported that have theoretical guarantees on both privacy protection and saddle/maximum avoidance in decentralized nonconvex optimization. We propose a new algorithm fo… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: Accepted as a full paper to Automatica

  45. arXiv:2212.02072  [pdf, ps, other

    eess.SY

    Robust Reinforcement Learning for Risk-Sensitive Linear Quadratic Gaussian Control

    Authors: Leilei Cui, Tamer Başar, Zhong-Ping Jiang

    Abstract: This paper proposes a novel robust reinforcement learning framework for discrete-time linear systems with model mismatch that may arise from the sim-to-real gap. A key strategy is to invoke advanced techniques from control theory. Using the formulation of the classical risk-sensitive linear quadratic Gaussian control, a dual-loop policy optimization algorithm is proposed to generate a robust optim… ▽ More

    Submitted 6 December, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: 27 Pages, 13 Figures

  46. arXiv:2211.07937  [pdf, other

    cs.LG

    An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods

    Authors: Yanli Liu, Kaiqing Zhang, Tamer Başar, Wotao Yin

    Abstract: In this paper, we revisit and improve the convergence of policy gradient (PG), natural PG (NPG) methods, and their variance-reduced variants, under general smooth policy parametrizations. More specifically, with the Fisher information matrix of the policy being positive definite: i) we show that a state-of-the-art variance-reduced PG method, which has only been shown to converge to stationary poin… ▽ More

    Submitted 16 November, 2022; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2020 (improve the proof of Lemma B.1 and Proposition G.1.)

    Journal ref: Advances in Neural Information Processing Systems 33 (2020): 7624-7636

  47. arXiv:2210.04810  [pdf, other

    math.OC cs.LG stat.ML

    Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies

    Authors: Bin Hu, Kaiqing Zhang, Na Li, Mehran Mesbahi, Maryam Fazel, Tamer Başar

    Abstract: Gradient-based methods have been widely used for system design and optimization in diverse application domains. Recently, there has been a renewed interest in studying theoretical properties of these methods in the context of control and reinforcement learning. This article surveys some of the recent developments on policy optimization, a gradient-based iterative approach for feedback control synt… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: To Appear in Annual Review of Control, Robotics, and Autonomous Systems

  48. arXiv:2210.00551  [pdf, ps, other

    math.OC eess.SY

    Gradient-tracking based Distributed Optimization with Guaranteed Optimality under Noisy Information Sharing

    Authors: Yongqiang Wang, Tamer Başar

    Abstract: Distributed optimization enables networked agents to cooperatively solve a global optimization problem even with each participating agent only having access to a local partial view of the objective function. Despite making significant inroads, most existing results on distributed optimization rely on noise-free information sharing among the agents, which is problematic when communication channels… ▽ More

    Submitted 2 October, 2022; originally announced October 2022.

    Comments: Accepted to IEEE Transactions on Automatic Control as a full paper. arXiv admin note: text overlap with arXiv:2202.01113

  49. arXiv:2209.12888  [pdf, ps, other

    eess.SY cs.IT cs.NI math.OC

    Weighted Age of Information based Scheduling for Large Population Games on Networks

    Authors: Shubham Aggarwal, Muhammad Aneeq uz Zaman, Melih Bastopcu, Tamer Başar

    Abstract: In this paper, we consider a discrete-time multi-agent system involving $N$ cost-coupled networked rational agents solving a consensus problem and a central Base Station (BS), scheduling agent communications over a network. Due to a hard bandwidth constraint on the number of transmissions through the network, at most $R_d < N$ agents can concurrently access their state information through the netw… ▽ More

    Submitted 26 December, 2022; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: This work has been submitted to IEEE for possible publication

  50. arXiv:2209.04938  [pdf, ps, other

    cs.GT eess.SY math.OC

    Ensuring both Provable Convergence and Differential Privacy in Nash Equilibrium Seeking on Directed Graphs

    Authors: Yongqiang Wang, Tamer Basar

    Abstract: We study in this paper privacy protection in fully distributed Nash equilibrium seeking where a player can only access its own cost function and receive information from its immediate neighbors over a directed communication network. In view of the non-cooperative nature of the underlying decision-making process, it is imperative to protect the privacy of individual players in networked games when… ▽ More

    Submitted 10 April, 2023; v1 submitted 11 September, 2022; originally announced September 2022.