Showing 1–33 of 33 results for author: Nayyar, A

Searching in archive eess.
  1. arXiv:2410.04004  [pdf, other]

    eess.SY

    Compositional Planning for Logically Constrained Multi-Agent Markov Decision Processes

    Authors: Krishna C. Kalagarla, Matthew Low, Rahul Jain, Ashutosh Nayyar, Pierluigi Nuzzo

    Abstract: Designing control policies for large, distributed systems is challenging, especially in the context of critical, temporal logic based specifications (e.g., safety) that must be met with high probability. Compositional methods for such problems are needed for scalability, yet relying on worst-case assumptions for decomposition tends to be overly conservative. In this work, we use the framework of C…

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: 6 pages, 1 figure, accepted for publication at the 63rd IEEE Conf. on Decision and Control (2024)

  2. arXiv:2402.08813  [pdf, other]

    math.OC cs.LG eess.SY

    Model approximation in MDPs with unbounded per-step cost

    Authors: Berk Bozkurt, Aditya Mahajan, Ashutosh Nayyar, Yi Ouyang

    Abstract: We consider the problem of designing a control policy for an infinite-horizon discounted cost Markov decision process $\mathcal{M}$ when we only have access to an approximate model $\hat{\mathcal{M}}$. How well does an optimal policy $\hat{\pi}^{\star}$ of the approximate model perform when used in the original model $\mathcal{M}$? We answer this question by bounding a weighted norm of the difference…

    Submitted 13 February, 2024; originally announced February 2024.

  3. arXiv:2310.10107  [pdf, other]

    cs.LG cs.AI eess.SY stat.ML

    Posterior Sampling-based Online Learning for Episodic POMDPs

    Authors: Dengwang Tang, Dongze Ye, Rahul Jain, Ashutosh Nayyar, Pierluigi Nuzzo

    Abstract: Learning in POMDPs is known to be significantly harder than in MDPs. In this paper, we consider the online learning problem for episodic POMDPs with unknown transition and observation models. We propose a Posterior Sampling-based reinforcement learning algorithm for POMDPs (PS4POMDPs), which is much simpler and more implementable compared to state-of-the-art optimism-based online learning algorith…

    Submitted 23 October, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: 41 pages, 9 figures

    MSC Class: 93E35

  4. arXiv:2305.14736  [pdf, other]

    cs.AI cs.FL eess.SY

    Optimal Control of Logically Constrained Partially Observable and Multi-Agent Markov Decision Processes

    Authors: Krishna C. Kalagarla, Dhruva Kartik, Dongming Shen, Rahul Jain, Ashutosh Nayyar, Pierluigi Nuzzo

    Abstract: Autonomous systems often have logical constraints arising, for example, from safety, operational, or regulatory requirements. Such constraints can be expressed using temporal logic specifications. The system state is often partially observable. Moreover, it could encompass a team of multiple agents with a common objective but disparate information structures and constraints. In this paper, we firs…

    Submitted 19 June, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2203.09038

  5. arXiv:2304.04346  [pdf, other]

    cs.AI cs.MA eess.SY math.OC

    A Novel Point-based Algorithm for Multi-agent Control Using the Common Information Approach

    Authors: Dengwang Tang, Ashutosh Nayyar, Rahul Jain

    Abstract: The Common Information (CI) approach provides a systematic way to transform a multi-agent stochastic control problem to a single-agent partially observed Markov decision problem (POMDP) called the coordinator's POMDP. However, such a POMDP can be hard to solve due to its extraordinarily large action space. We propose a new algorithm for multi-agent stochastic control problems, called coordinator's…

    Submitted 9 April, 2023; originally announced April 2023.

    Comments: 11 pages, 4 figures

    MSC Class: 68T20

    ACM Class: I.2.8; I.2.11

  6. arXiv:2209.03888  [pdf, ps, other]

    eess.SY

    Optimal Communication and Control Strategies for a Multi-Agent System in the Presence of an Adversary

    Authors: Dhruva Kartik, Sagar Sudhakara, Rahul Jain, Ashutosh Nayyar

    Abstract: We consider a multi-agent system in which a decentralized team of agents controls a stochastic system in the presence of an adversary. Instead of committing to a fixed information sharing protocol, the agents can strategically decide at each time whether to share their private information with each other or not. The agents incur a cost whenever they communicate with each other and the adversary ma…

    Submitted 8 September, 2022; originally announced September 2022.

    Comments: In proceedings of Conference of Decision and Control (2022)

  7. arXiv:2203.09038  [pdf, other]

    eess.SY

    Optimal Control of Partially Observable Markov Decision Processes with Finite Linear Temporal Logic Constraints

    Authors: Krishna C. Kalagarla, Dhruva Kartik, Dongming Shen, Rahul Jain, Ashutosh Nayyar, Pierluigi Nuzzo

    Abstract: Autonomous agents often operate in scenarios where the state is partially observed. In addition to maximizing their cumulative reward, agents must execute complex tasks with rich temporal and logical structures. These tasks can be expressed using temporal logic languages like finite linear temporal logic (LTL_f). This paper, for the first time, provides a structured framework for designing agent p…

    Submitted 16 March, 2022; originally announced March 2022.

  8. arXiv:2108.08502  [pdf, ps, other]

    eess.SY cs.AI math.OC

    A relaxed technical assumption for posterior sampling-based reinforcement learning for control of unknown linear systems

    Authors: Mukul Gagrani, Sagar Sudhakara, Aditya Mahajan, Ashutosh Nayyar, Yi Ouyang

    Abstract: We revisit the Thompson sampling algorithm to control an unknown linear quadratic (LQ) system recently proposed by Ouyang et al (arXiv:1709.04047). The regret bound of the algorithm was derived under a technical assumption on the induced norm of the closed loop system. In this technical note, we show that by making a minor modification in the algorithm (in particular, ensuring that an episode does…

    Submitted 19 September, 2022; v1 submitted 19 August, 2021; originally announced August 2021.

    Journal ref: Proc 2022 IEEE Conference on Decision and Control

  9. arXiv:2108.07970  [pdf, other]

    eess.SY cs.AI math.OC

    Scalable regret for learning to control network-coupled subsystems with unknown dynamics

    Authors: Sagar Sudhakara, Aditya Mahajan, Ashutosh Nayyar, Yi Ouyang

    Abstract: We consider the problem of controlling an unknown linear quadratic Gaussian (LQG) system consisting of multiple subsystems connected over a network. Our goal is to minimize and quantify the regret (i.e. loss in performance) of our strategy with respect to an oracle who knows the system model. Viewing the interconnected subsystems globally and directly using existing LQG learning algorithms for the…

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: 12 pages

  10. arXiv:2102.05838  [pdf, ps, other]

    cs.MA eess.SY

    Common Information Belief based Dynamic Programs for Stochastic Zero-sum Games with Competing Teams

    Authors: Dhruva Kartik, Ashutosh Nayyar, Urbashi Mitra

    Abstract: Decentralized team problems where players have asymmetric information about the state of the underlying stochastic system have been actively studied, but \emph{games} between such teams are less understood. We consider a general model of zero-sum stochastic games between two competing teams. This model subsumes many previously considered team and zero-sum game models. For this general model, we pr…

    Submitted 27 September, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: arXiv admin note: text overlap with arXiv:1909.01445

  11. arXiv:2011.04686  [pdf, other]

    eess.SY cs.LG math.OC

    Thompson sampling for linear quadratic mean-field teams

    Authors: Mukul Gagrani, Sagar Sudhakara, Aditya Mahajan, Ashutosh Nayyar, Yi Ouyang

    Abstract: We consider optimal control of an unknown multi-agent linear quadratic (LQ) system where the dynamics and the cost are coupled across the agents through the mean-field (i.e., empirical mean) of the states and controls. Directly using single-agent LQ learning algorithms in such models results in regret which increases polynomially with the number of agents. We propose a new Thompson sampling based…

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: Submitted to AISTATS 2021

  12. arXiv:2007.03007  [pdf, ps, other]

    cs.GT eess.SY

    Optimal Dynamic Mechanism Design with Stochastic Supply and Flexible Consumers

    Authors: Shiva Navabi, Ashutosh Nayyar

    Abstract: We consider the problem of designing an expected-revenue maximizing mechanism for allocating multiple non-perishable goods of $k$ varieties to flexible consumers over $T$ time steps. In our model, a random number of goods of each variety may become available to the seller at each time and a random number of consumers may enter the market at each time. Each consumer is present in the market for one…

    Submitted 6 July, 2020; originally announced July 2020.

  13. arXiv:1911.06912  [pdf, ps, other]

    eess.SY cs.IT math.ST

    Fixed-horizon Active Hypothesis Testing

    Authors: Dhruva Kartik, Ashutosh Nayyar, Urbashi Mitra

    Abstract: Two active hypothesis testing problems are formulated. In these problems, the agent can perform a fixed number of experiments and then decide on one of the hypotheses. The agent is also allowed to declare its experiments inconclusive if needed. The first problem is an asymmetric formulation in which the objective is to minimize the probability of incorrectly declaring a particular hypothesis t…

    Submitted 15 November, 2019; originally announced November 2019.

    Comments: Submitted to IEEE Transactions on Automatic Control

  14. arXiv:1909.01445  [pdf, ps, other]

    eess.SY cs.GT

    Zero-sum Stochastic Games with Asymmetric Information

    Authors: Dhruva Kartik, Ashutosh Nayyar

    Abstract: A general model for zero-sum stochastic games with asymmetric information is considered. In this model, each player's information at each time can be divided into a common information part and a private information part. Under certain conditions on the evolution of the common and private information, a dynamic programming characterization of the value of the game (if it exists) is presented. If th…

    Submitted 24 December, 2019; v1 submitted 3 September, 2019; originally announced September 2019.

    Comments: Accepted for presentation at the 58th Conference on Decision and Control (CDC), 2019. Submitted to Dynamic Games and Applications

  15. arXiv:1908.06070  [pdf, other]

    eess.SY

    Optimal scheduling strategy for networked estimation with energy harvesting

    Authors: Marcos M. Vasconcelos, Mukul Gagrani, Ashutosh Nayyar, Urbashi Mitra

    Abstract: Joint optimization of scheduling and estimation policies is considered for a system with two sensors and two non-collocated estimators. Each sensor produces an independent and identically distributed sequence of random variables, and each estimator forms estimates of the corresponding sequence with respect to the mean-squared error sense. The data generated by the sensors is transmitted to the cor…

    Submitted 16 August, 2019; originally announced August 2019.

    Comments: 25 pages, 9 figures

  16. arXiv:1902.03339  [pdf, other]

    eess.SY math.OC

    Worst-case Guarantees for Remote Estimation of an Uncertain Source

    Authors: Mukul Gagrani, Yi Ouyang, Mohammad Rasouli, Ashutosh Nayyar

    Abstract: Consider a remote estimation problem where a sensor wants to communicate the state of an uncertain source to a remote estimator over a finite time horizon. The uncertain source is modeled as an autoregressive process with bounded noise. Given that the sensor has a limited communication budget, the sensor must decide when to transmit the state to the estimator who has to produce real-time estimates…

    Submitted 8 February, 2019; originally announced February 2019.

  17. arXiv:1806.06497  [pdf, other]

    eess.SY math.OC

    Optimal Infinite Horizon Decentralized Networked Controllers with Unreliable Communication

    Authors: Yi Ouyang, Seyed Mohammad Asghari, Ashutosh Nayyar

    Abstract: We consider a decentralized networked control system (DNCS) consisting of a remote controller and a collection of linear plants, each associated with a local controller. Each local controller directly observes the state of its co-located plant and can inform the remote controller of the plant's state through an unreliable uplink channel. The downlink channels from the remote controller to local co…

    Submitted 18 June, 2018; originally announced June 2018.

    Comments: 52 pages, Submitted to IEEE Transactions on Automatic Control

  18. arXiv:1802.00538  [pdf, ps, other]

    eess.SY math.OC

    Decentralized Control of Stochastically Switched Linear System with Unreliable Communication

    Authors: Seyed Mohammad Asghari, Yi Ouyang, Ashutosh Nayyar

    Abstract: We consider a networked control system (NCS) consisting of two plants, a global plant and a local plant, and two controllers, a global controller and a local controller. The global (resp. local) plant follows discrete-time stochastically switched linear dynamics with a continuous global (resp. local) state and a discrete global (resp. local) mode. We assume that the state and mode of the global pl…

    Submitted 1 February, 2018; originally announced February 2018.

    Comments: [Extended Version] Accepted for presentation in IEEE American Conference on Control (ACC) 2018

  19. arXiv:1611.07175  [pdf, other]

    eess.SY

    Optimal Local and Remote Controllers with Unreliable Uplink Channels

    Authors: Seyed Mohammad Asghari, Yi Ouyang, Ashutosh Nayyar

    Abstract: We consider a networked control system consisting of a remote controller and a collection of linear plants, each associated with a local controller. Each local controller directly observes the state of its co-located plant and can inform the remote controller of the plant's state through an unreliable uplink channel. We assume that the downlink channels from the remote controller to local controll…

    Submitted 16 June, 2018; v1 submitted 22 November, 2016; originally announced November 2016.

    Comments: 43 pages, Accepted for publication in IEEE Transactions on Automatic Control

  20. arXiv:1611.03592  [pdf, ps, other]

    eess.SY

    Dynamic Teams and Decentralized Control Problems with Substitutable Actions

    Authors: Seyed Mohammad Asghari, Ashutosh Nayyar

    Abstract: This paper considers two problems -- a dynamic team problem and a decentralized control problem. The problems we consider do not belong to the known classes of "simpler" dynamic team/decentralized control problems such as partially nested or quadratically invariant problems. However, we show that our problems admit simple solutions under an assumption referred to as the substitutability assumption…

    Submitted 11 November, 2016; originally announced November 2016.

    Comments: 25 pages, Accepted for publication in IEEE Transactions on Automatic Control

  21. arXiv:1606.07215  [pdf, other]

    eess.SY

    Optimal Local and Remote Controllers with Unreliable Communication

    Authors: Yi Ouyang, Seyed Mohammad Asghari, Ashutosh Nayyar

    Abstract: We consider a decentralized optimal control problem for a linear plant controlled by two controllers, a local controller and a remote controller. The local controller directly observes the state of the plant and can inform the remote controller of the plant state through a packet-drop channel. We assume that the remote controller is able to send acknowledgments to the local controller to signal th…

    Submitted 23 June, 2016; originally announced June 2016.

  22. arXiv:1601.02250  [pdf, ps, other]

    eess.SY

    Decentralized Control Problems with Substitutable Actions

    Authors: Seyed Mohammad Asghari, Ashutosh Nayyar

    Abstract: We consider a decentralized system with multiple controllers and define substitutability of one controller by another in open-loop strategies. We explore the implications of this property on the optimization of closed-loop strategies. In particular, we focus on the decentralized LQG problem with substitutable actions. Even though the problem we formulate does not belong to the known classes of "si…

    Submitted 10 January, 2016; originally announced January 2016.

  23. arXiv:1409.7034  [pdf, ps, other]

    eess.SY

    Rate-constrained Energy Services: Allocation Policies and Market Decisions

    Authors: Ashutosh Nayyar, Matias Negrete-Pincetic, Kameshwar Poolla, Pravin Varaiya

    Abstract: The integration of renewable generation poses operational and economic challenges for the electricity grid. For the core problem of power balance, the legacy paradigm of tailoring supply to follow random demand may be inappropriate under deep penetration of uncertain and intermittent renewable generation. In this situation, there is an emerging consensus that the alternative approach of controllin…

    Submitted 24 September, 2014; originally announced September 2014.

  24. arXiv:1408.5825  [pdf, other]

    eess.SY

    Duration-differentiated Energy Services with a Continuum of Loads

    Authors: Ashutosh Nayyar, Matias Negrete-Pincetic, Kameshwar Poolla, Pravin Varaiya

    Abstract: As the proportion of total power supplied by renewable sources increases, it gets more costly to use reserve generation to compensate for the variability of renewables like solar and wind. Hence attention has been drawn to exploiting flexibility in demand as a substitute for reserve generation. Flexibility has different attributes. In this paper we consider loads requiring a constant power for a s…

    Submitted 25 August, 2014; originally announced August 2014.

  25. arXiv:1408.2551  [pdf, ps, other]

    eess.SY math.OC

    Optimal Control for LQG Systems on Graphs---Part I: Structural Results

    Authors: Ashutosh Nayyar, Laurent Lessard

    Abstract: In this two-part paper, we identify a broad class of decentralized output-feedback LQG systems for which the optimal control strategies have a simple intuitive estimation structure and can be computed efficiently. Roughly, we consider the class of systems for which the coupling of dynamics among subsystems and the inter-controller communication is characterized by the same directed graph. Furtherm…

    Submitted 11 August, 2014; originally announced August 2014.

  26. arXiv:1404.1112  [pdf, other]

    eess.SY

    Duration-Differentiated Services in Electricity

    Authors: Ashutosh Nayyar, Matias Negrete-Pincetic, Kameshwar Poolla, Pravin Varaiya

    Abstract: The integration of renewable sources poses challenges at the operational and economic levels of the power grid. In terms of keeping the balance between supply and demand, the usual scheme of supply following load may not be appropriate for large penetration levels of uncertain and intermittent renewable supply. In this paper, we focus on an alternative scheme in which the load follows the supply,…

    Submitted 3 April, 2014; originally announced April 2014.

  27. arXiv:1403.3126  [pdf, ps, other]

    eess.SY

    Signaling in sensor networks for sequential detection

    Authors: Ashutosh Nayyar, Demosthenis Teneketzis

    Abstract: Sequential detection problems in sensor networks are considered. The true state of nature/true hypothesis is modeled as a binary random variable $H$ with known prior distribution. There are $N$ sensors making noisy observations about the hypothesis; $\mathcal{N} =\{1,2,\ldots,N\}$ denotes the set of sensors. Sensor $i$ can receive messages from a subset $\mathcal{P}^i \subset \mathcal{N}$ of senso…

    Submitted 12 March, 2014; originally announced March 2014.

    Comments: 10 pages

  28. arXiv:1403.2739  [pdf, other]

    math.OC eess.SY

    Sufficient statistics for linear control strategies in decentralized systems with partial history sharing

    Authors: Aditya Mahajan, Ashutosh Nayyar

    Abstract: In decentralized control systems with linear dynamics, quadratic cost, and Gaussian disturbance (also called decentralized LQG systems) linear control strategies are not always optimal. Nonetheless, linear control strategies are appealing due to analytic and implementation simplicity. In this paper, we investigate decentralized LQG systems with partial history sharing information structure and ide…

    Submitted 11 March, 2014; originally announced March 2014.

  29. arXiv:1401.4786  [pdf, ps, other]

    eess.SY cs.GT math.OC

    Common Information based Markov Perfect Equilibria for Linear-Gaussian Games with Asymmetric Information

    Authors: Abhishek Gupta, Ashutosh Nayyar, Cedric Langbort, Tamer Basar

    Abstract: We consider a class of two-player dynamic stochastic nonzero-sum games where the state transition and observation equations are linear, and the primitive random variables are Gaussian. Each controller acquires possibly different dynamic information about the state process and the other controller's past actions and observations. This leads to a dynamic game of asymmetric information among the cont…

    Submitted 19 January, 2014; originally announced January 2014.

    Comments: Submitted to SIAM Journal of Control and Optimization

  30. arXiv:1303.3256  [pdf, other]

    eess.SY math.OC

    Structural Results and Explicit Solution for Two-Player LQG Systems on a Finite Time Horizon

    Authors: Laurent Lessard, Ashutosh Nayyar

    Abstract: It is well-known that linear dynamical systems with Gaussian noise and quadratic cost (LQG) satisfy a separation principle. Finding the optimal controller amounts to solving separate dual problems; one for control and one for estimation. For the discrete-time finite-horizon case, each problem is a simple forward or backward recursion. In this paper, we consider a generalization of the LQG problem…

    Submitted 6 September, 2013; v1 submitted 13 March, 2013; originally announced March 2013.

  31. arXiv:1209.3549  [pdf, ps, other]

    cs.GT eess.SY

    Nash Equilibria for Stochastic Games with Asymmetric Information-Part 1: Finite Games

    Authors: Ashutosh Nayyar, Abhishek Gupta, Cédric Langbort, Tamer Başar

    Abstract: A model of stochastic games where multiple controllers jointly control the evolution of the state of a dynamic system but have access to different information about the state and action processes is considered. The asymmetry of information among the controllers makes it difficult to compute or characterize Nash equilibria. Using common information among the controllers, the game with asymmetric in…

    Submitted 17 September, 2012; originally announced September 2012.

  32. arXiv:1209.1695  [pdf, other]

    eess.SY math.OC

    Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach

    Authors: Ashutosh Nayyar, Aditya Mahajan, Demosthenis Teneketzis

    Abstract: A general model of decentralized stochastic control called partial history sharing information structure is presented. In this model, at each step the controllers share part of their observation and control history with each other. This general model subsumes several existing models of information sharing as special cases. Based on the information commonly known to all the controllers, the decentr…

    Submitted 8 September, 2012; originally announced September 2012.

    Comments: 37 pages, 1 figure

  33. arXiv:1205.6018  [pdf, ps, other]

    eess.SY math.OC

    Optimal Strategies for Communication and Remote Estimation with an Energy Harvesting Sensor

    Authors: Ashutosh Nayyar, Tamer Basar, Demosthenis Teneketzis, Venugopal V. Veeravalli

    Abstract: We consider a remote estimation problem with an energy harvesting sensor and a remote estimator. The sensor observes the state of a discrete-time source which may be a finite state Markov chain or a multi-dimensional linear Gaussian system. It harvests energy from its environment (say, for example, through a solar cell) and uses this energy for the purpose of communicating with the estimator. Due…

    Submitted 27 May, 2012; originally announced May 2012.

    Comments: 32 pages