Skip to main content

Showing 1–9 of 9 results for author: Hemachandra, N

Searching in archive math. Search in all archives.
.
  1. arXiv:2406.05526  [pdf, other

    math.OC

    Optimal Storage Design: An $L^{\infty}$ infused Inventory Control

    Authors: Madhu Dhiman, Veeraruna Kavitha, Nandyala Hemachandra

    Abstract: Inventory control typically considers controlling the price and the production rate. However, such systems have rigidity towards altering the physical storage capacity -- one can not easily alter the physical size after the initial design. The paper focuses on this critical aspect, consideration of which leads to a non-standard control problem. Here, the objective is a weighted combination of the… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  2. arXiv:2303.07834  [pdf, other

    math.OC math.PR

    Finite-Horizon Constrained MDPs With Both Additive And Multiplicative Utilities

    Authors: Uday Kumar M, Sanjay P Bhat, Veeraruna Kavitha, Nandyala Hemachandra

    Abstract: This paper considers the problem of finding a solution to the finite horizon constrained Markov decision processes (CMDP) where the objective as well as constraints are sum of additive and multiplicative utilities. Towards solving this, we construct another CMDP, with only additive utilities under a restricted set of policies, whose optimal value is equal to that of the original CMDP. Furthermore,… ▽ More

    Submitted 15 March, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

  3. arXiv:2209.14963  [pdf, ps, other

    math.OC

    Approximate Solutions To Constrained Risk-Sensitive Markov Decision Processes

    Authors: Uday Kumar M, Sanjay P Bhat, Veeraruna Kavitha, Nandyala Hemachandra

    Abstract: This paper considers the problem of finding near-optimal Markovian randomized (MR) policies for finite-state-action, infinite-horizon, constrained risk-sensitive Markov decision processes (CRSMDPs). Constraints are in the form of standard expected discounted cost functions as well as expected risk-sensitive discounted cost functions over finite and infinite horizons. The main contribution is to sh… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: 38 pages

  4. arXiv:2109.01654  [pdf, other

    cs.LG eess.SY math.OC stat.ML

    Multi-agent Natural Actor-critic Reinforcement Learning Algorithms

    Authors: Prashant Trivedi, Nandyala Hemachandra

    Abstract: Multi-agent actor-critic algorithms are an important part of the Reinforcement Learning paradigm. We propose three fully decentralized multi-agent natural actor-critic (MAN) algorithms in this work. The objective is to collectively find a joint policy that maximizes the average long-term return of these agents. In the absence of a central controller and to preserve privacy, agents communicate some… ▽ More

    Submitted 2 April, 2022; v1 submitted 3 September, 2021; originally announced September 2021.

    Comments: A very high-level summary of our revision is: In Section 3.5, we theoretically prove that the objective function value from the deterministic variant of MAN algorithms dominates that of the MAAC algorithm under some minimal conditions. It relies on the Lemma 2 of our paper: the minimum singular value of the Fisher information matrix is well within the reciprocal of the policy parameter dimension

  5. arXiv:2008.07330  [pdf, other

    math.ST cs.LG stat.ML

    Optimal Posteriors for Chi-squared Divergence based PAC-Bayesian Bounds and Comparison with KL-divergence based Optimal Posteriors and Cross-Validation Procedure

    Authors: Puja Sahu, Nandyala Hemachandra

    Abstract: We investigate optimal posteriors for recently introduced \cite{begin2016pac} chi-squared divergence based PAC-Bayesian bounds in terms of nature of their distribution, scalability of computations, and test set performance. For a finite classifier set, we deduce bounds for three distance functions: KL-divergence, linear and squared distances. Optimal posterior weights are proportional to deviation… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: arXiv admin note: text overlap with arXiv:1912.06803

  6. arXiv:1810.08021  [pdf, other

    math.OC

    On a Conjecture for Dynamic Priority Queues and Nash Equilibrium for Quality of Service Sensitive Markets

    Authors: Manu K. Gupta, N. Hemachandra

    Abstract: Many economic transactions, including those of online markets, have a time lag between the start and end times of transactions. Customers need to wait for completion of their transaction (order fulfillment) and hence are also interested in their waiting time as a Quality of Service (QoS) attribute. So, they factor this QoS in the demand they offer to the firm (service-provider) and some customers… ▽ More

    Submitted 18 October, 2018; originally announced October 2018.

  7. arXiv:1605.00977  [pdf, ps, other

    math.OC cs.GT

    Blackwell-Nash Equilibrium for Discrete and Continuous Time Stochastic Games

    Authors: Vikas Vikram Singh, N. Hemachandra

    Abstract: We consider both discrete and continuous time finite state-action stochastic games. In discrete time stochastic games, it is known that a stationary Blackwell-Nash equilibrium (BNE) exists for a single controller additive reward (SC-AR) stochastic game which is a special case of a general stochastic game. We show that, in general, the additive reward condition is needed for the existence of a BNE.… ▽ More

    Submitted 3 May, 2016; originally announced May 2016.

    MSC Class: 91A05; 91A10; 91A15; 90C40

  8. arXiv:1206.1672  [pdf, ps, other

    math.OC cs.GT

    A mathematical programming based characterization of Nash equilibria of some constrained stochastic games

    Authors: Vikas Vikram Singh, N. Hemachandra

    Abstract: We consider two classes of constrained finite state-action stochastic games. First, we consider a two player nonzero sum single controller constrained stochastic game with both average and discounted cost criterion. We consider the same type of constraints as in [1], i.e., player 1 has subscription based constraints and player 2, who controls the transition probabilities, has realization based con… ▽ More

    Submitted 8 June, 2012; originally announced June 2012.

    MSC Class: 91A10; 91A15; 90C05; 90C20; 90C26

  9. arXiv:math/0212006  [pdf, ps, other

    math.PR

    Bounds for covariances and variances of truncated random variables

    Authors: N. Hemachandra, V. Cheriyan

    Abstract: We show that a lower bound for covariance of $\min(X_1,X_2)$ and $\max(X_1,X_2)$ is $\cov{X_1}{X_2}$ and an upper bound for variance of \\ $\min(X_2,\max(X,X_1))$ is $\var{X} + \var{X_1} +\var{X_2}$ generalizing previous results. We also characterize the cases where these bounds are sharp.

    Submitted 1 December, 2002; originally announced December 2002.

    Comments: 7 pages. Revised during October 2002

    Report number: 02_2002 MSC Class: 60