Skip to main content

Showing 1–3 of 3 results for author: Rusmevichientong, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2107.12438  [pdf, other

    math.OC cs.LG stat.ML

    Debiasing In-Sample Policy Performance for Small-Data, Large-Scale Optimization

    Authors: Vishal Gupta, Michael Huang, Paat Rusmevichientong

    Abstract: Motivated by the poor performance of cross-validation in settings where data are scarce, we propose a novel estimator of the out-of-sample performance of a policy in data-driven optimization.Our approach exploits the optimization problem's sensitivity analysis to estimate the gradient of the optimal objective value with respect to the amount of noise in the data and uses the estimated gradient to… ▽ More

    Submitted 2 August, 2022; v1 submitted 26 July, 2021; originally announced July 2021.

  2. arXiv:1301.2308  [pdf

    cs.AI

    A Tractable POMDP for a Class of Sequencing Problems

    Authors: Paat Rusmevichientong, Benjamin van Roy

    Abstract: We consider a partially observable Markov decision problem (POMDP) that models a class of sequencing problems. Although POMDPs are typically intractable, our formulation admits tractable solution. Instead of maintaining a value function over a high-dimensional set of belief states, we reduce the state space to one of smaller dimension, in which grid-based dynamic programming techniques are effecti… ▽ More

    Submitted 10 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

    Report number: UAI-P-2001-PG-480-487

  3. arXiv:0812.3465  [pdf, ps, other

    cs.LG

    Linearly Parameterized Bandits

    Authors: Paat Rusmevichientong, John N. Tsitsiklis

    Abstract: We consider bandit problems involving a large (possibly infinite) collection of arms, in which the expected reward of each arm is a linear function of an $r$-dimensional random vector $\mathbf{Z} \in \mathbb{R}^r$, where $r \geq 2$. The objective is to minimize the cumulative regret and Bayes risk. When the set of arms corresponds to the unit sphere, we prove that the regret and Bayes risk is of… ▽ More

    Submitted 24 February, 2010; v1 submitted 18 December, 2008; originally announced December 2008.

    Comments: 40 pages; updated results and references