Skip to main content

Showing 1–14 of 14 results for author: Zeevi, A

Searching in archive math. Search in all archives.
.
  1. arXiv:2404.12949  [pdf, ps, other

    math.PR cs.GT math.OC math.ST

    Optimal single threshold stopping rules and sharp prophet inequalities

    Authors: Alexander Goldenshluger, Yaakov Malinovsky, Assaf Zeevi

    Abstract: This paper considers a finite horizon optimal stopping problem for a sequence of independent and identically distributed random variables. The objective is to design stopping rules that attempt to select the random variable with the highest value in the sequence. The performance of any stopping rule may be benchmarked relative to the selection of a "prophet" that has perfect foreknowledge of the l… ▽ More

    Submitted 19 July, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    MSC Class: 60G40; 62L12; 91A05

  2. arXiv:2310.00806  [pdf, other

    cs.LG math.OC math.ST

    Bayesian Design Principles for Frequentist Sequential Learning

    Authors: Yunbei Xu, Assaf Zeevi

    Abstract: We develop a general theory to optimize the frequentist regret for sequential learning problems, where efficient bandit and reinforcement learning algorithms can be derived from unified Bayesian principles. We propose a novel optimization approach to generate "algorithmic beliefs" at each round, and use Bayesian posteriors to make decisions. The optimization objective to create "algorithmic belief… ▽ More

    Submitted 8 February, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

  3. arXiv:2011.06186  [pdf, ps, other

    stat.ML cs.LG math.PR math.ST

    Towards Optimal Problem Dependent Generalization Error Bounds in Statistical Learning Theory

    Authors: Yunbei Xu, Assaf Zeevi

    Abstract: We study problem-dependent rates, i.e., generalization errors that scale near-optimally with the variance, the effective loss, or the gradient norms evaluated at the "best hypothesis." We introduce a principled framework dubbed "uniform localized convergence," and characterize sharp problem-dependent rates for central statistical learning problems. From a methodological viewpoint, our framework re… ▽ More

    Submitted 23 December, 2020; v1 submitted 11 November, 2020; originally announced November 2020.

  4. arXiv:2007.07876  [pdf, ps, other

    cs.LG math.ST stat.ML

    Upper Counterfactual Confidence Bounds: a New Optimism Principle for Contextual Bandits

    Authors: Yunbei Xu, Assaf Zeevi

    Abstract: The principle of optimism in the face of uncertainty is one of the most widely used and successful ideas in multi-armed bandits and reinforcement learning. However, existing optimistic algorithms (primarily UCB and its variants) often struggle to deal with general function classes and large context spaces. In this paper, we study general contextual bandits with an offline regression oracle and pro… ▽ More

    Submitted 9 March, 2024; v1 submitted 15 July, 2020; originally announced July 2020.

  5. arXiv:2004.05442  [pdf, other

    cs.LG math.PR stat.ML

    Discriminative Learning via Adaptive Questioning

    Authors: Achal Bassamboo, Vikas Deep, Sandeep Juneja, Assaf Zeevi

    Abstract: We consider the problem of designing an adaptive sequence of questions that optimally classify a candidate's ability into one of several categories or discriminative grades. A candidate's ability is modeled as an unknown parameter, which, together with the difficulty of the question asked, determines the likelihood with which s/he is able to answer a question correctly. The learning algorithm is o… ▽ More

    Submitted 11 April, 2020; originally announced April 2020.

    Comments: 3 figures

  6. arXiv:1901.04183  [pdf, ps, other

    math.PR math.ST

    A Unified Approach for Solving Sequential Selection Problems

    Authors: Alexander Goldenshluger, Yaakov Malinovsky, Assaf Zeevi

    Abstract: In this paper we develop a unified approach for solving a wide class of sequential selection problems. This class includes, but is not limited to, selection problems with no-information, rank-dependent rewards, and considers both fixed as well as random problem horizons. The proposed framework is based on a reduction of the original selection problem to one of optimal stopping for a sequence of ju… ▽ More

    Submitted 23 January, 2020; v1 submitted 14 January, 2019; originally announced January 2019.

    MSC Class: 60G40; 62L15

  7. arXiv:1405.3316  [pdf, other

    cs.LG math.OC math.PR stat.ML

    Optimal Exploration-Exploitation in a Multi-Armed-Bandit Problem with Non-stationary Rewards

    Authors: Omar Besbes, Yonatan Gur, Assaf Zeevi

    Abstract: In a multi-armed bandit (MAB) problem a gambler needs to choose at each round of play one of K arms, each characterized by an unknown reward distribution. Reward realizations are only observed when an arm is selected, and the gambler's objective is to maximize his cumulative expected earnings over some given horizon of play T. To do this, the gambler needs to acquire information about arms (explor… ▽ More

    Submitted 6 June, 2019; v1 submitted 13 May, 2014; originally announced May 2014.

  8. arXiv:1307.5449  [pdf, other

    math.PR cs.LG stat.ML

    Non-stationary Stochastic Optimization

    Authors: O. Besbes, Y. Gur, A. Zeevi

    Abstract: We consider a non-stationary variant of a sequential stochastic optimization problem, in which the underlying cost functions may change along the horizon. We propose a measure, termed variation budget, that controls the extent of said change, and study how restrictions on this budget impact achievable performance. We identify sharp conditions under which it is possible to achieve long-run-average… ▽ More

    Submitted 22 December, 2014; v1 submitted 20 July, 2013; originally announced July 2013.

  9. arXiv:1003.1630  [pdf, ps, other

    math.ST

    Nonparametric Bandits with Covariates

    Authors: Philippe Rigollet, Assaf Zeevi

    Abstract: We consider a bandit problem which involves sequential sampling from two populations (arms). Each arm produces a noisy reward realization which depends on an observable random covariate. The goal is to maximize cumulative expected reward. We derive general lower bounds on the performance of any admissible policy, and develop an algorithm whose performance achieves the order of said lower bound up… ▽ More

    Submitted 8 March, 2010; originally announced March 2010.

    MSC Class: Primary 62G08. Secondary 62L12; 62L05; 62C20.

  10. Woodroofe's one-armed bandit problem revisited

    Authors: Alexander Goldenshluger, Assaf Zeevi

    Abstract: We consider the one-armed bandit problem of Woodroofe [J. Amer. Statist. Assoc. 74 (1979) 799--806], which involves sequential sampling from two populations: one whose characteristics are known, and one which depends on an unknown parameter and incorporates a covariate. The goal is to maximize cumulative expected reward. We study this problem in a minimax setting, and develop rate-optimal police… ▽ More

    Submitted 1 September, 2009; originally announced September 2009.

    Comments: Published in at http://dx.doi.org/10.1214/08-AAP589 the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AAP-AAP589 MSC Class: 62L05 (Primary); 60G40; 62C20 (Secondary)

    Journal ref: Annals of Applied Probability 2009, Vol. 19, No. 4, 1603-1633

  11. Recovering convex boundaries from blurred and noisy observations

    Authors: Alexander Goldenshluger, Assaf Zeevi

    Abstract: We consider the problem of estimating convex boundaries from blurred and noisy observations. In our model, the convolution of an intensity function $f$ is observed with additive Gaussian white noise. The function $f$ is assumed to have convex support $G$ whose boundary is to be recovered. Rather than directly estimating the intensity function, we develop a procedure which is based on estimating… ▽ More

    Submitted 1 August, 2006; originally announced August 2006.

    Comments: Published at http://dx.doi.org/10.1214/009053606000000326 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS0149 MSC Class: 62G05; 62H35 (Primary)

    Journal ref: Annals of Statistics 2006, Vol. 34, No. 3, 1375-1394

  12. The Hough transform estimator

    Authors: Alexander Goldenshluger, Assaf Zeevi

    Abstract: This article pursues a statistical study of the Hough transform, the celebrated computer vision algorithm used to detect the presence of lines in a noisy image. We first study asymptotic properties of the Hough transform estimator, whose objective is to find the line that ``best'' fits a set of planar points. In particular, we establish strong consistency and rates of convergence, and characteri… ▽ More

    Submitted 29 March, 2005; originally announced March 2005.

    Comments: Published at http://dx.doi.org/10.1214/009053604000000760 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS208 MSC Class: 62F12; 62F35; 68T45 (Primary)

    Journal ref: Annals of Statistics 2004, Vol. 32, No. 5, 1908-1932

  13. Validity of heavy traffic steady-state approximations in generalized Jackson Networks

    Authors: David Gamarnik, Assaf Zeevi

    Abstract: We consider a single class open queueing network, also known as a generalized Jackson network (GJN). A classical result in heavy-traffic theory asserts that the sequence of normalized queue length processes of the GJN converge weakly to a reflected Brownian motion (RBM) in the orthant, as the traffic intensity approaches unity. However, barring simple instances, it is still not known whether the… ▽ More

    Submitted 9 March, 2006; v1 submitted 4 October, 2004; originally announced October 2004.

    Comments: Published at http://dx.doi.org/10.1214/105051605000000638 in the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AAP-AAP0134 MSC Class: 60J25; 60J65; 60K25 (Primary)

    Journal ref: Annals of Applied Probability 2006, Vol. 16, No. 1, 56-90

  14. Optimal change-point estimation from indirect observations

    Authors: A. Goldenshluger, A. Tsybakov, A. Zeevi

    Abstract: We study nonparametric change-point estimation from indirect noisy observations. Focusing on the white noise convolution model, we consider two classes of functions that are smooth apart from the change-point. We establish lower bounds on the minimax risk in estimating the change-point and develop rate optimal estimation procedures. The results demonstrate that the best achievable rates of conve… ▽ More

    Submitted 18 May, 2006; v1 submitted 23 July, 2004; originally announced July 2004.

    Comments: Published at http://dx.doi.org/10.1214/009053605000000750 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS0068 MSC Class: 62G05; 62G20 (Primary)

    Journal ref: Annals of Statistics 2006, Vol. 34, No. 1, 350-372