Skip to main content

Showing 1–15 of 15 results for author: Khamaru, K

Searching in archive math. Search in all archives.
.
  1. arXiv:2412.06126  [pdf, other

    math.ST cs.IT cs.LG stat.ML

    UCB algorithms for multi-armed bandits: Precise regret and adaptive inference

    Authors: Qiyang Han, Koulik Khamaru, Cun-Hui Zhang

    Abstract: Upper Confidence Bound (UCB) algorithms are a widely-used class of sequential algorithms for the $K$-armed bandit problem. Despite extensive research over the past decades aimed at understanding their asymptotic and (near) minimax optimality properties, a precise understanding of their regret behavior remains elusive. This gap has not only hindered the evaluation of their actual algorithmic effici… ▽ More

    Submitted 8 December, 2024; originally announced December 2024.

  2. arXiv:2408.04595  [pdf, other

    stat.ML cs.AI cs.LG eess.SY math.ST

    Inference with the Upper Confidence Bound Algorithm

    Authors: Koulik Khamaru, Cun-Hui Zhang

    Abstract: In this paper, we discuss the asymptotic behavior of the Upper Confidence Bound (UCB) algorithm in the context of multiarmed bandit problems and discuss its implication in downstream inferential tasks. While inferential tasks become challenging when data is collected in a sequential manner, we argue that this problem can be alleviated when the sequential algorithm at hand satisfies certain stabili… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: 17 pages, 1 figure

  3. arXiv:2404.00042  [pdf, ps, other

    math.OC cs.AI cs.LG stat.ML

    Stochastic Optimization with Constraints: A Non-asymptotic Instance-Dependent Analysis

    Authors: Koulik Khamaru

    Abstract: We consider the problem of stochastic convex optimization under convex constraints. We analyze the behavior of a natural variance reduced proximal gradient (VRPG) algorithm for this problem. Our main result is a non-asymptotic guarantee for VRPG algorithm. Contrary to minimax worst case guarantees, our result is instance-dependent in nature. This means that our guarantee captures the complexity of… ▽ More

    Submitted 24 March, 2024; originally announced April 2024.

    Comments: 18 pages

  4. arXiv:2310.00532  [pdf, other

    math.ST cs.LG

    Statistical Limits of Adaptive Linear Models: Low-Dimensional Estimation and Inference

    Authors: Licong Lin, Mufang Ying, Suvrojit Ghosh, Koulik Khamaru, Cun-Hui Zhang

    Abstract: Estimation and inference in statistics pose significant challenges when data are collected adaptively. Even in linear models, the Ordinary Least Squares (OLS) estimator may fail to exhibit asymptotic normality for single coordinate estimation and have inflated error. This issue is highlighted by a recent minimax lower bound, which shows that the error of estimating a single coordinate can be enlar… ▽ More

    Submitted 28 October, 2023; v1 submitted 30 September, 2023; originally announced October 2023.

    Comments: This paper is accepted at NeurIPS 2023

  5. arXiv:2307.07320  [pdf, other

    math.ST cs.LG stat.ML

    Adaptive Linear Estimating Equations

    Authors: Mufang Ying, Koulik Khamaru, Cun-Hui Zhang

    Abstract: Sequential data collection has emerged as a widely adopted technique for enhancing the efficiency of data gathering processes. Despite its advantages, such data collection mechanism often introduces complexities to the statistical inference procedure. For instance, the ordinary least squares (OLS) estimator in an adaptive linear regression model can exhibit non-normal asymptotic behavior, posing c… ▽ More

    Submitted 7 November, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: Paper is accepted at NeurIPS 2023

  6. arXiv:2303.02534  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Semi-parametric inference based on adaptively collected data

    Authors: Licong Lin, Koulik Khamaru, Martin J. Wainwright

    Abstract: Many standard estimators, when applied to adaptively collected data, fail to be asymptotically normal, thereby complicating the construction of confidence intervals. We address this challenge in a semi-parametric context: estimating the parameter vector of a generalized linear regression model contaminated by a non-parametric nuisance component. We construct suitably weighted estimating equations… ▽ More

    Submitted 1 March, 2025; v1 submitted 4 March, 2023; originally announced March 2023.

  7. arXiv:2201.08518  [pdf, ps, other

    math.ST cs.LG math.OC stat.ML

    Optimal variance-reduced stochastic approximation in Banach spaces

    Authors: Wenlong Mou, Koulik Khamaru, Martin J. Wainwright, Peter L. Bartlett, Michael I. Jordan

    Abstract: We study the problem of estimating the fixed point of a contractive operator defined on a separable Banach space. Focusing on a stochastic query model that provides noisy evaluations of the operator, we analyze a variance-reduced stochastic approximation scheme, and establish non-asymptotic bounds for both the operator defect and the estimation error, measured in an arbitrary semi-norm. In contras… ▽ More

    Submitted 29 November, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

  8. arXiv:2107.02266  [pdf, other

    math.ST cs.LG stat.ML

    Near-optimal inference in adaptive linear regression

    Authors: Koulik Khamaru, Yash Deshpande, Tor Lattimore, Lester Mackey, Martin J. Wainwright

    Abstract: When data is collected in an adaptive manner, even simple methods like ordinary least squares can exhibit non-normal asymptotic behavior. As an undesirable consequence, hypothesis tests and confidence intervals based on asymptotic normality can lead to erroneous results. We propose a family of online debiasing estimators to correct these distributional anomalies in least squares estimation. Our pr… ▽ More

    Submitted 21 March, 2023; v1 submitted 5 July, 2021; originally announced July 2021.

    Comments: 51 pages, 7 figures

  9. arXiv:2005.11411  [pdf, other

    cs.LG math.ST stat.ML

    Instability, Computational Efficiency and Statistical Accuracy

    Authors: Nhat Ho, Koulik Khamaru, Raaz Dwivedi, Martin J. Wainwright, Michael I. Jordan, Bin Yu

    Abstract: Many statistical estimators are defined as the fixed point of a data-dependent operator, with estimators based on minimizing a cost function being an important special case. The limiting performance of such estimators depends on the properties of the population-level operator in the idealized limit of infinitely many samples. We develop a general framework that yields bounds on statistical accurac… ▽ More

    Submitted 20 March, 2022; v1 submitted 22 May, 2020; originally announced May 2020.

    Comments: 68 pages, 6 Figures, 2 Tables. First three authors contributed equally

  10. arXiv:2003.07337  [pdf, other

    stat.ML cs.LG math.OC

    Is Temporal Difference Learning Optimal? An Instance-Dependent Analysis

    Authors: Koulik Khamaru, Ashwin Pananjady, Feng Ruan, Martin J. Wainwright, Michael I. Jordan

    Abstract: We address the problem of policy evaluation in discounted Markov decision processes, and provide instance-dependent guarantees on the $\ell_\infty$-error under a generative model. We establish both asymptotic and non-asymptotic versions of local minimax lower bounds for policy evaluation, thereby providing an instance-dependent baseline by which to compare algorithms. Theory-inspired simulations s… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: 38 pages, 3 figures

  11. arXiv:1902.00194  [pdf, other

    math.ST cs.LG stat.ML

    Sharp Analysis of Expectation-Maximization for Weakly Identifiable Models

    Authors: Raaz Dwivedi, Nhat Ho, Koulik Khamaru, Martin J. Wainwright, Michael I. Jordan, Bin Yu

    Abstract: We study a class of weakly identifiable location-scale mixture models for which the maximum likelihood estimates based on $n$ i.i.d. samples are known to have lower accuracy than the classical $n^{- \frac{1}{2}}$ error. We investigate whether the Expectation-Maximization (EM) algorithm also converges slowly for these models. We provide a rigorous characterization of EM for fitting a weakly identif… ▽ More

    Submitted 15 November, 2021; v1 submitted 1 February, 2019; originally announced February 2019.

    Comments: 30 pages, 4 figures. The first three authors contributed equally to this work. To appear in AISTATS 2020

  12. arXiv:1812.08305  [pdf, ps, other

    cs.LG math.OC stat.ML

    Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems

    Authors: Dhruv Malik, Ashwin Pananjady, Kush Bhatia, Koulik Khamaru, Peter L. Bartlett, Martin J. Wainwright

    Abstract: We study derivative-free methods for policy optimization over the class of linear policies. We focus on characterizing the convergence rate of these methods when applied to linear-quadratic systems, and study various settings of driving noise and reward feedback. We show that these methods provably converge to within any pre-specified tolerance of the optimal policy with a number of zero-order eva… ▽ More

    Submitted 18 May, 2020; v1 submitted 19 December, 2018; originally announced December 2018.

    Comments: Version v3 consistent with paper appearing in JMLR

  13. arXiv:1810.00828  [pdf, other

    math.ST stat.ML

    Singularity, Misspecification, and the Convergence Rate of EM

    Authors: Raaz Dwivedi, Nhat Ho, Koulik Khamaru, Michael I. Jordan, Martin J. Wainwright, Bin Yu

    Abstract: A line of recent work has analyzed the behavior of the Expectation-Maximization (EM) algorithm in the well-specified setting, in which the population likelihood is locally strongly concave around its maximizing argument. Examples include suitably separated Gaussian mixture models and mixtures of linear regressions. We consider over-specified settings in which the number of fitted components is lar… ▽ More

    Submitted 28 April, 2020; v1 submitted 1 October, 2018; originally announced October 2018.

    Comments: 63 pages, 12 figures. The first three authors contributed equally to this work. To appear in Annals of Statistics

    MSC Class: Primary 62F15; 62G05; secondary 62G20

  14. arXiv:1804.09629  [pdf, other

    stat.ML cs.LG math.OC

    Convergence guarantees for a class of non-convex and non-smooth optimization problems

    Authors: Koulik Khamaru, Martin J. Wainwright

    Abstract: We consider the problem of finding critical points of functions that are non-convex and non-smooth. Studying a fairly broad class of such problems, we analyze the behavior of three gradient-based methods (gradient descent, proximal update, and Frank-Wolfe update). For each of these methods, we establish rates of convergence for general problems, and also prove faster rates for continuous sub-analy… ▽ More

    Submitted 25 April, 2018; originally announced April 2018.

    Comments: 50 pages, 2 figures

  15. arXiv:1801.05935  [pdf, other

    math.OC stat.CO stat.ML

    Computation of the Maximum Likelihood estimator in low-rank Factor Analysis

    Authors: Koulik Khamaru, Rahul Mazumder

    Abstract: Factor analysis, a classical multivariate statistical technique is popularly used as a fundamental tool for dimensionality reduction in statistics, econometrics and data science. Estimation is often carried out via the Maximum Likelihood (ML) principle, which seeks to maximize the likelihood under the assumption that the positive definite covariance matrix can be decomposed as the sum of a low ran… ▽ More

    Submitted 17 January, 2018; originally announced January 2018.

    Comments: 22 pages, 4 figures