Skip to main content

Showing 1–2 of 2 results for author: Yavas, R C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2409.05072  [pdf, other

    cs.LG cs.IT stat.ML

    A General Framework for Clustering and Distribution Matching with Bandit Feedback

    Authors: Recep Can Yavas, Yuqi Huang, Vincent Y. F. Tan, Jonathan Scarlett

    Abstract: We develop a general framework for clustering and distribution matching problems with bandit feedback. We consider a $K$-armed bandit model where some subset of $K$ arms is partitioned into $M$ groups. Within each group, the random variable associated to each arm follows the same distribution on a finite alphabet. At each time step, the decision maker pulls an arm and observes its outcome from the… ▽ More

    Submitted 9 January, 2025; v1 submitted 8 September, 2024; originally announced September 2024.

    Comments: 24 pages

    MSC Class: 68T05 ACM Class: I.2.6

  2. arXiv:2311.00481  [pdf, ps, other

    cs.LG stat.ML

    Fixed-Budget Best-Arm Identification in Sparse Linear Bandits

    Authors: Recep Can Yavas, Vincent Y. F. Tan

    Abstract: We study the best-arm identification problem in sparse linear bandits under the fixed-budget setting. In sparse linear bandits, the unknown feature vector $θ^*$ may be of large dimension $d$, but only a few, say $s \ll d$ of these features have non-zero values. We design a two-phase algorithm, Lasso and Optimal-Design- (Lasso-OD) based linear best-arm identification. The first phase of Lasso-OD le… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 28 pages, Submitted to TMLR

    ACM Class: I.2.6