Skip to main content

Showing 1–1 of 1 results for author: Güçlü, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2411.12154  [pdf, other

    stat.ML cs.LG eess.SY

    Tangential Randomization in Linear Bandits (TRAiL): Guaranteed Inference and Regret Bounds

    Authors: Arda Güçlü, Subhonmesh Bose

    Abstract: We propose and analyze TRAiL (Tangential Randomization in Linear Bandits), a computationally efficient regret-optimal forced exploration algorithm for linear bandits on action sets that are sublevel sets of strongly convex functions. TRAiL estimates the governing parameter of the linear bandit problem through a standard regularized least squares and perturbs the reward-maximizing action correspond… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

    Comments: 42 pages, 6 Figures