Lipschitz Bandits with Stochastic Delayed Feedback

Liu, Zhongxuan; Kang, Yue; Lee, Thomas C. M.

Computer Science > Machine Learning

arXiv:2510.00309 (cs)

[Submitted on 30 Sep 2025]

Title:Lipschitz Bandits with Stochastic Delayed Feedback

Authors:Zhongxuan Liu, Yue Kang, Thomas C. M. Lee

View PDF HTML (experimental)

Abstract:The Lipschitz bandit problem extends stochastic bandits to a continuous action set defined over a metric space, where the expected reward function satisfies a Lipschitz condition. In this work, we introduce a new problem of Lipschitz bandit in the presence of stochastic delayed feedback, where the rewards are not observed immediately but after a random delay. We consider both bounded and unbounded stochastic delays, and design algorithms that attain sublinear regret guarantees in each setting. For bounded delays, we propose a delay-aware zooming algorithm that retains the optimal performance of the delay-free setting up to an additional term that scales with the maximal delay $\tau_{\max}$. For unbounded delays, we propose a novel phased learning strategy that accumulates reliable feedback over carefully scheduled intervals, and establish a regret lower bound showing that our method is nearly optimal up to logarithmic factors. Finally, we present experimental results to demonstrate the efficiency of our algorithms under various delay scenarios.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2510.00309 [cs.LG]
	(or arXiv:2510.00309v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2510.00309

Submission history

From: Zhongxuan Liu [view email]
[v1] Tue, 30 Sep 2025 22:07:17 UTC (101 KB)

Computer Science > Machine Learning

Title:Lipschitz Bandits with Stochastic Delayed Feedback

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Lipschitz Bandits with Stochastic Delayed Feedback

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators