Combinatorial Rising Bandit

Song, Seockbean; Yoon, Youngsik; Wang, Siwei; Chen, Wei; Ok, Jungseul

Computer Science > Machine Learning

arXiv:2412.00798 (cs)

[Submitted on 1 Dec 2024 (v1), last revised 29 May 2025 (this version, v3)]

Title:Combinatorial Rising Bandit

Authors:Seockbean Song, Youngsik Yoon, Siwei Wang, Wei Chen, Jungseul Ok

View PDF

Abstract:Combinatorial online learning is a fundamental task for selecting the optimal action (or super arm) as a combination of base arms in sequential interactions with systems providing stochastic rewards. It is applicable to diverse domains such as robotics, social advertising, network routing, and recommendation systems. In many real-world scenarios, we often encounter rising rewards, where playing a base arm not only provides an instantaneous reward but also contributes to the enhancement of future rewards, e.g., robots enhancing proficiency through practice and social influence strengthening in the history of successful recommendations. Moreover, the enhancement of a single base arm may affect multiple super arms that include it, introducing complex dependencies that are not captured by existing rising bandit models. To address this, we introduce the Combinatorial Rising Bandit (CRB) framework and propose a provably efficient algorithm, Combinatorial Rising Upper Confidence Bound (CRUCB). We establish an upper bound on regret CRUCB and show that it is nearly tight by deriving a matching lower bound. In addition, we empirically demonstrate the effectiveness of CRUCB not only in synthetic environments but also in realistic applications of deep reinforcement learning.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2412.00798 [cs.LG]
	(or arXiv:2412.00798v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2412.00798

Submission history

From: Seockbean Song [view email]
[v1] Sun, 1 Dec 2024 12:52:18 UTC (2,231 KB)
[v2] Mon, 3 Feb 2025 09:25:56 UTC (19,586 KB)
[v3] Thu, 29 May 2025 10:32:12 UTC (20,901 KB)

Computer Science > Machine Learning

Title:Combinatorial Rising Bandit

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Combinatorial Rising Bandit

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators