Minimum mean-squared error estimation with bandit feedback

Ghosh, Ayon; Prashanth, L. A.; Sen, Dipayan; Gopalan, Aditya

Computer Science > Machine Learning

arXiv:2203.16810 (cs)

[Submitted on 31 Mar 2022 (v1), last revised 2 May 2025 (this version, v4)]

Title:Minimum mean-squared error estimation with bandit feedback

Authors:Ayon Ghosh, L.A. Prashanth, Dipayan Sen, Aditya Gopalan

View PDF HTML (experimental)

Abstract:We consider the problem of sequentially learning to estimate, in the mean squared error (MSE) sense, a Gaussian $K$-vector of unknown covariance by observing only $m < K$ of its entries in each round. We propose two MSE estimators, and analyze their concentration properties. The first estimator is non-adaptive, as it is tied to a predetermined $m$-subset and lacks the flexibility to transition to alternative subsets. The second estimator, which is derived using a regression framework, is adaptive and exhibits better concentration bounds in comparison to the first estimator. We frame the MSE estimation problem with bandit feedback, where the objective is to find the MSE-optimal subset with high confidence. We propose a variant of the successive elimination algorithm to solve this problem. We also derive a minimax lower bound to understand the fundamental limit on the sample complexity of this problem.

Comments:	A two-page extended abstract version of this paper appeared in the Proceedings of the Ninth Indian Control Conference (ICC), 2023
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2203.16810 [cs.LG]
	(or arXiv:2203.16810v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2203.16810

Submission history

From: L.A. Prashanth [view email]
[v1] Thu, 31 Mar 2022 05:33:32 UTC (280 KB)
[v2] Fri, 1 Apr 2022 06:50:59 UTC (276 KB)
[v3] Thu, 11 Jan 2024 05:44:18 UTC (91 KB)
[v4] Fri, 2 May 2025 12:23:05 UTC (26 KB)

Computer Science > Machine Learning

Title:Minimum mean-squared error estimation with bandit feedback

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Minimum mean-squared error estimation with bandit feedback

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators