Approximate Thompson Sampling via Epistemic Neural Networks

Osband, Ian; Wen, Zheng; Asghari, Seyed Mohammad; Dwaracherla, Vikranth; Ibrahimi, Morteza; Lu, Xiuyuan; Van Roy, Benjamin

Computer Science > Machine Learning

arXiv:2302.09205 (cs)

[Submitted on 18 Feb 2023]

Title:Approximate Thompson Sampling via Epistemic Neural Networks

Authors:Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy

View PDF

Abstract:Thompson sampling (TS) is a popular heuristic for action selection, but it requires sampling from a posterior distribution. Unfortunately, this can become computationally intractable in complex environments, such as those modeled using neural networks. Approximate posterior samples can produce effective actions, but only if they reasonably approximate joint predictive distributions of outputs across inputs. Notably, accuracy of marginal predictive distributions does not suffice. Epistemic neural networks (ENNs) are designed to produce accurate joint predictive distributions. We compare a range of ENNs through computational experiments that assess their performance in approximating TS across bandit and reinforcement learning environments. The results indicate that ENNs serve this purpose well and illustrate how the quality of joint predictive distributions drives performance. Further, we demonstrate that the \textit{epinet} -- a small additive network that estimates uncertainty -- matches the performance of large ensembles at orders of magnitude lower computational cost. This enables effective application of TS with computation that scales gracefully to complex environments.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2302.09205 [cs.LG]
	(or arXiv:2302.09205v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2302.09205

Submission history

From: Ian Osband [view email]
[v1] Sat, 18 Feb 2023 01:58:15 UTC (2,019 KB)

Computer Science > Machine Learning

Title:Approximate Thompson Sampling via Epistemic Neural Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Approximate Thompson Sampling via Epistemic Neural Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators