Characterizing Truthful Multi-Armed Bandit Mechanisms

Babaioff, Moshe; Sharma, Yogeshwer; Slivkins, Aleksandrs

Computer Science > Data Structures and Algorithms

arXiv:0812.2291 (cs)

[Submitted on 12 Dec 2008 (v1), last revised 3 Jun 2013 (this version, v7)]

Title:Characterizing Truthful Multi-Armed Bandit Mechanisms

Authors:Moshe Babaioff, Yogeshwer Sharma, Aleksandrs Slivkins

View PDF

Abstract:We consider a multi-round auction setting motivated by pay-per-click auctions for Internet advertising. In each round the auctioneer selects an advertiser and shows her ad, which is then either clicked or not. An advertiser derives value from clicks; the value of a click is her private information. Initially, neither the auctioneer nor the advertisers have any information about the likelihood of clicks on the advertisements. The auctioneer's goal is to design a (dominant strategies) truthful mechanism that (approximately) maximizes the social welfare.
If the advertisers bid their true private values, our problem is equivalent to the "multi-armed bandit problem", and thus can be viewed as a strategic version of the latter. In particular, for both problems the quality of an algorithm can be characterized by "regret", the difference in social welfare between the algorithm and the benchmark which always selects the same "best" advertisement. We investigate how the design of multi-armed bandit algorithms is affected by the restriction that the resulting mechanism must be truthful. We find that truthful mechanisms have certain strong structural properties -- essentially, they must separate exploration from exploitation -- and they incur much higher regret than the optimal multi-armed bandit algorithms. Moreover, we provide a truthful mechanism which (essentially) matches our lower bound on regret.

Comments:	This is the full version of a conference paper published in ACM EC 2009. This revision is re-focused to emphasize the results that do not rely on the "IIA assumption" (see the paper for the definition)
Subjects:	Data Structures and Algorithms (cs.DS); Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG)
ACM classes:	F.2.2; K.4.4; F.1.2; J.4
Cite as:	arXiv:0812.2291 [cs.DS]
	(or arXiv:0812.2291v7 [cs.DS] for this version)
	https://doi.org/10.48550/arXiv.0812.2291

Submission history

From: Aleksandrs Slivkins [view email]
[v1] Fri, 12 Dec 2008 04:13:01 UTC (80 KB)
[v2] Thu, 15 Jan 2009 01:56:08 UTC (81 KB)
[v3] Fri, 20 Feb 2009 18:10:47 UTC (84 KB)
[v4] Tue, 23 Jun 2009 02:21:56 UTC (83 KB)
[v5] Fri, 18 Sep 2009 00:17:44 UTC (85 KB)
[v6] Tue, 15 May 2012 22:57:53 UTC (87 KB)
[v7] Mon, 3 Jun 2013 21:03:36 UTC (93 KB)

Computer Science > Data Structures and Algorithms

Title:Characterizing Truthful Multi-Armed Bandit Mechanisms

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Data Structures and Algorithms

Title:Characterizing Truthful Multi-Armed Bandit Mechanisms

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators