Quick Navigation

Overview Topics Abstract Paper Tools References Reactions Discussion Publication Related

Topics

Quantum Optimization Quantum Machine Learning

Quantum exploration algorithms for multi-armed bandits

arXiv

Authors: Daochen Wang, Xuchen You, Tongyang Li, Andrew M. Childs

Year

2020

Paper ID

22239

Status

Preprint

Abstract Read

~2 min

Abstract Words

119

Citations

N/A

Abstract

Identifying the best arm of a multi-armed bandit is a central problem in bandit optimization. We study a quantum computational version of this problem with coherent oracle access to states encoding the reward probabilities of each arm as quantum amplitudes. Specifically, we show that we can find the best arm with fixed confidence using {O}bigl\(sqrt{sum_i=2ⁿΔ^{smash{-2}}_i}bigr\) quantum queries, where Δ_i represents the difference between the mean reward of the best arm and the i^th-best arm. This algorithm, based on variable-time amplitude amplification and estimation, gives a quadratic speedup compared to the best possible classical result. We also prove a matching quantum lower bound (up to poly-logarithmic factors).

Paper Tools

Become a member to use research tools

Sign in to open papers, visit source links, share, cite, compare, copy DOI links, request category corrections, and build your reading list.

Become a member Sign in

Show Paper arXiv Publisher Share Cite This Paper Copy URL Compare Copy DOI Add to Reading List Category Correction Request

References & Citation Signals

[1] DOI https://doi.org/arXiv:2007.07049 [2] arXiv https://arxiv.org/abs/2007.07049 [3] Publisher https://arxiv.org/abs/2007.07049

Local Citation Graph (Related-Paper Links)

External citation index: OpenAlex citation signal

Community Reactions

Quick sentiment from readers on this paper.

Score: 0

Likes: 0 Dislikes: 0

Discussion & Reviews (Moderated)

Average Rating: 0.0 / 5 (0 ratings)

No written reviews yet.