Quick Navigation

Topics

Quantum Optimization Quantum Machine Learning

Quantum exploration algorithms for multi-armed bandits

arXiv
Authors: Daochen Wang, Xuchen You, Tongyang Li, Andrew M. Childs

Year

2020

Paper ID

22239

Status

Preprint

Abstract Read

~2 min

Abstract Words

119

Citations

N/A

Abstract

Identifying the best arm of a multi-armed bandit is a central problem in bandit optimization. We study a quantum computational version of this problem with coherent oracle access to states encoding the reward probabilities of each arm as quantum amplitudes. Specifically, we show that we can find the best arm with fixed confidence using {O}bigl\(sqrt{sumi=2nΔ^{smash{-2}}i}bigr\) quantum queries, where Δi represents the difference between the mean reward of the best arm and the ith-best arm. This algorithm, based on variable-time amplitude amplification and estimation, gives a quadratic speedup compared to the best possible classical result. We also prove a matching quantum lower bound (up to poly-logarithmic factors).

Paper Tools

Become a member to use research tools

Sign in to open papers, visit source links, share, cite, compare, copy DOI links, request category corrections, and build your reading list.

Show Paper arXiv Publisher Share Cite This Paper Copy URL Compare Copy DOI Add to Reading List Category Correction Request

References & Citation Signals

Local Citation Graph (Related-Paper Links)

Current Paper #22239 #67338 Provably Quantum-Secure Microgr... #67328 Faster and Better Quantum Softw... #67313 Digitized Counterdiabatic Quant... #67310 Women for Quantum -- Manifesto ...

External citation index: OpenAlex citation signal

Community Reactions

Quick sentiment from readers on this paper.

Score: 0
Likes: 0 Dislikes: 0

Sign in to react to this paper.

Discussion & Reviews (Moderated)

Average Rating: 0.0 / 5 (0 ratings)

No written reviews yet.