Topics

Quantum Machine Learning

Quantum Speedups in Regret Analysis of Infinite Horizon Average-Reward Markov Decision Processes

arXiv
Authors: Bhargav Ganguly, Yang Xu, Vaneet Aggarwal

Year

2023

Paper ID

53710

Status

Preprint

Abstract

This paper investigates the potential of quantum acceleration in addressing infinite horizon Markov Decision Processes (MDPs) to enhance average reward outcomes. We introduce an innovative quantum framework for the agent's engagement with an unknown MDP, extending the conventional interaction paradigm. Our approach involves the design of an optimism-driven tabular Reinforcement Learning algorithm that harnesses quantum signals acquired by the agent through efficient quantum mean estimation techniques. Through thorough theoretical analysis, we demonstrate that the quantum advantage in mean estimation leads to exponential advancements in regret guarantees for infinite horizon Reinforcement Learning. Specifically, the proposed Quantum algorithm achieves a regret bound of $\tilde{\mathcal{O}}(1)$, a significant improvement over the $\tilde{\mathcal{O}}(\sqrt{T})$ bound exhibited by classical counterparts.
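
A rough numerical illustration of why faster mean estimation can change the regret rate; this is not the paper's algorithm or its proof. It assumes that an optimism-based learner pays regret roughly in proportion to the confidence widths of its estimates, that classical estimation from n samples has width n^(-1/2), and that quantum mean estimation has width n^(-1), with all constants and logarithmic factors dropped. The function names `cumulative_width` and `compare` are hypothetical.

```python
import math


def cumulative_width(horizon: int, exponent: float) -> float:
    """Sum of per-step confidence widths t**(-exponent) for t = 1..horizon."""
    return sum(t ** (-exponent) for t in range(1, horizon + 1))


def compare(horizon: int) -> dict:
    """Compare summed widths under the two assumed estimation rates."""
    # Assumption: classical mean estimation error shrinks like n**-0.5.
    classical = cumulative_width(horizon, 0.5)
    # Assumption: quantum mean estimation error shrinks like n**-1.
    quantum = cumulative_width(horizon, 1.0)
    return {"classical": classical, "quantum": quantum}


if __name__ == "__main__":
    for horizon in (10**2, 10**4, 10**6):
        result = compare(horizon)
        # Reference curves: 2*sqrt(T) for the classical sum, ln(T) for the quantum sum.
        print(
            horizon,
            round(result["classical"], 1),
            round(2 * math.sqrt(horizon), 1),
            round(result["quantum"], 1),
            round(math.log(horizon), 1),
        )
```

Under these assumptions the classical sum tracks 2·sqrt(T) while the quantum sum grows only like ln(T), which points in the same direction as the gap the abstract reports between the classical and quantum regret bounds.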
