You're viewing papers too quickly. Please wait a moment.<br>This helps keep the archive available for everyone.

Quick Navigation

Topics

Quantum Machine Learning

Sublinear Time Quantum Algorithm for Attention Approximation

arXiv
Authors: Zhao Song, Jianfei Xue, Jiahao Zhang, Lichen Zhang

Year

2026

Paper ID

3022

Status

Preprint

Abstract Read

~2 min

Abstract Words

238

Citations

N/A

Abstract

Given the query, key and value matrices Q, K, Vin mathbb{R}ntimes d, the attention module is defined as Att(Q, K, V)=D-1AV where A=exp\(QK^→p/sqrt{d}\) with exp\(cdot\) applied entrywise, D=diag\(A{bf 1}n\). The attention module is the backbone of modern transformers and large language models, but explicitly forming the softmax matrix D-1A incurs Ω\(n2\) time, motivating numerous approximation schemes that reduce runtime to widetilde O(nd) via sparsity or low-rank factorization. We propose a quantum data structure that approximates any row of Att(Q, K, V) using only row queries to Q, K, V. Our algorithm preprocesses these matrices in widetilde{O}left\(ε-1 n0.5 left( s_λ2.5 + s_λ1.5 d + α0.5 d right\) right) time, where ε is the target accuracy, s_λ is the λ-statistical dimension of the exponential kernel defined by Q and K, and α measures the row distortion of V that is at most d/{rm srank}(V), the stable rank of V. Each row query can be answered in widetilde{O}\(s_λ2 + s_λd\) time. To our knowledge, this is the first quantum data structure that approximates rows of the attention matrix in sublinear time with respect to n. Our approach relies on a quantum Nyström approximation of the exponential kernel, quantum multivariate mean estimation for computing D, and quantum leverage score sampling for the multiplication with V.

Why This Paper Matters

  • This paper contributes to the Quantum Machine Learning research area in the Quantum Articles archive.
  • It adds a 2026 reference point for readers tracking recent quantum research.
  • Given the query, key and value matrices Q, K, Vin mathbbR^ntimes d, the attention module is defined as Att(Q, K, V)=D^-1AV where A=exp(QK^ -> p/sqrtd) with exp(cdot) applied...

Paper Tools

Become a member to use research tools

Sign in to open papers, visit source links, share, cite, compare, copy DOI links, request category corrections, and build your reading list.

Show Paper arXiv Publisher Share Cite This Paper Copy URL Compare Copy DOI Add to Reading List Category Correction Request

References & Citation Signals

Local Citation Graph (Related-Paper Links)

Current Paper #3022 #69596 Comprehensive pKa Data Augmenta... #69584 OQMD: Single-Qubit Rotation Con... #69549 REGRID-QAOA: A Resource-Efficie... #69539 Learning ground state observabl...

External citation index: OpenAlex citation signal

Community Reactions

Quick sentiment from readers on this paper.

Score: 0
Likes: 0 Dislikes: 0

Sign in to react to this paper.

Discussion & Reviews (Moderated)

Average Rating: 0.0 / 5 (0 ratings)

No written reviews yet.