AI Navigate

Towards Noise-Resilient Quantum Multi-Armed and Stochastic Linear Bandits

arXiv cs.LG / 3/20/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper analyzes the impact of noise on quantum MAB and stochastic linear bandits in NISQ devices and highlights the gap in relying on ideal quantum Monte Carlo for reward estimation.
  • It proposes a noise-robust quantum Monte Carlo estimator to improve reward querying accuracy under realistic noise models.
  • Building on this estimator, it introduces noise-robust QMAB and QSLB algorithms that aim to preserve quantum advantage even in noisy environments.
  • Experimental results across several quantum noise models show improved estimation accuracy and reduced regret, demonstrating practical resilience on noisy quantum hardware.

Abstract

Quantum multi-armed bandits (MAB) and stochastic linear bandits (SLB) have recently attracted significant attention, as their quantum counterparts can achieve quadratic speedups over classical MAB and SLB. However, most existing quantum MAB algorithms assume ideal quantum Monte Carlo (QMC) procedures on noise-free circuits, overlooking the impact of noise in current noisy intermediate-scale quantum (NISQ) devices. In this paper, we study a noise-robust QMC algorithm that improves estimation accuracy when querying quantum reward oracles. Building on this estimator, we propose noise-robust QMAB and QSLB algorithms that enhance performance in noisy environments while preserving the advantage over classical methods. Experiments show that our noise-robust approach improves QMAB estimation accuracy and reduces regret under several quantum noise models.