Conservative quantum offline model-based optimization

arXiv stat.ML / 5/6/2026

💬 OpinionModels & Research

Key Points

  • Offline model-based optimization (MBO) seeks to optimize a black-box objective using only a fixed dataset, without active experimentation to gather new samples.
  • The paper builds on quantum extremal learning (QEL), which uses variational quantum circuits to learn surrogate models from limited data.
  • A key issue with predictive surrogates is harmful extrapolation in regions not covered by training data, which can cause selection of unrealistically optimistic candidates.
  • To address this, the authors integrate QEL with conservative objective models (COM), a regularization approach that produces cautious predictions for out-of-distribution inputs, yielding the COM-QEL hybrid algorithm.
  • Experiments on benchmark offline optimization tasks show COM-QEL finds solutions with higher true objective values than the original QEL, supporting its effectiveness for offline design.

Abstract

Offline model-based optimization (MBO) refers to the task of optimizing a black-box objective function using only a fixed set of prior input-output data, without any active experimentation. Recent work has introduced quantum extremal learning (QEL), which leverages the expressive power of variational quantum circuits to learn accurate surrogate functions by training on a few data points. However, as widely studied in the classical machine learning literature, predictive models may incorrectly extrapolate objective values in unexplored regions, leading to the selection of overly optimistic solutions. In this paper, we propose integrating QEL with conservative objective models (COM) - a regularization technique aimed at ensuring cautious predictions on out-of-distribution inputs. The resulting hybrid algorithm, COM-QEL, builds on the expressive power of quantum neural networks while safeguarding generalization via conservative modeling. Empirical results on benchmark optimization tasks demonstrate that COM-QEL reliably finds solutions with higher true objective values compared to the original QEL, validating its superiority for offline design problems.