Nonmyopic Global Optimisation via Approximate Dynamic Programming

arXiv stat.ML / 3/30/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses global optimization of expensive, gradient-free black-box functions using surrogate-based sequential query strategies, focusing on methods that remain efficient in high-dimensional settings.
  • It notes that while Bayesian optimization with Gaussian-process surrogates is effective, GP-based computation becomes prohibitive as dimensionality grows, motivating lighter deterministic surrogates such as IDW and RBF.
  • A key contribution is the development of nonmyopic (lookahead) acquisition functions for deterministic surrogate models by using approximate dynamic programming concepts like rollout and multi-step scenario optimization.
  • The proposed approach selects a sequence of query points over a horizon by predicting surrogate-model evolution, explicitly improving the exploration–exploitation trade-off beyond greedy (myopic) criteria.
  • Experiments on synthetic, hyperparameter tuning benchmarks, a constrained task, and a data-driven predictive control application show faster, more robust convergence compared with conventional myopic strategies.

Abstract

Global optimisation to optimise expensive-to-evaluate black-box functions without gradient information. Bayesian optimisation, one of the most well-known techniques, typically employs Gaussian processes as surrogate models, leveraging their probabilistic nature to balance exploration and exploitation. However, these processes become computationally prohibitive in high-dimensional spaces. Recent alternatives, based on inverse distance weighting (IDW) and radial basis functions (RBFs), offer competitive, computationally lighter solutions. Despite their efficiency, both traditional global and Bayesian optimisation strategies suffer from the myopic nature of their acquisition functions, which focus on immediate improvement neglecting future implications of the sequential decision making process. Nonmyopic acquisition functions devised for the Bayesian setting have shown promise in improving long-term performance. Yet, their combination with deterministic surrogate models remains unexplored. In this work, we introduce novel nonmyopic acquisition strategies tailored to IDW and RBF based on approximate dynamic programming paradigms, including rollout and multi-step scenario-based optimisation schemes, to enable lookahead acquisition. These methods optimise a sequence of query points over a horizon by predicting the evolution of the surrogate model, inherently managing the exploration-exploitation trade-off via optimisation techniques. The proposed approach represents a significant advance in extending nonmyopic acquisition principles, previously confined to Bayesian optimisation, to deterministic models. Empirical results on synthetic and hyperparameter tuning benchmark problems, a constrained problem, as well as on a data-driven predictive control application, demonstrate that these nonmyopic methods outperform conventional myopic approaches, leading to faster and more robust convergence.