Betting for Sim-to-Real Performance Evaluation

arXiv cs.RO / 4/28/2026

📰 NewsDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research

Key Points

  • The paper tackles how to estimate real-world robot performance accurately and efficiently when physical experiments are scarce, costly, slow, and safety-constrained.
  • It frames sim-to-real performance evaluation as a “betting” problem, deriving theoretical conditions under which a betting mechanism provably outperforms standard Monte Carlo estimators.
  • The authors characterize how to construct ideal bets and propose theoretically grounded, practical approximations, along with decision rules to diagnose whether the approximations are behaving as intended.
  • Experiments on synthetic setups and cross-fidelity simulators validate the approach, including a case that infers real pick-and-place accuracy from synthetic distributions using the betting perspective.
  • Reproducible code for the reported results is released on GitHub (ISUSAIL/Bet4Sim2Real).

Abstract

This paper studies the problem of robot performance evaluation, focusing on how to obtain accurate and efficient estimates of real-world behavior under severe constraints on physical experimentation. Such estimates are essential for benchmarking algorithms, comparing design alternatives, validating controllers, and supporting certification or regulatory decision-making, yet real-world testing with physical robots is often expensive, time-consuming, and safety-limited. To mitigate the scarcity of real-world trials, sim-to-real methodologies are commonly employed, using low-cost simulators to inform, supplement, or prioritize physical experiments. Departing from (and complementary to) existing approaches in variance reduction (e.g., importance-sampling variants) or bias-correction (e.g., through prediction-powered inference or learned control variates), we examine this performance-evaluation problem through the lens of betting. We establish theoretical conditions under which a betting mechanism can yield accurate and efficient estimates (provably outperforming the Monte Carlo estimator) and we characterize how such bets should be constructed. We further develop theoretically grounded yet practically implementable approximations of the ideal bet, and we provide concrete decision rules that diagnose when these approximate betting strategies are working as intended. We demonstrate the effectiveness of the proposed methods using both synthetic examples and cross-fidelity computational simulators. Notably, we also showcase an illustrative case in which a group of synthetic distributions are used to infer the real-world pick-and-place accuracy of a robotic manipulator, a seemingly unconventional sim-to-real transfer that becomes natural and feasible under the proposed betting perspective. Programs for reproducing empirical results are available at https://github.com/ISUSAIL/Bet4Sim2Real.