Adaptive Gaussian Process Search for Simulation-Based Sample Size Estimation in Clinical Prediction Models: Validation of the pmsims R Package
arXiv stat.ML / 3/26/2026
💬 OpinionIdeas & Deep AnalysisTools & Practical UsageModels & Research
Key Points
- The article presents and validates pmsims, an R package that uses Gaussian process surrogate modeling to estimate sample sizes for clinical prediction models via a flexible simulation-based framework.
- In simulation studies across binary, continuous, and survival outcomes, the Gaussian process-based adaptive search engine delivered the most stable sample size recommendations, especially under low-signal and high-dimensional conditions.
- Benchmarking against existing analytical (pmsampsize) and simulation-based (samplesizedev) methods showed pmsims achieving performance close to target criteria while reducing computational burden.
- The authors report that the best pmsims approach can match simulation-based approaches in difficult scenarios and generally outperform purely analytical methods, while requiring fewer model evaluations than non-adaptive simulation strategies.
Related Articles
Regulating Prompt Markets: Securities Law, Intellectual Property, and the Trading of Prompt Assets
Dev.to
Mercor competitor Deccan AI raises $25M, sources experts from India
Dev.to
How We Got Local MCP Servers Working in Claude Cowork (The Missing Guide)
Dev.to
How Should Students Document AI Usage in Academic Work?
Dev.to
I built a PWA fitness tracker with AI that supports 86 sports — as a solo developer
Dev.to