Feasible-First Exploration for Constrained ML Deployment Optimization in Crash-Prone Hierarchical Search Spaces
arXiv cs.LG / 4/29/2026
📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsModels & Research
Key Points
- The paper targets ML deployment optimization under strict production constraints by treating the problem as a hierarchical mixed-variable search space with many invalid configurations that can crash, OOM, or violate latency limits.
- It argues that standard black-box optimizers (e.g., TPE and constrained Bayesian optimization) can waste a large share of small evaluation budgets on infeasible trials when valid regions are rare.
- The authors propose Thermal Budget Annealing (TBA), a feasible-first approach that explicitly maps feasible regions before warm-starting TPE.
- TBA improves robustness on hostile hardware using early trial timeouts and a subspace blacklisting mechanism that temporarily suppresses categorical subspaces after repeated failures.
- The work also introduces DeployBench, a benchmark suite with hierarchical structure, hidden crash zones, hard constraints, and unequal evaluation costs, showing TBA’s hybrid strategy improves discovery and reduces wasted budget on both synthetic tasks and real GPU deployment across multiple GPU targets.
Related Articles

How I Use AI Agents to Maintain a Living Knowledge Base for My Team
Dev.to

An API testing tool built specifically for AI agent loops
Dev.to
IK_LLAMA now supports Qwen3.5 MTP Support :O
Reddit r/LocalLLaMA
OpenAI models, Codex, and Managed Agents come to AWS
Dev.to

Indian Developers: How to Build AI Side Income with $0 Capital in 2026
Dev.to