SOL-ExecBench: Speed-of-Light Benchmarking for Real-World GPU Kernels Against Hardware Limits
arXiv cs.LG / 3/20/2026
📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsTools & Practical UsageModels & Research
Key Points
- SOL-ExecBench introduces a new benchmark of 235 CUDA kernel optimization problems drawn from 124 production and emerging AI models, spanning language, diffusion, vision, audio, video, and hybrid architectures, and targets NVIDIA Blackwell GPUs.
- It evaluates forward and backward workloads across BF16, FP8, and NVFP4, including kernels whose best performance is expected to depend on Blackwell-specific capabilities.
- The benchmark measures performance against analytically derived Speed-of-Light (SOL) bounds computed by SOLAR, providing a hardware-grounded target rather than a traditional software baseline.
- It outputs a SOL Score that quantifies how much of the gap to the hardware SOL bound a candidate kernel closes, enabling objective comparison of kernel efficiency.
- A sandboxed harness with GPU clock locking, L2 cache clearing, isolated subprocess execution, and static-analysis checks is provided to guard against reward-hacking by agentic optimizers.
広告
Related Articles

GDPR and AI Training Data: What You Need to Know Before Training on Personal Data
Dev.to

We built a 9-item checklist that catches LLM coding agent failures before execution starts
Dev.to
Edge-to-Cloud Swarm Coordination for heritage language revitalization programs with embodied agent feedback loops
Dev.to

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Sector HQ Daily AI Intelligence - March 27, 2026
Dev.to