ATHENA: Adaptive Test-Time Steering for Improving Count Fidelity in Diffusion Models
arXiv cs.CV / 3/23/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- ATHENA introduces a model-agnostic, test-time adaptive steering framework to improve object-count fidelity in text-to-image diffusion models without retraining or changing model architectures.
- It leverages intermediate representations during sampling to estimate counts and applies count-aware noise corrections early in denoising to steer the generation trajectory before structural errors are hard to fix.
- The work presents three variants, ranging from static prompt-based steering to dynamically adjusted count-aware control, balancing computation with higher numerical accuracy.
- Experiments on standard benchmarks and a new dataset show improved count fidelity, particularly at higher target counts, while maintaining favorable accuracy-runtime trade-offs across multiple diffusion backbones.
Related Articles
Regulating Prompt Markets: Securities Law, Intellectual Property, and the Trading of Prompt Assets
Dev.to
Mercor competitor Deccan AI raises $25M, sources experts from India
Dev.to
How We Got Local MCP Servers Working in Claude Cowork (The Missing Guide)
Dev.to
How Should Students Document AI Usage in Academic Work?
Dev.to
They Did Not Accidentally Make Work the Answer to Who You Are
Dev.to