CODESTRUCT: Code Agents over Structured Action Spaces
arXiv cs.AI / 4/8/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that LLM-based code agents often fail because they treat code repositories as unstructured text and rely on brittle string matching for edits.
- It proposes CODESTRUCT, which reframes a repository as a structured action space by operating on named AST entities and using syntax-validated operations via readCode and editCode.
- Across SWE-Bench Verified evaluated on six LLMs, CODESTRUCT increases Pass@1 accuracy by 1.2–5.0% while cutting token usage by 12–38% for most models.
- The biggest gains occur for models prone to invalid or empty patches in text-based interfaces; for example, GPT-5-nano improves by 20.8% as empty-patch failures drop from 46.6% to 7.2%.
- Results on CodeAssistBench also show consistent accuracy improvements (+0.8–4.4%) with potential cost reductions up to 33%, supporting the idea that structure-aware interfaces improve reliability and efficiency.
Related Articles

Black Hat Asia
AI Business

Meta's latest model is as open as Zuckerberg's private school
The Register

AI fuels global trade growth as China-US flows shift, McKinsey finds
SCMP Tech

Why multi-agent AI security is broken (and the identity patterns that actually work)
Dev.to
BANKING77-77: New best of 94.61% on the official test set (+0.13pp) over our previous tests 94.48%.
Reddit r/artificial