Aethon: A Reference-Based Replication Primitive for Constant-Time Instantiation of Stateful AI Agents
arXiv cs.AI / 4/15/2026
💬 OpinionDeveloper Stack & InfrastructureSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that moving from stateless LLM inference to stateful, tool-using agents exposes limitations in current AI runtimes, especially around latency and memory costs from heavy instantiation.
- It introduces Aethon, a reference-based replication primitive that creates stateful agent instances as compositional views over stable definitions, layered memory, and local context overlays instead of fully materializing each instance.
- Aethon’s architecture and memory model use layered inheritance with copy-on-write semantics to decouple creation cost from inherited structure, enabling near-constant-time instantiation.
- The authors analyze how this design affects complexity, scalability, multi-agent orchestration, and enterprise governance, positioning reference-based instantiation as a production-oriented systems abstraction.
- Overall, Aethon is presented as a step toward an infrastructure pattern where agents behave like lightweight, composable execution identities that can be spawned, specialized, and governed at scale.
Related Articles

As China’s biotech firms shift gears, can AI floor the accelerator?
SCMP Tech

Why AI Teams Are Standardizing on a Multi-Model Gateway
Dev.to

From Chaos to Cadence: Automating Your Post-Show Follow-Up with AI
Dev.to

a claude code/codex plugin to run autoresearch on your repository
Dev.to

AI startup claims to automate app making but actually just uses humans
Dev.to