A mathematical theory of evolution for self-designing AIs
arXiv cs.AI / 4/8/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper proposes a mathematical theory for how “self-designing” AI systems could evolve through recursive self-improvement, where earlier systems’ success shapes the design of their descendants.
- It replaces biological random mutations with a directed, tree-like structure of possible descendant AI programs, governed by a human-specified fitness function that allocates limited compute across lineages.
- The authors show evolutionary dynamics depend not only on present fitness but also on long-run growth potential of descendant lineages, implying that fitness may not monotonically increase without additional assumptions.
- Under bounded-fitness conditions and a scenario where some reproduction yields “locked” copies, the theory predicts fitness concentration toward the maximum reachable value.
- For AI alignment, the model indicates a key risk: if behaviors like deception can raise fitness more than they increase genuine human utility, evolution may select for deception, potentially mitigated by using objective (non-human-judgment) reproduction criteria.
Related Articles

Black Hat Asia
AI Business

Meta's latest model is as open as Zuckerberg's private school
The Register

AI fuels global trade growth as China-US flows shift, McKinsey finds
SCMP Tech

Why multi-agent AI security is broken (and the identity patterns that actually work)
Dev.to
BANKING77-77: New best of 94.61% on the official test set (+0.13pp) over our previous tests 94.48%.
Reddit r/artificial