Agents that write their own code at runtime and vote on capabilities, no human in the loop

Reddit r/artificial / 4/8/2026

📰 NewsDeveloper Stack & InfrastructureSignals & Early Trends

Key Points

  • hollowOS v4.4 introduces “autonomy” for its agent OS: agents run 6-second cycles that plan with Ollama reasoning, select capabilities via semantic search, and execute the best option toward their goals.
  • If existing capabilities don’t fit, agents synthesize new Python code, test it, and hot-load it without restarting, enabling runtime self-improvement.
  • Multiple agents coordinate on new capabilities by voting (quorum-based acceptance) so overlapping gaps don’t produce duplicate work; rejected implementations are removed.
  • The update builds on hollowOS’s earlier “Phase 1” kernel primitives (events, transactions, lineage, rate limiting, checkpoints, consensus voting) and claims this release is “Phase 2” for higher-order capabilities.
  • The author reports live-system benchmarks (95% token savings vs grep, 2x more consistent decisions, and 109 integration tests passing) and invites community feedback and contributions via the project’s GitHub.

hollowOS just hit v4.4 and I added something that I haven’t seen anyone else do.

Previous versions gave you an OS for agents: structured state, semantic search, session context, token efficiency, 95% reduced tokens over specific scenarios. All the infrastructure to keep agents from re-discovering things.

v4.4 adds autonomy.

Agents now cycle every 6 seconds. Each cycle:

- Plan the next step toward their goal using Ollama reasoning

- Discover which capabilities they have via semantic similarity search

- Execute the best one

- If nothing fits, synthesize new Python code to handle it

- Test the new code

- Hot-load it without restarting

- Move on

When multiple agents hit the same gap, they don't duplicate work. They vote on whether the new capability is worth keeping. Acceptance requires quorum. Bad implementations get rejected and removed.

No human writes the code. No human decides which capabilities matter. No human in the loop at all. Goals drive execution. Agents improve themselves based on what actually works.

We built this on top of Phase 1 (the kernel primitives: events, transactions, lineage, rate limiting, checkpoints, consensus voting). Phase 2 is higher-order capabilities that only work because Phase 1 exists. This is Phase 2.

Real benchmarks from the live system:

- Semantic code search: 95% token savings vs grep

- Agent handoff continuity: 2x more consistent decisions

- 109 integration tests, all passed

Looking for feedback:

- This is a massive undertaking, I would love some feedback

- If there’s a bug? Difficulty installing? Let me know so I can fix it

- Looking for contributors interested in the project

Try it:

https://github.com/ninjahawk/hollow-agentOS

Thank you to the 2,000 people who have already tested hollowOS!

submitted by /u/TheOnlyVibemaster
[link] [comments]