Neural Computers

arXiv cs.LG / 4/9/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes “Neural Computers” (NCs) as a new machine form that unifies computation, memory, and I/O inside a learned runtime state, positioning models themselves as the “running computer.”
  • It contrasts NCs with conventional computers (explicit programs), agents (acting in external environments), and world models (learning dynamics), arguing that the model can directly provide execution with stable interfaces.
  • As an initial step, the authors test whether NC primitives can be learned purely from collected input/output traces—without access to instrumented program state—by instantiating NCs as video models that roll out screen frames from instructions and pixels (plus user actions when available).
  • The experiments suggest early NCs can acquire interface-related primitives such as I/O alignment and short-horizon control, but key goals like routine reuse, controlled updates, and symbolic stability remain unresolved.
  • The authors lay out a roadmap toward a “Completely Neural Computer” (CNC) with durable capability reuse, explicit reprogramming, and stable execution, potentially defining a computing paradigm beyond today’s dominant ML-agent/world-model approaches.

Abstract

We propose a new frontier: Neural Computers (NCs) -- an emerging machine form that unifies computation, memory, and I/O in a learned runtime state. Unlike conventional computers, which execute explicit programs, agents, which act over external execution environments, and world models, which learn environment dynamics, NCs aim to make the model itself the running computer. Our long-term goal is the Completely Neural Computer (CNC): the mature, general-purpose realization of this emerging machine form, with stable execution, explicit reprogramming, and durable capability reuse. As an initial step, we study whether early NC primitives can be learned solely from collected I/O traces, without instrumented program state. Concretely, we instantiate NCs as video models that roll out screen frames from instructions, pixels, and user actions (when available) in CLI and GUI settings. These implementations show that learned runtimes can acquire early interface primitives, especially I/O alignment and short-horizon control, while routine reuse, controlled updates, and symbolic stability remain open. We outline a roadmap toward CNCs around these challenges. If overcome, CNCs could establish a new computing paradigm beyond today's agents, world models, and conventional computers.