Generative UI Is Three Things. Only One Ships.

Dev.to / 6/4/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

共有:

Key Points

Google shipped a November 2025 generative UI experiment that matched human-designed interfaces in about half of tested cases, while the rest produced unusable or incorrect artifacts.
The system sometimes took a minute or more per page to render and could output different results on every refresh, indicating reliability problems beyond what additional compute can solve.
Google’s own ELO scoring (1736.2) showed strong preference over other output formats but still only matched human experts about half the time, meaning it is “impressive” yet aimed at the wrong target.
The article argues that even modest per-screen success rates compound badly in multi-step workflows, making visible failure nearly certain for longer chains of screens.
It distinguishes between “generating code/UI as a static artifact” and true “runtime generative UI” that adapts during use, and claims the third (tamer) approach is the one that can actually win.

Continue reading this article on the original site.

Dev.to

The Verge

Dev.to

Dev.to

Dev.to