
The new ARC-AGI-3 benchmark drops AI systems into interactive game environments that humans solve with ease. No frontier model breaks the 1 percent mark because the benchmark strips away their biggest advantages.
The article ARC-AGI-3 offers $2M to any AI that matches untrained humans, yet every frontier model scores below 1% appeared first on The Decoder.