@arcprize
Mar 25, 2026
1
Announcing ARC-AGI-3
The only unsaturated agentic intelligence benchmark in the world
Humans score 100%, AI <1%
This human-AI gap demonstrates we do not yet have AGI
Most benchmarks test what models already know; ARC-AGI-3 tests how they learn
3
We created an in-house game studio and built 135 novel environments from scratch
No instructions, Core Knowledge Priors-only
In order to beat these games, AI must:
• Explore the environment
• Form hypotheses
• Execute a plan
• Learn and adapt
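The four steps above can be sketched as a loop. This is a minimal illustration on a made-up toy game, not an ARC-AGI-3 environment and not the agent API from docs.arcprize.org: `HiddenRulesGame`, `play`, and the action names are all hypothetical. The agent is never told what each action does; it tries each one, records what it observed, plans from those beliefs, and revises them after every step.

```python
from typing import Dict, List, Optional


class HiddenRulesGame:
    """Hypothetical toy 1-D game: the agent is not told what actions do."""

    def __init__(self, size: int = 7, goal: int = 6) -> None:
        self.size, self.goal, self.pos = size, goal, 0
        # Hidden rule (unknown to the agent): action name -> displacement.
        self._effects: Dict[str, int] = {"A": -1, "B": 1, "C": 0}

    def actions(self) -> List[str]:
        return list(self._effects)

    def step(self, action: str) -> int:
        # Move, clamped to the board, and return the new position.
        moved = self.pos + self._effects[action]
        self.pos = min(self.size - 1, max(0, moved))
        return self.pos

    def solved(self) -> bool:
        return self.pos == self.goal


def play(game: HiddenRulesGame, max_steps: int = 50) -> Optional[int]:
    """Return the number of steps used to solve the game, or None."""
    beliefs: Dict[str, int] = {}  # hypothesis: action -> observed displacement
    visited = {game.pos}
    steps = 0

    # 1. Explore: try every action once. 2. Form hypotheses from the result.
    for action in game.actions():
        before = game.pos
        after = game.step(action)
        beliefs[action] = after - before
        visited.add(after)
        steps += 1
        if game.solved():
            return steps

    while not game.solved() and steps < max_steps:
        before = game.pos
        # 3. Plan: prefer an action predicted to reach an unvisited state.
        novel = [a for a in beliefs if before + beliefs[a] not in visited]
        action = novel[0] if novel else max(beliefs, key=lambda a: abs(beliefs[a]))
        after = game.step(action)
        # 4. Learn and adapt: overwrite the hypothesis with the latest evidence.
        beliefs[action] = after - before
        visited.add(after)
        steps += 1

    return steps if game.solved() else None


print(play(HiddenRulesGame()))  # 3 exploratory steps + 5 planned steps -> 8
```

Note that the first observation of action "A" is misleading (it is clamped at the left wall, so it looks like a no-op). Because beliefs are overwritten on every step, the toy agent would correct that hypothesis if it tried "A" again later rather than holding on to it.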
4
ARC-AGI-3 is a useful research tool to analyze model behavior
Key failure modes seen in our early testing:
• Thinking it is playing another game
• Holding on to an early hypothesis
• Unable to forecast into the future
Both AI + human runs have sharable replays
Watch Gemini 3.1 do well on some games, poorly on others:
arcprize.org/replay/34a9614…
arcprize.org/replay/d0e0768…
6
Also live today: ARC Prize 2026 - 3 tracks, $2,000,000 in prizes available!
Get involved:
• Play a Game: arcprize.org/tasks/ls20
• Build Agents: docs.arcprize.org
• Win Prizes: arcprize.org/competitions/2…


