Holy shit⌠AGI finally has a number. đ¤Ż
For the first time, we can actually measure how close we are to real Artificial General Intelligence thanks to a new paper from Yoshua Bengio, Dawn Song, Max Tegmark, Eric Schmidt, and others.
For years, everyoneâs been throwing around the term âAGIâ like itâs some mystical milestone.
Something vague. Something out there in the future.
But this paper just pinned it down with a definition that finally makes sense.
They describe AGI as:
âAn AI that can match the cognitive versatility and proficiency of a well-educated adult.â
No hype. No âhuman-levelâ buzzwords. Just a measurable benchmark based on real cognitive science.
The framework is built on the CattellâHornâCarroll (CHC) model â the same system psychologists use to measure human intelligence.
It breaks cognition into ten abilities: reasoning, memory, math, language, perception, processing speed, and more.
Then the researchers did something wild â
they tested actual AI models against those same human benchmarks.
And hereâs what they found:
GPT-4 â 27% toward AGI
GPT-5 â 58% toward AGI
In other words, the latest model now operates at over half the cognitive range of an average human adult.
But itâs not there yet.
Both models still scored 0% in long-term memory the ability to store and recall new information over time.
Thatâs the missing piece. Continuous learning. Lifelong memory. The foundation of real general intelligence.
What makes this paper groundbreaking isnât just the data itâs the clarity.
For the first time, we have a framework that turns âAGIâ from a buzzword into a measurable spectrum.
We can finally track progress.
We can finally quantify intelligence.
And right now, humanityâs best AI is sitting at 58% of the way to AGI.
The countdown has officially begun.

The paper starts by calling out the elephant in the room: nobody actually agrees on what AGI is.
Every year the definition shifts from âbetter than humans at chessâ to âbetter than humans at everything.â
They argue this ambiguity has slowed progress.
So they built a quantifiable definition.
Every year the definition shifts from âbetter than humans at chessâ to âbetter than humans at everything.â
They argue this ambiguity has slowed progress.
So they built a quantifiable definition.

Their definition is refreshingly simple:
âAGI is an AI that matches the cognitive versatility and proficiency of a well-educated adult.â
Not superhuman. Not godlike. Just human-level cognition across domains.
This focus on versatility (breadth) and proficiency (depth) is what separates AGI from narrow AI.
âAGI is an AI that matches the cognitive versatility and proficiency of a well-educated adult.â
Not superhuman. Not godlike. Just human-level cognition across domains.
This focus on versatility (breadth) and proficiency (depth) is what separates AGI from narrow AI.

To measure that, they used the CattellâHornâCarroll (CHC) theory the same psychometric model used in human IQ tests.
It breaks intelligence into 10 âbroad abilitiesâ reasoning, memory, math, perception, etc.
Then they built AI test batteries modeled after human cognition.
It breaks intelligence into 10 âbroad abilitiesâ reasoning, memory, math, perception, etc.
Then they built AI test batteries modeled after human cognition.
When they ran GPT-4 and GPT-5 through the test batteries, the results were wildly uneven.
GPT-5 crushed it in reasoning and math.
But both models completely failed at long-term memory scoring 0%.
Basically, they can think fast⌠but forget everything.
GPT-5 crushed it in reasoning and math.
But both models completely failed at long-term memory scoring 0%.
Basically, they can think fast⌠but forget everything.

That long-term memory gap is the missing piece of AGI.
Humans learn through retention and recall.
AIs? They just remix whatâs already encoded.
Without persistent memory, true understanding canât emerge.
Humans learn through retention and recall.
AIs? They just remix whatâs already encoded.
Without persistent memory, true understanding canât emerge.

The paper also distinguishes between speed and intelligence.
GPT-5 performs simple tasks quickly, but raw speed â smarter thinking it just means faster prediction.
Humans trade speed for reflection.
AIs donât yet.
GPT-5 performs simple tasks quickly, but raw speed â smarter thinking it just means faster prediction.
Humans trade speed for reflection.
AIs donât yet.

Whatâs powerful here is that this framework lets us track AGI like a scoreboard.
Instead of arguing âare we close to AGI?â we can now ask:
âHow much cognitive ground remains before parity?â
Next frontier: memory + multimodal reasoning.
Instead of arguing âare we close to AGI?â we can now ask:
âHow much cognitive ground remains before parity?â
Next frontier: memory + multimodal reasoning.
The takeawayâs clear:
AGI isnât a magic switch itâs a spectrum of cognition.
And for the first time, we can measure it.
Right now, humanityâs smartest model stands at 58% human.
The other 42% will define the next decade.
Read the full paper: agidefinition.ai
AGI isnât a magic switch itâs a spectrum of cognition.
And for the first time, we can measure it.
Right now, humanityâs smartest model stands at 58% human.
The other 42% will define the next decade.
Read the full paper: agidefinition.ai
Generated by Thread Navigator
Press â + S to quick-export
