@omarsar0: Hierarchical Reasoning Model...
@omarsar0
23 views
Aug 03, 2025
3
It moves away from token-level reasoning by using two coupled modules: a slow, high-level planner and a fast, low-level executor.
The two recurrent networks operate at different timescales to collaboratively solve tasks
Leads to greater reasoning depth and efficiency with only 27M parameters and no pretraining!
The two recurrent networks operate at different timescales to collaboratively solve tasks
Leads to greater reasoning depth and efficiency with only 27M parameters and no pretraining!
9
Analysis reveals that HRM learns a dimensionality hierarchy similar to the cortex: the high-level module operates in a higher-dimensional space than the low-level one (PR: 89.95 vs. 30.22).
The authors suggest that this is an emergent trait not present in untrained models.
Paper: arxiv.org/abs/2506.21734
The authors suggest that this is an emergent trait not present in untrained models.
Paper: arxiv.org/abs/2506.21734







