cool idea from Meta
What if we augment CoT + RL’s token space thinking into a “latent space”?
This research proposes “The Free Transformer”, with a way to let LLMs make global decisions within a latent space (via VAE encoder) that could later simplify autoregressive sampling
Generated by Thread Navigator
Press ⌘ + S to quick-export


