Introducing the first discrete diffusion pipeline for text in Diffusers -- LLaDA2 by @TheInclusionAI 🔥
It follows an MoE architecture w/ 16B total params. It is definitely not SOTA across the board yet, but hopefully that flips soon.
Check out the links below to learn more ⬇️ ...
Mar 26, 2026