@anemll: Here is gemma-4-26B-A4B-it on ...
@anemll
8 views
Apr 05, 2026
Advertisement
1
Here is gemma-4-26B-A4B-it on A17 Pro chip w/8GB memory ( MacBook Neo)
~ 7 t/s running on AMX ( GPU is slower on A17)
Gemma's 4 expert is x2.3 larger than Qwen
See Qwen 35B below
~ 7 t/s running on AMX ( GPU is slower on A17)
Gemma's 4 expert is x2.3 larger than Qwen
See Qwen 35B below
2
Qwen 3.5-35B-A3B, ~ 7.5 tps
with larger cache due to smaller expert
with larger cache due to smaller expert
3
A18 Pro as per screenshot