
10 min
Local Coding Model on a 24 GB Mac: Why My NVFP4 Setup Failed Quietly
Qwen3.6-35B on an M4 Pro 24 GB: technically loaded, 100% GPU -- and yet 0.1 tokens per second. An honest diagnosis, five lessons, and hardware recommendations for others.
behind-the-scenes

