Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance Paper • 2511.13254 • Published 21 days ago • 134
TiDAR: Think in Diffusion, Talk in Autoregression Paper • 2511.08923 • Published 26 days ago • 111
Running on CPU Upgrade Featured 2.54k The Smol Training Playbook 📚 2.54k The secrets to building world-class LLMs
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 17 items • Updated 21 days ago • 50
Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation Paper • 2507.01957 • Published Jul 2 • 21