RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference Paper • 2603.17891 • Published 28 days ago • 7
The Energy of Falsehood: Detecting Hallucinations via Diffusion Model Likelihoods Paper • 2602.11364 • Published Feb 11
QEIL v2: Heterogeneous Computing for Edge Intelligence via Roofline-Derived Pareto-Optimal Energy Modeling and Multi-Objective Orchestration Paper • 2602.06057 • Published 11 days ago • 5
The Smol Training Playbook 📚 Space The secrets to building world-class LLMs • Running on CPU Upgrade • Featured • 3.1k
CogniSQL-R1-Zero: Lightweight Reinforced Reasoning for Efficient SQL Generation Paper • 2507.06013 • Published Jul 8, 2025
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 188
The Ultra-Scale Playbook 🌌 Space The ultimate guide to training LLM on large GPU Clusters • Running • 3.78k