Running 75 Unlocking On-Policy Distillation for Any Model Family 📝 75 Apply on-policy distillation to any model family
view article Article Exploring Direct Tensor Manipulation in Language Models: A Case Study in Binary-Level Model Enhancement Nov 7, 2025 • 4