-
deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated • 405k • • 12.9k -
Congliu/Chinese-DeepSeek-R1-Distill-data-110k
Viewer • Updated • 110k • 355 • 718 -
The Ultra-Scale Playbook
🌌3.63kThe ultimate guide to training LLM on large GPU Clusters
-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 627
Sunny Ratnani
SunnyRatnaniMD
·
AI & ML interests
None yet