- deepseek-ai/DeepSeek-R1
  Text Generation • 685B • 555k • 12.9k
- Congliu/Chinese-DeepSeek-R1-Distill-data-110k
  Viewer • 110k • 506 • 711
- The Ultra-Scale Playbook
  🌌 3.6k • The ultimate guide to training LLM on large GPU Clusters
- The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
  Paper • 2402.17764 • 627
Sunny Ratnani
SunnyRatnaniMD
AI & ML interests
None yet
Organizations
Medical License Exam