Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 9 days ago • 122
Article How To Build a News Agent with GPT-OSS, Hugging Face Inference & Gradio Aug 14 • 25
Running • The Ultra-Scale Playbook 🌌 • 3.56k • The ultimate guide to training LLMs on large GPU clusters
Qwen/Qwen3-Coder-480B-A35B-Instruct Text Generation • 480B • Updated Aug 21 • 157k • 1.25k
dbmdz/bert-large-cased-finetuned-conll03-english Token Classification • 0.3B • Updated Sep 6, 2023 • 1.78M • 92