view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12 • 473
view article Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub +2 Feb 12 • 79
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Jan 16 • 76
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 219
view article Article Releasing Outlines-core 0.1.0: structured generation in Rust and Python +5 Oct 22, 2024 • 44