Announcing NeurIPS 2025 E2LM Competition: Early Training Evaluation of Language Models (Jul 4)
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance (May 21)
Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models. (May 15)
Welcome Falcon Mamba: The first strong attention-free 7B model (Aug 12, 2024)
GaLore: Advancing Large Model Training on Consumer-grade Hardware (Mar 20, 2024)
Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face (Dec 11, 2023)
Overview of natively supported quantization schemes in 🤗 Transformers (Sep 12, 2023)
Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA (May 24, 2023)
Introducing RWKV - An RNN with the advantages of a transformer (May 15, 2023)