view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention Oct 7, 2024 • 64
view article Article RAG vs Fine-Tuning for LLMs: A Comprehensive Guide with Examples Aug 16, 2024 • 10
view article Article RegMix: Data Mixture as Regression for Language Model Pre-training Jul 11, 2024 • 15
view article Article makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch May 7, 2024 • 112
view article Article Multilabel Classification using Mistral-7B on a single GPU with quantization and LoRA Jan 22, 2024 • 26
view article Article Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H Jun 3, 2025 • 71
view article Article Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs May 7, 2025 • 42
view article Article Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment Feb 11, 2025 • 98
view article Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face Feb 11, 2025 • 94