FineWeb: decanting the web for the finest text data at scale 🍷 • 1.21k • Generate high-quality text data for LLMs using FineWeb
The Ultra-Scale Playbook 🌌 • 3.56k • The ultimate guide to training LLMs on large GPU clusters
The Smol Training Playbook 📚 • 2.59k • The secrets to building world-class LLMs
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion Paper • 2405.04883 • Published May 8, 2024
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces Paper • 2407.11895 • Published Jul 16, 2024 • 7
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling Paper • 2408.16532 • Published Aug 29, 2024 • 50
MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization Paper • 2410.12957 • Published Oct 16, 2024 • 9
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup Paper • 2410.21269 • Published Oct 28, 2024
APO: Enhancing Reasoning Ability of MLLMs via Asymmetric Policy Optimization Paper • 2506.21655 • Published Jun 26, 2025