ndl-core-collection Collection A collection of UK government structured datasets and textual sources for research, analysis, and AI applications. • 6 items • Updated Jan 12 • 3
view article Article Raw Robot Video to VLA-Ready Training Data: Annotating LeRobot Datasets with Nomadic and HuggingFace Buckets 5 days ago • 15
view article Article Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding 7 days ago • 43
Datasets of AI Ecosystem Data Collection Datasets shared on the Hub to support research and investigation of the AI ecosystem • 3 items • Updated 9 days ago • 1
Visualizations of AI Ecosystem Data Collection Spaces and demos showing the evolution of the AI ecosystem • 6 items • Updated 9 days ago • 1
Research on AI Ecosystem Data Collection Research papers leveraging AI ecosystem data • 6 items • Updated 9 days ago • 1
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 16 days ago • 79
view changelog Hugging Face Changelog Introducing Buckets: S3-like storage on the Hub 16 days ago • 182
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale Paper • 2602.23866 • Published 27 days ago • 88
view article Article easytranscriber: Speech Recognition with Accurate Timestamps in the HF Ecosystem 23 days ago • 5
view article Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix Nov 3, 2025 • 64