Nemotron-Pre-Training-Datasets Collection Large-scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 7 days ago • 84
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model Paper • 2508.14444 • Published Aug 20 • 39
Deepseek v3.2 Speciale Collection Distilled models and datasets for Deepseek v3.2 Speciale. • 11 items • Updated 10 days ago • 1
Gemini 3 Pro Collection Distilled models and datasets for Gemini 3 Pro. • 9 items • Updated 10 days ago • 1
Article Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset • Mar 15, 2024 • 13