Huggingface Projects

company

https://huggingface.co/

huggingface

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

akhaliq submitted a paper about 8 hours ago

AVO: Agentic Variation Operators for Autonomous Evolutionary Search

sergiopaniego updated a dataset about 20 hours ago

huggingface-projects/Deep-RL-Course-Certification

hysts updated a Space about 22 hours ago

huggingface-projects/gemma-3n-E4B-it

View all activity

akhaliq

submitted a paper to Daily Papers about 8 hours ago

AVO: Agentic Variation Operators for Autonomous Evolutionary Search

Paper • 2603.24517 • Published 2 days ago • 3

sergiopaniego

updated a dataset about 20 hours ago

huggingface-projects/Deep-RL-Course-Certification

Viewer • Updated about 10 hours ago • 1.69k • 196 • 18

hysts

updated a Space about 22 hours ago

Gemma 3n E4B It

⚡

142

Chat with a multimodal assistant using text, images, audio, or video

hysts

updated 7 Spaces 1 day ago

Gemma 3 12b It

🔥

163

Chat with AI using text, images, or a video for instant answers

Gemma 2 2B JPN IT

😻

Chatbot

Gemma 2 9B IT

😻

101

Chatbot

Gemma 2 2B IT

😻

Chatbot

Llama 3.2 3B Instruct

😻

123

Chatbot

Llama 2 13b Chat

🦙

490

Chat with Llama‑2 13B for instant AI-generated replies

Llama 2 7B Chat

🏆

482

Chat with Llama‑2 7B model

mishig

posted an update 1 day ago

Post

132

I like these models nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16 and nvidia/NVIDIA-Nemotron-3-Nano-4B-FP8 and TradingAgents: Multi-Agents LLM Financial Trading Framework (2412.20138) and https://arxiv.org/abs/2412.20138

pcuenq

updated a dataset 6 days ago

huggingface-projects/drlc-leaderboard-data

Viewer • Updated 6 days ago • 49.3k • 1.43k • 2

akhaliq

submitted a paper to Daily Papers 9 days ago

V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising

Paper • 2603.16792 • Published 10 days ago • 3

akhaliq

submitted a paper to Daily Papers 12 days ago

Multimodal OCR: Parse Anything from Documents

Paper • 2603.13032 • Published 15 days ago • 37

sergiopaniego

posted an update 14 days ago

Post

568

ICYMI, great blog by @kashif and @stas on Ulysses Sequence Parallelism: train with million-token contexts

on 4×H100s: 12x longer sequences, 3.7x throughput

learn how to integrate it with Accelerate, Transformers, and TRL ⤵️
https://huggingface.co/blog/ulysses-sp

clefourrier

authored a paper 15 days ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published 15 days ago • 63

AdinaY

submitted a paper to Daily Papers 15 days ago

Training Language Models via Neural Cellular Automata

Paper • 2603.10055 • Published 18 days ago • 7

sergiopaniego

posted an update 16 days ago

Post

325

We just released a big blog surveying 16 OSS frameworks for async RL training of LLMs!

We're building a new async GRPO trainer for TRL and as first step, we needed to understand how the ecosystem solves this problem today.

The problem: in synchronous RL training, generation dominates wall-clock time. 32K-token rollouts on a 32B model take hours while training GPUs sit completely idle. With reasoning models and agentic RL making rollouts longer and more variable, this only gets worse.

The ecosystem converged on the same fix: separate inference + training onto different GPU pools, rollout buffer, and async weight sync.

We compared 16 frameworks across 7 axes: orchestration, buffer design, weight sync, staleness management, partial rollouts, LoRA, and MoE support.

This survey is step one. The async GRPO trainer for TRL is next!

https://huggingface.co/blog/async-rl-training-landscape