Quentin Gallouédec's picture

In a Training Loop 🔄

Quentin Gallouédec PRO

qgallouedec

huggingface

·

AI & ML interests

None yet

Recent Activity

new activity 1 day ago

trl-internal-testing/tiny-Qwen3VLForConditionalGeneration:Upload Qwen3VLForConditionalGeneration

new activity 1 day ago

trl-internal-testing/tiny-Qwen3VLForConditionalGeneration:Upload Qwen3VLForConditionalGeneration

new activity 1 day ago

trl-internal-testing/tiny-LlavaNextForConditionalGeneration:Upload LlavaNextForConditionalGeneration

View all activity

Organizations

upvoted an article 8 days ago

Article

EMO: Pretraining mixture of experts for emergent modularity

allenai

•

8 days ago

• 33

upvoted a paper 12 days ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 68

upvoted a changelog 16 days ago

Hugging Face Changelog

Spaces agents.md for your coding agents

30 days ago

• 283

upvoted an article 17 days ago

Article

AI evals are becoming the new compute bottleneck

evaleval

•

17 days ago

• 26

upvoted a collection 24 days ago

Tiny Models for CI

A collection of tiny models of common model architectures. Useful for e2e smoke tests across real pretrained models to validate loss behavior. • 10 items • Updated 24 days ago • 1

upvoted an article about 2 months ago

Article

TRL v1.0: Post-Training Library Built to Move with the Field

+2

qgallouedec, stevhliu, pcuenq, sergiopaniego

•

Mar 31

• 51

upvoted a paper about 2 months ago

Fine-Tuning Language Models from Human Preferences

Paper • 1909.08593 • Published Sep 18, 2019 • 4

upvoted a paper 2 months ago

Fewer Truncations Improve Language Modeling

Paper • 2404.10830 • Published Apr 16, 2024 • 5

upvoted 3 articles 2 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

+7

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 152

Article

Introducing Storage Buckets on the Hugging Face Hub

+10

Wauplin, coyotte508, XciD, victor, julien-c, lhoestq, pierric, Sylvestre, hlarcher, rajatarya, seanses, assafvayner

•

Mar 10

• 194

Article

Bringing Autonomous Driving RL to OpenEnv and TRL

sergiopaniego

•

Feb 26

• 22

upvoted a collection 3 months ago

Qwen3.5

21 items • Updated Mar 9 • 1.63k

upvoted 2 articles 3 months ago

Article

Did GPT 5.2 make a breakthrough discovery in theoretical physics?

dlouapre

•

Feb 19

• 62

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

+4

ggerganov, ngxson, allozaur, lysandre, victor, julien-c

•

Feb 20

• 505

upvoted 2 papers 3 months ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 149

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

Paper • 2602.03773 • Published Feb 3 • 13

upvoted 3 articles 3 months ago

Article

Scaling OpenEnv: From Free Usage to Thousands of Concurrent Environments

burtenshaw

•

Jan 20

• 12

Article

Transformers.js v4: Now Available on NPM!

Xenova, nico-martin

•

Feb 9

• 95

Article

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

dvgodoy

•

Feb 11, 2025

• 123

upvoted a paper 3 months ago

Rethinking the Trust Region in LLM Reinforcement Learning

Paper • 2602.04879 • Published Feb 4 • 37