In a Training Loop 🔄

2 12 23

Subarno Sadat Barno

barnobarno666

AI & ML interests

reinforming learning

Recent Activity

liked a dataset 7 days ago

nohurry/Opus-4.6-Reasoning-3000x-filtered

liked a dataset 7 days ago

zwhe99/DeepMath-103K

liked a dataset 7 days ago

TeichAI/claude-4.5-opus-high-reasoning-250x

View all activity

Organizations

liked 3 datasets 7 days ago

#1 opened 10 days ago by

gergopool

liked a model 10 days ago

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Text Generation • 28B • Updated 3 days ago • 23k • 341

liked 2 datasets 2 months ago

RUC-AIBOX/OlymMATH-eval

Viewer • Updated May 11, 2025 • 579k • 143 • 4

brando/olympiad-bench-imo-math-boxed-825-v2-21-08-2024

Viewer • Updated Nov 6, 2024 • 1.65k • 55 • 5

liked 2 models 3 months ago

Synthyra/ESM2-8M

Fill-Mask • 7.52M • Updated 5 days ago • 1.25k • 2

biomap-research/proteinglm-100b-int4

50B • Updated Mar 17, 2025 • 78 • 11

liked a model 4 months ago

Adilbai/ppo-LunarLander-v2

Reinforcement Learning • Updated Jun 9, 2025 • 2

updated a model 4 months ago

barnobarno666/Whisper-medium-bangla

Automatic Speech Recognition • 0.8B • Updated Nov 23, 2025 • 6

published a model 4 months ago

barnobarno666/Whisper-medium-bangla

Automatic Speech Recognition • 0.8B • Updated Nov 23, 2025 • 6

upvoted a collection 4 months ago

Gemma 3 Release

Collection

28 items • Updated Aug 11, 2025 • 619

liked a Space 4 months ago

The Smol Training Playbook

📚

3.04k

The secrets to building world-class LLMs

upvoted 2 papers 5 months ago

Making Mathematical Reasoning Adaptive

Paper • 2510.04617 • Published Oct 6, 2025 • 23

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 181

liked a model 5 months ago

unsloth/Llama-3.2-3B-Instruct

Text Generation • 3B • Updated Jun 2, 2025 • 181k • 88

liked a model 6 months ago

unsloth/Qwen3-1.7B-Base-unsloth-bnb-4bit

Text Generation • Updated May 13, 2025 • 5.25k • 3

upvoted 2 papers 6 months ago

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 117

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning

Paper • 2509.13305 • Published Sep 16, 2025 • 91

Subarno Sadat Barno

AI & ML interests

Recent Activity

Organizations

barnobarno666's activity

Claude distillation

The Smol Training Playbook