1 22 60

MC

Dreamer312

Dreamer

AI & ML interests

NLP, CV, LLM, AGENT, RL

Recent Activity

upvoted a paper 7 days ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

upvoted a paper 7 days ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

upvoted a paper 21 days ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

View all activity

Organizations

None yet

upvoted 2 papers 7 days ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published 20 days ago • 364

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published 14 days ago • 239

upvoted a paper 21 days ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published 24 days ago • 144

upvoted a paper 29 days ago

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Paper • 2603.21065 • Published Mar 22 • 77

upvoted 2 papers 3 months ago

Scaling Embeddings Outperforms Scaling Experts in Language Models

Paper • 2601.21204 • Published Jan 29 • 102

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published Jan 23 • 180

liked 2 models 5 months ago

WeiboAI/VibeThinker-1.5B

Text Generation • 2B • Updated Nov 24, 2025 • 1.97k • 517

moonshotai/Kimi-K2-Thinking

Text Generation • 1.1T • Updated Jan 30 • 94.2k • • 1.7k

liked a Space 6 months ago

Robot Learning: A Tutorial

📝

386

Explore the Robot Learning tutorial online

commented 2 papers 6 months ago

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19 •

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19 •

liked a dataset 6 months ago

Agent-Ark/Toucan-1.5M

Viewer • Updated Oct 4, 2025 • 1.65M • 3.43k • 207

commented 4 papers 11 months ago

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19 •

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19 •

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19 •

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19 •

upvoted a paper 11 months ago

Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published May 20, 2025 • 76

upvoted a collection 11 months ago

Llama 4

Collection

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated 6 days ago • 56

liked 2 models 11 months ago

unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF

Image-Text-to-Text • 401B • Updated Jun 18, 2025 • 11.5k • 45

meta-llama/Llama-4-Maverick-17B-128E-Instruct

Image-Text-to-Text • 402B • Updated May 22, 2025 • 30.6k • • 479

MC

AI & ML interests

Recent Activity

Organizations

Dreamer312's activity

Robot Learning: A Tutorial