GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published 20 days ago • 364
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published 14 days ago • 239
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published 24 days ago • 144
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Paper • 2603.21065 • Published Mar 22 • 77
Scaling Embeddings Outperforms Scaling Experts in Language Models Paper • 2601.21204 • Published Jan 29 • 102
SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization Paper • 2505.12346 • Published May 18, 2025 • 19 • 16
SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization Paper • 2505.12346 • Published May 18, 2025 • 19 • 16
SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization Paper • 2505.12346 • Published May 18, 2025 • 19 • 16
SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization Paper • 2505.12346 • Published May 18, 2025 • 19 • 16
SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization Paper • 2505.12346 • Published May 18, 2025 • 19 • 16
SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization Paper • 2505.12346 • Published May 18, 2025 • 19 • 16
Llama 4 Collection Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated 6 days ago • 56
unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF Image-Text-to-Text • 401B • Updated Jun 18, 2025 • 11.5k • 45
meta-llama/Llama-4-Maverick-17B-128E-Instruct Image-Text-to-Text • 402B • Updated May 22, 2025 • 30.6k • • 479