StableVLA: Towards Robust Vision-Language-Action Models without Extra Data Paper • 2605.18287 • Published 7 days ago • 15
Stop When Reasoning Converges: Semantic-Preserving Early Exit for Reasoning Models Paper • 2605.17672 • Published 8 days ago • 22
AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in UMMs via Decompositional Verifiable Reward Paper • 2605.12495 • Published 13 days ago • 35
view article Article Supercharge your OCR Pipelines with Open Models +5 merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq • Oct 21, 2025 • 313
Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging -- An Open Recipe Paper • 2502.09056 • Published Feb 13, 2025 • 32
FFN Fusion: Rethinking Sequential Computation in Large Language Models Paper • 2503.18908 • Published Mar 24, 2025 • 20
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation Paper • 2502.14846 • Published Feb 20, 2025 • 16
Balanced Aggregation: Understanding and Fixing Aggregation Bias in GRPO Paper • 2605.04077 • Published Apr 14 • 7
Motion-Aware Caching for Efficient Autoregressive Video Generation Paper • 2605.01725 • Published 22 days ago • 8
StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction Paper • 2605.06642 • Published 18 days ago • 27
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 22 days ago • 162
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors Paper • 2605.00658 • Published 24 days ago • 84
PhyCo: Learning Controllable Physical Priors for Generative Motion Paper • 2604.28169 • Published 25 days ago • 13