Pretraining Frame Preservation in Autoregressive Video Memory Compression
Paper
•
2512.23851
•
Published
•
25
None defined yet.
InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning
Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO