Unlocking Feature Learning in Gated Delta Networks at Scale Paper • 2606.04048 • Published 3 days ago • 2
EXCEEDS: Extracting Complex Events via Nugget-based Grid Modeling in Scientific Domain Paper • 2406.14075 • Published Apr 24
Beyond Static Dialogues: Benchmarking Realistic, Heterogeneous, and Evolving Long-Term Memory Paper • 2605.31086 • Published 7 days ago • 5
PACEvolve++: Improving Test-time Learning for Evolutionary Search Agents Paper • 2605.07039 • Published 29 days ago • 4
ChartGen: Scaling Chart Understanding Via Code-Guided Synthetic Chart Generation Paper • 2507.19492 • Published May 31, 2025 • 1
Composition-Grounded Instruction Synthesis for Visual Reasoning Paper • 2510.15040 • Published Oct 16, 2025
ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding Paper • 2603.27064 • Published Mar 28 • 29
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published Mar 16 • 154
Towards Principled Disentanglement for Domain Generalization Paper • 2111.13839 • Published Nov 27, 2021
Exploring Transformer Backbones for Heterogeneous Treatment Effect Estimation Paper • 2202.01336 • Published Feb 2, 2022
The Impact of Symbolic Representations on In-context Learning for Few-shot Reasoning Paper • 2212.08686 • Published Dec 16, 2022
Discovering Hierarchical Latent Capabilities of Language Models via Causal Representation Learning Paper • 2506.10378 • Published Jun 12, 2025 • 2
EvoLM: In Search of Lost Language Model Training Dynamics Paper • 2506.16029 • Published Jun 19, 2025
AlgoTune: Can Language Models Speed Up General-Purpose Numerical Programs? Paper • 2507.15887 • Published Jul 19, 2025