CoMeT: Collaborative Memory Transformer for Efficient Long Context Modeling Paper • 2602.01766 • Published Feb 2 • 1
F4Splat: Feed-Forward Predictive Densification for Feed-Forward 3D Gaussian Splatting Paper • 2603.21304 • Published 3 days ago • 31 • 3
ToolRosetta: Bridging Open-Source Repositories and Large Language Model Agents through Automated Tool Standardization Paper • 2603.09290 • Published 15 days ago • 5 • 2
REVERE: Reflective Evolving Research Engineer for Scientific Workflows Paper • 2603.20667 • Published 4 days ago • 14 • 2
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published 2 days ago • 97 • 4
SEM: Sparse Embedding Modulation for Post-Hoc Debiasing of Vision-Language Models Paper • 2603.19028 • Published 6 days ago • 16 • 2
Not All Layers Are Created Equal: Adaptive LoRA Ranks for Personalized Image Generation Paper • 2603.21884 • Published 2 days ago • 2 • 2
In-the-Wild Camouflage Attack on Vehicle Detectors through Controllable Image Editing Paper • 2603.19456 • Published 6 days ago • 1 • 2
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Paper • 2603.21065 • Published 3 days ago • 63 • 3
Understanding Behavior Cloning with Action Quantization Paper • 2603.20538 • Published 5 days ago • 1 • 2
SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning Paper • 2603.22057 • Published 2 days ago • 40 • 2
Scalable Prompt Routing via Fine-Grained Latent Task Discovery Paper • 2603.19415 • Published 6 days ago • 5 • 2
VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding Paper • 2603.22285 • Published 2 days ago • 45 • 2
Repurposing Geometric Foundation Models for Multi-view Diffusion Paper • 2603.22275 • Published 2 days ago • 30 • 2