Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm Paper • 2511.04570 • Published Nov 6 • 208
Open Multimodal Retrieval-Augmented Factual Image Generation Paper • 2510.22521 • Published Oct 26 • 30
Evaluating Multimodal Large Language Models on Video Captioning via Monte Carlo Tree Search Paper • 2506.11155 • Published Jun 11 • 1
Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning Paper • 2506.04207 • Published Jun 4 • 48
FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data Paper • 2408.06273 • Published Aug 12, 2024 • 10