Evaluating Multimodal Large Language Models on Video Captioning via Monte Carlo Tree Search Paper • 2506.11155 • Published Jun 11 • 1