-
MASS: Motion-Aware Spatial-Temporal Grounding for Physics Reasoning and Comprehension in Vision-Language Models
Paper • 2511.18373 • Published • 5 -
Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO
Paper • 2511.13288 • Published • 17 -
Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens
Paper • 2511.19418 • Published • 26 -
SAM 3: Segment Anything with Concepts
Paper • 2511.16719 • Published • 105
Innocent Emmanuel
EL102
AI & ML interests
None yet
Recent Activity
updated
a collection
9 days ago
My thing
updated
a collection
10 days ago
My thing
updated
a collection
11 days ago
My thing
Organizations
None yet