VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud • Paper 2303.14408 • Published Mar 25, 2023
Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE • Paper 2311.02684 • Published Nov 5, 2023
Asynchronous Large Language Model Enhanced Planner for Autonomous Driving • Paper 2406.14556 • Published Jun 20, 2024
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy • Paper 2510.13778 • Published Oct 15, 2025