FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing Paper • 2506.20911 • Published Jun 26, 2025 • 41
Grounding Task Assistance with Multimodal Cues from a Single Demonstration Paper • 2505.01578 • Published May 2, 2025
CoSTA$\ast$: Cost-Sensitive Toolpath Agent for Multi-turn Image Editing Paper • 2503.10613 • Published Mar 13, 2025 • 79
ICAL: Continual Learning of Multimodal Agents by Transforming Trajectories into Actionable Insights Paper • 2406.14596 • Published Jun 20, 2024 • 5
Open-Ended Instructable Embodied Agents with Memory-Augmented Large Language Models Paper • 2310.15127 • Published Oct 23, 2023
TIDEE: Tidying Up Novel Rooms using Visuo-Semantic Commonsense Priors Paper • 2207.10761 • Published Jul 21, 2022