AgentVQA

university

https://advaitgupta.github.io

AI & ML interests

None defined yet.

advaitgupta

authored a paper 6 months ago

FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

Paper • 2506.20911 • Published Jun 26, 2025 • 41

gsarch

authored 2 papers 7 months ago

Grounded Reinforcement Learning for Visual Reasoning

Paper • 2505.23678 • Published May 29, 2025 • 2

Grounding Task Assistance with Multimodal Cues from a Single Demonstration

Paper • 2505.01578 • Published May 2, 2025

advaitgupta

authored a paper 10 months ago

CoSTA$\ast$: Cost-Sensitive Toolpath Agent for Multi-turn Image Editing

Paper • 2503.10613 • Published Mar 13, 2025 • 79

gsarch

authored 5 papers over 1 year ago

ICAL: Continual Learning of Multimodal Agents by Transforming Trajectories into Actionable Insights

Paper • 2406.14596 • Published Jun 20, 2024 • 5

Neural Representations of Dynamic Visual Stimuli

Paper • 2406.02659 • Published Jun 4, 2024

ODIN: A Single Model for 2D and 3D Perception

Paper • 2401.02416 • Published Jan 4, 2024 • 13

Open-Ended Instructable Embodied Agents with Memory-Augmented Large Language Models

Paper • 2310.15127 • Published Oct 23, 2023

TIDEE: Tidying Up Novel Rooms using Visuo-Semantic Commonsense Priors

Paper • 2207.10761 • Published Jul 21, 2022