Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

AgentVQA

university
https://advaitgupta.github.io
Activity Feed

AI & ML interests

None defined yet.

Yin's profile picture Gabriel H Sarch's profile picture Advait Gupta's profile picture

advaitgupta 
authored a paper 6 months ago

FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

Paper • 2506.20911 • Published Jun 26, 2025 • 41
gsarch 
authored 2 papers 7 months ago

Grounded Reinforcement Learning for Visual Reasoning

Paper • 2505.23678 • Published May 29, 2025 • 2

Grounding Task Assistance with Multimodal Cues from a Single Demonstration

Paper • 2505.01578 • Published May 2, 2025
advaitgupta 
authored a paper 10 months ago

CoSTA$\ast$: Cost-Sensitive Toolpath Agent for Multi-turn Image Editing

Paper • 2503.10613 • Published Mar 13, 2025 • 79
gsarch 
authored 5 papers over 1 year ago

ICAL: Continual Learning of Multimodal Agents by Transforming Trajectories into Actionable Insights

Paper • 2406.14596 • Published Jun 20, 2024 • 5

Neural Representations of Dynamic Visual Stimuli

Paper • 2406.02659 • Published Jun 4, 2024

ODIN: A Single Model for 2D and 3D Perception

Paper • 2401.02416 • Published Jan 4, 2024 • 13

Open-Ended Instructable Embodied Agents with Memory-Augmented Large Language Models

Paper • 2310.15127 • Published Oct 23, 2023

TIDEE: Tidying Up Novel Rooms using Visuo-Semantic Commonsense Priors

Paper • 2207.10761 • Published Jul 21, 2022
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs