Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

AGI Workshop @ Tsinghua

non-profit
Activity Feed

AI & ML interests

Generative AI/LLM/Multimodal

Rongao Li's profile picture haipengluo's profile picture Haoji Zhang's profile picture

zhang9302002 
authored 4 papers 3 months ago

Ponder & Press: Advancing Visual GUI Agent towards General Computer Control

Paper • 2412.01268 • Published Dec 2, 2024 • 1

UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning

Paper • 2505.14231 • Published May 20, 2025 • 53

Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning

Paper • 2508.04416 • Published Aug 6, 2025 • 1

Flash-VStream: Efficient Real-Time Understanding for Long Video Streams

Paper • 2506.23825 • Published Jun 30, 2025
zhang9302002 
authored a paper over 1 year ago

Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams

Paper • 2406.08085 • Published Jun 12, 2024 • 17
rongaoli 
updated a Space over 1 year ago
Running
1

README

👀
1

rongaoli 
authored a paper about 2 years ago

TACO: Topics in Algorithmic COde generation dataset

Paper • 2312.14852 • Published Dec 22, 2023 • 4
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs