23 2

Zeyu Zhang

SteveZeyuZhang

https://steve-zeyu-zhang.github.io/

steve-zeyu-zhang

AI & ML interests

Geometric Learning, Generative AI, Computer Vision, Robotics, AI for Health

Recent Activity

authored a paper 9 days ago

DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion

updated a model 21 days ago

SteveZeyuZhang/mesh

published a model 25 days ago

SteveZeyuZhang/mesh

View all activity

Organizations

commented 2 papers about 1 month ago

EgoLCD: Egocentric Video Generation with Long Context Diffusion

Paper • 2512.04515 • Published Dec 4, 2025 • 5 •

BlockVid: Block Diffusion for High-Quality and Consistent Minute-Long Video Generation

Paper • 2511.22973 • Published Nov 28, 2025 • 4 •

New activity in heyuanyu/LV-Bench about 1 month ago

Update README.md

#1 opened about 1 month ago by

SteveZeyuZhang

commented a paper about 1 month ago

EvoVLA: Self-Evolving Vision-Language-Action Model

Paper • 2511.16166 • Published Nov 20, 2025 • 5 •

New activity in AIGeeksGroup/VaseVQA-3D 2 months ago

Upload vaseglb.tar.gz

#2 opened 2 months ago by

zzzzzzaza

commented 2 papers 3 months ago

VLA-R1: Enhancing Reasoning in Vision-Language-Action Models

Paper • 2510.01623 • Published Oct 2, 2025 • 10 •

UniVid: The Open-Source Unified Video Model

Paper • 2509.24200 • Published Sep 29, 2025 • 4 •

commented 2 papers 4 months ago

VaseVQA: Multimodal Agent and Benchmark for Ancient Greek Pottery

Paper • 2509.17191 • Published Sep 21, 2025 • 1 •

StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes

Paper • 2509.16415 • Published Sep 19, 2025 • 2 •

commented 2 papers 5 months ago

ReMoMask: Retrieval-Augmented Masked Motion Generation

Paper • 2508.02605 • Published Aug 4, 2025 • 4 •

3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding

Paper • 2507.23478 • Published Jul 31, 2025 • 15 •

commented a paper 6 months ago

PresentAgent: Multimodal Agent for Presentation Video Generation

Paper • 2507.04036 • Published Jul 5, 2025 • 10 •

commented a paper 8 months ago

MediAug: Exploring Visual Augmentation in Medical Imaging

Paper • 2504.18983 • Published Apr 26, 2025 • 7 •

commented 2 papers 9 months ago

DiffuMural: Restoring Dunhuang Murals with Multi-scale Diffusion

Paper • 2504.09513 • Published Apr 13, 2025 •

3D CoCa: Contrastive Learners are 3D Captioners

Paper • 2504.09518 • Published Apr 13, 2025 • 5 •

commented 4 papers 10 months ago

PathoHR: Breast Cancer Survival Prediction on High-Resolution Pathological Images

Paper • 2503.17970 • Published Mar 23, 2025 • 3 •

Motion Anything: Any to Motion Generation

Paper • 2503.06955 • Published Mar 10, 2025 • 35 •

Motion Anything: Any to Motion Generation

Paper • 2503.06955 • Published Mar 10, 2025 • 35 •

DOEI: Dual Optimization of Embedding Information for Attention-Enhanced Class Activation Maps

Paper • 2502.15885 • Published Feb 21, 2025 • 2 •

commented a paper about 1 year ago

KMM: Key Frame Mask Mamba for Extended Motion Generation

Paper • 2411.06481 • Published Nov 10, 2024 • 5 •