Xiao-Ming Wu's picture

14 6

Xiao-Ming Wu

DravenALG

https://dravenalg.github.io/

AI & ML interests

Deep Learning, Computer Vision, Embodied AI

Recent Activity

updated a model about 15 hours ago

DravenALG/Draven_Real-world_Robot_Checkpoint

new activity 3 days ago

DravenALG/Draven_Real-world_Robot_Data:Upload pull_water.zip

new activity 3 days ago

DravenALG/Draven_Real-world_Robot_Data:Upload 5 files

View all activity

Organizations

None yet

upvoted a paper 27 days ago

ProEdit: Inversion-based Editing From Prompts Done Right

Paper • 2512.22118 • Published 30 days ago • 18

upvoted a paper about 1 month ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published Dec 22, 2025 • 64

upvoted a paper about 2 months ago

Architecture Decoupling Is Not All You Need For Unified Multimodal Model

Paper • 2511.22663 • Published Nov 27, 2025 • 29

upvoted 2 papers 3 months ago

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published Oct 16, 2025 • 67

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Paper • 2510.08673 • Published Oct 9, 2025 • 126

upvoted a paper 5 months ago

Next Visual Granularity Generation

Paper • 2508.12811 • Published Aug 18, 2025 • 49