Yuxuan Hu's picture

15 5

Yuxuan Hu PRO

Ashenone3

·

AI & ML interests

None yet

Recent Activity

updated a dataset 14 days ago

Ashenone3/LM-Searcher-T2I-SFT

published a dataset 14 days ago

Ashenone3/LM-Searcher-T2I-SFT

liked a model 25 days ago

Ashenone3/LM-Searcher

View all activity

Organizations

upvoted 2 papers about 2 months ago

MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning

Paper • 2510.14958 • Published Oct 16 • 22

DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving

Paper • 2510.12796 • Published Oct 14 • 12

upvoted 2 papers 2 months ago

WebGen-Agent: Enhancing Interactive Website Generation with Multi-Level Feedback and Step-Level Reinforcement Learning

Paper • 2509.22644 • Published Sep 26 • 20

VoiceAssistant-Eval: Benchmarking AI Assistants across Listening, Speaking, and Viewing

Paper • 2509.22651 • Published Sep 26 • 22

upvoted a paper 4 months ago

Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation

Paper • 2508.05635 • Published Aug 7 • 73

upvoted a paper 6 months ago

UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents

Paper • 2505.21496 • Published May 27 • 38

upvoted 3 papers 7 months ago

GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning

Paper • 2505.17022 • Published May 22 • 27

MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning

Paper • 2505.10557 • Published May 15 • 47

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch

Paper • 2505.03733 • Published May 6 • 17

upvoted a paper 11 months ago

EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation

Paper • 2501.01895 • Published Jan 3 • 55

upvoted 2 papers 12 months ago

StreamChat: Chatting with Streaming Video

Paper • 2412.08646 • Published Dec 11, 2024 • 18

Chimera: Improving Generalist Model with Domain-Specific Experts

Paper • 2412.05983 • Published Dec 8, 2024 • 9

upvoted 3 papers about 1 year ago

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Paper • 2411.10640 • Published Nov 16, 2024 • 46

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Paper • 2410.13861 • Published Oct 17, 2024 • 56

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code

Paper • 2410.08196 • Published Oct 10, 2024 • 47