Xu Wayen

wilye

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 months ago

RL makes MLLMs see better than SFT

Paper • 2510.16333 • Published Oct 18, 2025 • 48

upvoted a paper 3 months ago

CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning

Paper • 2509.22647 • Published Sep 26, 2025 • 32

updated 2 datasets 6 months ago

VisuLogic/VisuLogic

Viewer • Updated Jul 9, 2025 • 1k • 1.07k • 11

VisuLogic/VisuLogic-Train

Preview • Updated Jun 28, 2025 • 252 • 10

upvoted 2 papers 6 months ago

ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing

Paper • 2506.19848 • Published Jun 24, 2025 • 26

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12, 2025 • 74

liked a model 8 months ago

ByteDance/Dolphin

Image-Text-to-Text • 0.4B • Updated Jul 16, 2025 • 2.74k • 510

upvoted a paper 8 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5, 2025 • 28

liked 2 datasets 8 months ago

Zhaowc/AudioCaps

Viewer • Updated Apr 14, 2025 • 102k • 311 • 2

VisuLogic/VisuLogic-Train

Preview • Updated Jun 28, 2025 • 252 • 10

upvoted a collection 8 months ago

Papers to Read

Collection

208 items • Updated Aug 24, 2025 • 10

authored 3 papers 8 months ago

P-RAG: Progressive Retrieval Augmented Generation For Planning on Embodied Everyday Task

Paper • 2409.11279 • Published Sep 17, 2024 • 1

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14, 2025 • 306

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Paper • 2504.15279 • Published Apr 21, 2025 • 78

liked 4 datasets 8 months ago

upvoted a paper 8 months ago

Equivariant Image Modeling

Paper • 2503.18948 • Published Mar 24, 2025 • 15

liked a model 8 months ago

Cusyoung/CrossEarth

Updated Jun 20, 2025 • 6
