4 17 2

Xiuyu Li

xiuyul

https://xiuyuli.com/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 19 hours ago

Arbitrage: Efficient Reasoning via Advantage-Aware Speculation

upvoted a paper about 22 hours ago

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

updated a model about 1 month ago

xiuyul/deepcoder-sandbox-Qwen3-4B-Instruct-2507-32rank-4e-05lr-step100-merged

View all activity

Organizations

upvoted a paper about 19 hours ago

Arbitrage: Efficient Reasoning via Advantage-Aware Speculation

Paper • 2512.05033 • Published 7 days ago • 13

upvoted a paper about 22 hours ago

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

Paper • 2512.07843 • Published 17 days ago • 19

updated a model about 1 month ago

xiuyul/deepcoder-sandbox-Qwen3-4B-Instruct-2507-32rank-4e-05lr-step100-merged

Text Generation • 4B • Updated Nov 10 • 8

published a model about 1 month ago

xiuyul/deepcoder-sandbox-Qwen3-4B-Instruct-2507-32rank-4e-05lr-step100-merged

Text Generation • 4B • Updated Nov 10 • 8

updated a model about 1 month ago

xiuyul/deepcoder-sandbox-Qwen3-4B-Instruct-2507-32rank-4e-05lr-step80-merged

Text Generation • 4B • Updated Nov 9 • 8

published 2 models about 1 month ago

xiuyul/deepcoder-sandbox-Qwen3-4B-Instruct-2507-32rank-4e-05lr-step80-merged

Text Generation • 4B • Updated Nov 9 • 8

xiuyul/deepcoder-Qwen-Qwen3-4B-Instruct-2507-32rank-4e-05lr-8group-128batch-1_0temp-seed0-merged

Text Generation • 4B • Updated Nov 3 • 6

updated a model about 1 month ago

xiuyul/deepcoder-Qwen-Qwen3-4B-Instruct-2507-32rank-4e-05lr-8group-128batch-1_0temp-seed0-merged

Text Generation • 4B • Updated Nov 3 • 6

upvoted a paper 4 months ago

XQuant: Breaking the Memory Wall for LLM Inference with KV Cache Rematerialization

Paper • 2508.10395 • Published Aug 14 • 42

updated a dataset 4 months ago

Parallel-Reasoning/apr_sft_data

Preview • Updated Aug 15 • 18 • 1

published a dataset 4 months ago

Parallel-Reasoning/apr_sft_data

Preview • Updated Aug 15 • 18 • 1

updated a dataset 4 months ago

Parallel-Reasoning/sosp_sft_data

Viewer • Updated Aug 15 • 500k • 24

published a dataset 4 months ago

Parallel-Reasoning/sosp_sft_data

Viewer • Updated Aug 15 • 500k • 24

updated a dataset 4 months ago

Parallel-Reasoning/countdown_problems

Viewer • Updated Aug 15 • 501k • 31

published a dataset 4 months ago

Parallel-Reasoning/countdown_problems

Viewer • Updated Aug 15 • 501k • 31

Xiuyu Li

AI & ML interests

Recent Activity

Organizations

xiuyul's activity