siyan zhao

siyanzhao

siyan_zhao

AI & ML interests

Machine Learning

Recent Activity

upvoted a paper about 2 months ago

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models

authored a paper 3 months ago

d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning

authored a paper 3 months ago

Inpainting-Guided Policy Optimization for Diffusion Large Language Models

siyanzhao's datasets 5

siyanzhao/OpenThoughts2-1M_verifiable

Viewer • Updated Apr 24 • 63.9k • 19

siyanzhao/s1-59k-minus-s1k

Viewer • Updated Apr 14 • 58k • 320

siyanzhao/prefeval_implicit_persona

Viewer • Updated Feb 24 • 1k • 293

siyanzhao/prefeval_implicit_choice

Viewer • Updated Feb 24 • 1k • 54

siyanzhao/prefeval_explicit

Viewer • Updated Feb 24 • 1k • 89 • 2