h zhao's picture

4

h zhao

n1cck

huaiyizhao

AI & ML interests

None yet

Recent Activity

new activity about 1 month ago

WeiboAI/VibeThinker-1.5B:hello? 虽然是一个推理模型，但有的方面也太离谱了吧

commented on a paper 4 months ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

commented on a paper 4 months ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

View all activity

Organizations

None yet

New activity in WeiboAI/VibeThinker-1.5B about 1 month ago

hello? 虽然是一个推理模型，但有的方面也太离谱了吧

#8 opened about 1 month ago by

commented 3 papers 4 months ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10 • 660 •

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19 • 118 •

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19 • 118 •

New activity in Time-MQA/TSQA 5 months ago

Open sourcing evaluation scripts?

#1 opened 5 months ago by