h zhao
n1cck
AI & ML interests
None yet
Recent Activity
new activity
about 1 month ago
WeiboAI/VibeThinker-1.5B: hello? It may be a reasoning model, but in some respects it is just too absurd
commented on
a paper
3 months ago
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing
commented on
a paper
4 months ago
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR
Organizations
None yet