Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
h zhao's picture
4

h zhao

n1cck
  • huaiyizhao

AI & ML interests

None yet

Recent Activity

new activity about 1 month ago
WeiboAI/VibeThinker-1.5B:hello? 虽然是一个推理模型,但有的方面也太离谱了吧
commented on a paper 4 months ago
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing
commented on a paper 4 months ago
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR
View all activity

Organizations

None yet

New activity in WeiboAI/VibeThinker-1.5B about 1 month ago

hello? 虽然是一个推理模型,但有的方面也太离谱了吧

6
#8 opened about 1 month ago by
yu0226
commented 3 papers 4 months ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10 • 660 •
56

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19 • 118 •
6

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19 • 118 •
6
New activity in Time-MQA/TSQA 5 months ago

Open sourcing evaluation scripts?

#1 opened 5 months ago by
n1cck
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs