Hugging Face
On Vacation 🏝️
3
5
6
Peter L. Chen
PeterLauLukCh
2 followers · 6 following
https://peterlaulukchen.github.io/
AI & ML interests
None yet
Recent Activity
Updated a dataset about 21 hours ago: MOAwR/RedditSummary-Alignment
Published a dataset 1 day ago: MOAwR/RedditSummary-Alignment
Upvoted a paper 3 days ago: Exploration v.s. Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward
Organizations
PeterLauLukCh's datasets (5)
PeterLauLukCh/ultrafeedback_binarized • Viewer • Updated Apr 5 • 61.1k • 25
PeterLauLukCh/Offline-RL-Preference-o3 • Viewer • Updated Mar 23 • 1.65k • 36
PeterLauLukCh/Offline-RL-Preference-Qwen • Viewer • Updated Mar 21 • 2.2k • 29
PeterLauLukCh/Offline-RL-Preference-Ultra • Viewer • Updated Mar 21 • 1.64k • 19
PeterLauLukCh/Offline-RL-Preference • Viewer • Updated Mar 20 • 2.2k • 33