Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
ritzzai
's Collections
OPRM
OPRM
updated
Mar 2
Upvote
3
ritzzai/OPRM-7B
8B
•
Updated
Feb 10
•
2
•
2
ritzzai/OPRM-14B
15B
•
Updated
Feb 10
•
8
•
2
ritzzai/OPRM-32B
33B
•
Updated
Feb 10
•
3
•
1
ritzzai/OPRM-72B
73B
•
Updated
Feb 10
•
6
•
1
ritzzai/OPRM-RgFT-7B
8B
•
Updated
Feb 12
•
8
•
2
ritzzai/OPRM-RgFT-14B
15B
•
Updated
Feb 12
•
5
•
1
ritzzai/OPRM-RgFT-32B
33B
•
Updated
Feb 13
•
6
•
1
ritzzai/OPRM-RgFT-72B
Updated
Feb 9
•
1
Learning Ordinal Probabilistic Reward from Preferences
Paper
•
2602.12660
•
Published
Feb 13
•
3
Upvote
3
Share collection
View history
Collection guide
Browse collections