Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ritzzai 's Collections
OPRM

OPRM

updated Mar 2
Upvote
3

  • ritzzai/OPRM-7B

    8B • Updated Feb 10 • 2 • 2

  • ritzzai/OPRM-14B

    15B • Updated Feb 10 • 8 • 2

  • ritzzai/OPRM-32B

    33B • Updated Feb 10 • 3 • 1

  • ritzzai/OPRM-72B

    73B • Updated Feb 10 • 6 • 1

  • ritzzai/OPRM-RgFT-7B

    8B • Updated Feb 12 • 8 • 2

  • ritzzai/OPRM-RgFT-14B

    15B • Updated Feb 12 • 5 • 1

  • ritzzai/OPRM-RgFT-32B

    33B • Updated Feb 13 • 6 • 1

  • ritzzai/OPRM-RgFT-72B

    Updated Feb 9 • 1

  • Learning Ordinal Probabilistic Reward from Preferences

    Paper • 2602.12660 • Published Feb 13 • 3
Upvote
3
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs