Models

531

Full-text search

Active filters: RLHF

NousResearch/Hermes-2-Pro-Mistral-7B

Text Generation • 7B • Updated Sep 8, 2024 • 4.5k • 501

NousResearch/Hermes-2-Pro-Llama-3-8B

Text Generation • 8B • Updated Sep 14, 2024 • 223k • • 448

llm-blender/PairRM

Text Generation • Updated Jan 22, 2024 • 377 • 206

NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO-adapter

Updated Feb 20, 2024 • 16

NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO

Text Generation • 47B • Updated Apr 30, 2024 • 8.69k • 453

NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO-GGUF

47B • Updated Feb 20, 2024 • 1.46k • 71

NousResearch/Nous-Hermes-2-Mistral-7B-DPO

Text Generation • 7B • Updated Apr 30, 2024 • 1.22k • 218

NousResearch/Hermes-2-Pro-Mistral-7B-GGUF

7B • Updated Mar 28, 2024 • 4.43k • 247

mlx-community/Hermes-2-Pro-Mistral-7B-4bit

1B • Updated Mar 14, 2024 • 151 • 4

mlx-community/Hermes-2-Pro-Mistral-7B-8bit

2B • Updated Mar 16, 2024 • 326 • 8

aaditya/Llama3-OpenBioLLM-70B

Text Generation • Updated Jan 18, 2025 • 4.78k • 504

NousResearch/Hermes-2-Theta-Llama-3-8B

Text Generation • 8B • Updated Sep 8, 2024 • 10.4k • • 204

NousResearch/Hermes-2-Pro-Llama-3-70B

Text Generation • 71B • Updated Sep 8, 2024 • 74 • • 35

mlx-community/Hermes-2-Pro-Mistral-7B-3bit

0.9B • Updated Dec 14, 2024 • 51 • 1

OpenAssistant/reward-model-deberta-v3-base

Text Classification • Updated Jan 26, 2023 • 1.65k • • 13

OpenAssistant/reward-model-electra-large-discriminator

Text Classification • Updated Jan 26, 2023 • 437 • 5

OpenAssistant/reward-model-deberta-v3-large

Text Classification • Updated Feb 17, 2023 • 773 • 26

OpenAssistant/reward-model-deberta-v3-large-v2

Text Classification • Updated Feb 1, 2023 • 40.1k • • 245

llm-blender/pair-ranker

Text Ranking • 0.4B • Updated Apr 2, 2025 • 9 • 3

nicholasKluge/RewardModelPT

Text Classification • 0.1B • Updated Jun 9, 2025 • 23

nicholasKluge/RewardModel

Text Classification • 0.1B • Updated Jun 9, 2025 • 317 • 1

fb700/chatglm-fitness-RLHF

Updated Mar 6, 2024 • 268

fb700/Bofan-chatglm-Best-lora

Updated Aug 24, 2023 • 7 • 11

kubernetes-bad/Ligma-L2-13b

Updated Sep 19, 2023 • 9 • 3

berkeley-nest/Starling-LM-7B-alpha

Text Generation • 7B • Updated Mar 20, 2024 • 1.88k • 559

berkeley-nest/Starling-RM-7B-alpha

Updated Jul 30, 2024 • 72 • 104

LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 3

LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 5 • 1

LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 5 • 2

LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 5 • 1