HuggingFaceH4
/

zephyr-7b-beta

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

edbeeching HF Staff commited on Oct 26, 2023

Commit

63bc8ee

·

1 Parent(s): 8bf7bff

Update README.md

Files changed (1) hide show

README.md +22 -0

README.md CHANGED Viewed

@@ -37,6 +37,28 @@ Zephyr is a series of language models that are trained to act as helpful assista
 - **Repository:** https://github.com/huggingface/alignment-handbook
 - **Demo:** https://huggingface.co/spaces/HuggingFaceH4/zephyr-chat
 ## Intended uses & limitations
 The model was initially fine-tuned on a filtered and preprocessed of the [`UltraChat`](https://huggingface.co/datasets/stingning/ultrachat) dataset, which contains a diverse range of synthetic dialogues generated by ChatGPT.

 - **Repository:** https://github.com/huggingface/alignment-handbook
 - **Demo:** https://huggingface.co/spaces/HuggingFaceH4/zephyr-chat
+## Performance
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6200d0a443eb0913fa2df7cc/raxvt5ma16d7T23my34WC.png)
+| Model | Size | Align | MT-Bench (score) | AlpacaEval (win %) |
+|-------------|-----|----|---------------|--------------|
+| StableLM-Tuned-α | 7B| dSFT |2.75| -|
+| MPT-Chat |  7B |dSFT |5.42| -|
+| Xwin-LMv0.1 | 7B| dPPO| 6.19| 87.83|
+| Mistral-Instructv0.1 | 7B|  - | 6.84 |-|
+| Zephyr-7b-α |7B|  dDPO| 6.88| -|
+| **Zephyr-7b-β** |7B|  dDPO| 7.34| 90.60|
+| Falcon-Instruct |  40B |dSFT |5.17 |45.71|
+| Guanaco 65B |  SFT |6.41| 71.80|
+| Llama2-Chat |  70B |RLHF |6.86| 92.66|
+| Vicuna v1.3 |  33B |dSFT |7.12 |88.99|
+| WizardLM v1.0 |  70B |dSFT |7.71 |-|
+| Xwin-LM v0.1 |   70B |dPPO |- |95.57|
+| GPT-3.5-turbo | - |RLHF |7.94 |89.37|
+| Claude 2 |  - |RLHF |8.06| 91.36|
+| GPT-4 |  -| RLHF |8.99| 95.28|
 ## Intended uses & limitations
 The model was initially fine-tuned on a filtered and preprocessed of the [`UltraChat`](https://huggingface.co/datasets/stingning/ultrachat) dataset, which contains a diverse range of synthetic dialogues generated by ChatGPT.