Paper: Model Stock: All we need is just a few fine-tuned models (arXiv:2403.19522)
This is a merge of pre-trained language models created using mergekit.
This model was merged using the Model Stock merge method, with cognitivecomputations/dolphin-2.6-mistral-7b-dpo-laser as the base model.
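
For readers unfamiliar with Model Stock, the following is a minimal sketch of the per-layer interpolation described in the Model Stock paper (arXiv:2403.19522), written in pure Python on toy vectors. It is not mergekit's implementation. The function names are hypothetical, and averaging the pairwise cosines when more than two fine-tuned models are present is an assumption made for this illustration.

```python
import math
from itertools import combinations


def dot(a, b):
    """Dot product of two equal-length lists of floats."""
    return sum(x * y for x, y in zip(a, b))


def model_stock_layer(w0, finetuned):
    """Merge one flattened layer (hypothetical helper, illustration only).

    w0        -- base-model weights for the layer
    finetuned -- list of at least two fine-tuned weight vectors
    Assumes every fine-tuned vector differs from w0 (non-zero offset).
    """
    n = len(finetuned)
    # Offsets of each fine-tuned model from the base model.
    deltas = [[w - b for w, b in zip(ws, w0)] for ws in finetuned]
    # Assumption: estimate cos(theta) as the mean pairwise cosine of the offsets.
    cosines = [
        dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))
        for a, b in combinations(deltas, 2)
    ]
    cos_theta = sum(cosines) / len(cosines)
    # Interpolation ratio from the paper: t = N cos(theta) / (1 + (N - 1) cos(theta)).
    t = n * cos_theta / (1 + (n - 1) * cos_theta)
    # Plain average of the fine-tuned weights.
    w_avg = [sum(col) / n for col in zip(*finetuned)]
    # Interpolate between the average and the base model.
    merged = [t * a + (1 - t) * b for a, b in zip(w_avg, w0)]
    return merged, t
```

In this sketch, offsets that point in the same direction give t close to 1 (the result stays near the average of the fine-tuned weights), while orthogonal offsets give t = 0 (the result falls back to the base weights).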
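The following models were included in the merge: openchat/openchat_3.5, Open-Orca/Mistral-7B-OpenOrca, and cognitivecomputations/dolphin-2.8-mistral-7b-v02.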
The following YAML configuration was used to produce this model:
merge_method: model_stock
base_model: cognitivecomputations/dolphin-2.6-mistral-7b-dpo-laser
models:
- model: openchat/openchat_3.5
- model: Open-Orca/Mistral-7B-OpenOrca
- model: cognitivecomputations/dolphin-2.8-mistral-7b-v02
dtype: bfloat16
tokenizer_source: base
int8_mask: true
normalize: true
name: 7B-Cetacea
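
A configuration like the one above is normally passed to mergekit's `mergekit-yaml` command, which takes the config path followed by an output directory. The sketch below wraps that call in Python. The file name `config.yml`, the output directory `./7B-Cetacea`, and the helper names are assumptions for illustration, not details taken from this model card.

```python
import shutil
import subprocess


def mergekit_command(config_path, output_dir):
    """Build the argument list for `mergekit-yaml <config> <output_dir>`."""
    return ["mergekit-yaml", config_path, output_dir]


def run_merge(config_path="config.yml", output_dir="./7B-Cetacea"):
    """Run the merge (hypothetical wrapper; paths are placeholder defaults).

    Note that running this downloads every model named in the config.
    """
    cmd = mergekit_command(config_path, output_dir)
    # Fail early with a clear message if mergekit is not installed.
    if shutil.which(cmd[0]) is None:
        raise RuntimeError("mergekit-yaml not found on PATH; install mergekit first")
    subprocess.run(cmd, check=True)
```

Calling `run_merge()` from the directory containing the saved config would start the merge. Building the command in a separate function keeps it easy to inspect before anything is downloaded.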