Model Stock: All we need is just a few fine-tuned models
Paper
•
2403.19522
•
Published
•
13
This is a merge of pre-trained language models created using mergekit.
This model was merged using the Model Stock merge method using mistralai/Mistral-Nemo-Instruct-2407 as a base.
The following models were included in the merge:
The following YAML configuration was used to produce this model:
out_dtype: bfloat16
merge_method: model_stock
base_model: mistralai/Mistral-Nemo-Instruct-2407
models:
- model: ArliAI/Mistral-Nemo-12B-ArliAI-RPMax-v1.3
- model: DavidAU/MN-Dark-Planet-TITAN-12B
- model: HumanLLMs/Human-Like-Mistral-Nemo-Instruct-2407
- model: inflatebot/MN-12B-Mag-Mell-R1
- model: Khetterman/AbominationScience-12B-v4
- model: Khetterman/DarkAtom-12B-v3
- model: LatitudeGames/Wayfarer-12B
- model: mergekit-community/MN-Sappho-g3-12B
parameters:
weight: 1.5
- model: mergekit-community/MN-Sappho-j-12B
- model: mergekit-community/MN-Sappho-n-12B
parameters:
weight: 2.5
- model: mistralai/Mistral-Nemo-Base-2407
parameters:
weight: 2.0
- model: Nitral-Archive/Diogenes-12B
- model: nbeerbower/Mistral-Nemo-Gutenberg-Doppel-12B
- model: PocketDoc/Dans-PersonalityEngine-V1.1.0-12b
- model: PygmalionAI/Eleusis-12B
- model: ToastyPigeon/Sto-vo-kor-12B
- model: yuyouyu/Mistral-Nemo-BD-RP