Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
demonlxrd
/
olmoe-openhermes-ultrafeedback-dora-dpo
like
1
Text Generation
PEFT
Safetensors
Transformers
teknium/OpenHermes-2.5
HuggingFaceH4/ultrafeedback_binarized
English
dpo
dora
qlora
lora
olmoe
alignment
preference-learning
trl
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Use this model
main
olmoe-openhermes-ultrafeedback-dora-dpo
45.5 MB
1 contributor
History:
3 commits
demonlxrd
Upload DoRA adapter for OLMoE-1B-7B with DPO
77b996e
verified
about 1 month ago
pref
Upload folder using huggingface_hub
about 1 month ago
.gitattributes
Safe
1.52 kB
initial commit
about 1 month ago
README.md
6.16 kB
Upload DoRA adapter for OLMoE-1B-7B with DPO
about 1 month ago
adapter_config.json
861 Bytes
Upload folder using huggingface_hub
about 1 month ago
adapter_model.safetensors
8.66 MB
xet
Upload folder using huggingface_hub
about 1 month ago
chat_template.jinja
587 Bytes
Upload folder using huggingface_hub
about 1 month ago
special_tokens_map.json
Safe
293 Bytes
Upload folder using huggingface_hub
about 1 month ago
tokenizer.json
Safe
3.57 MB
Upload folder using huggingface_hub
about 1 month ago
tokenizer_config.json
Safe
5.4 kB
Upload folder using huggingface_hub
about 1 month ago