qwen-llama-pairs
Collection
14 items • Updated • 1
This model is a result of parameter averaging (Model Soup) across 2 models.
The following models were included in the merge:
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("moe-dtoks/qwen_onto_llama_lambda-0.5-nse-0.5host")
tokenizer = AutoTokenizer.from_pretrained("moe-dtoks/qwen_onto_llama_lambda-0.5-nse-0.5host")