agurung/Qwen2.5-7B-Instruct-flawedfiction-latent-grpo-nosft Text Generation • 8B • Updated Oct 30 • 6
agurung/v3ff_savebestearly_sft_qwen7B_25percent_lr_1e4_bptt_offset_newprompt Text Generation • 8B • Updated Oct 25 • 18
agurung/v2ff_savebestearly_sft_qwen7B_25percent_lr_1e4_bptt_offset Text Generation • 8B • Updated Oct 25 • 6
agurung/v1ff_savebestearly_sft_qwen7B_25percent_lr_1e4_bptt_offset Text Generation • 8B • Updated Oct 25 • 5