qwen2-rloo-countdown-step150 / generation_config.json
RLOO checkpoint at optimizer step 150 - Fixed prompt format, temp=0.1, lr=3e-6
85219cc verified
{
  "bos_token_id": 151643,
  "eos_token_id": 151643,
  "max_new_tokens": 2048,
  "transformers_version": "4.52.4"
}
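
A minimal sketch of how the fields above could be inspected, using only the Python standard library. The `RAW` string and the `summarize` helper are hypothetical names introduced for illustration; they are not part of the checkpoint. Note that the temp=0.1 mentioned in the commit message does not appear in this file, so the sketch makes no assumption about sampling temperature.

```python
import json

# Contents of generation_config.json, copied from the file above.
RAW = """{
  "bos_token_id": 151643,
  "eos_token_id": 151643,
  "max_new_tokens": 2048,
  "transformers_version": "4.52.4"
}"""


def summarize(raw: str) -> dict:
    """Parse the config and report a few properties worth checking (hypothetical helper)."""
    cfg = json.loads(raw)
    return {
        # In this file the BOS and EOS ids are the same value.
        "shared_bos_eos": cfg["bos_token_id"] == cfg["eos_token_id"],
        # Upper bound on the number of newly generated tokens.
        "max_new_tokens": cfg["max_new_tokens"],
        # Every key except the library version stamp.
        "generation_keys": sorted(k for k in cfg if k != "transformers_version"),
    }


summary = summarize(RAW)
print(summary)
```

If the `transformers` library is installed, the same file would typically be read through its `GenerationConfig` class instead; that path is not exercised here.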