aochongoliverli/Qwen2.5-1.5B-math8k-AM-5epochs-5e-5lr-step400-dapo-5epochs-8rollouts-16384max-len-rollouts Viewer • Updated Sep 24, 2025 • 7.59k • 12
aochongoliverli/Qwen2.5-1.5B-math8k-AM-10epochs-2e-5lr-step400-dapo-5epochs-8rollouts-16384max-len-rollouts Viewer • Updated Sep 21, 2025 • 1.28k • 6
aochongoliverli/Qwen2.5-0.5B-math8k-AM-400steps-dapo-5epochs-8rollouts-16384max-len-rollouts Viewer • Updated Sep 20, 2025 • 7.59k • 17
aochongoliverli/Qwen2.5-1.5B-math8k-AM-400steps-dapo-5epochs-8rollouts-16384max-len-rollouts Viewer • Updated Sep 16, 2025 • 7.59k • 4
aochongoliverli/Qwen4B-MegaMath-pro-max-4096-len-sft-no-external-knowledge Viewer • Updated Sep 6, 2025 • 2.87k • 7
aochongoliverli/math8k-sft-QwQ-32B-16k-reasoning-traces-limo1000-len-4096 Viewer • Updated Aug 20, 2025 • 1.08k • 9
aochongoliverli/Qwen2.5-3B-math8k-AM-400steps-dapo-5epochs-8rollouts-16384max-len-rollouts Viewer • Updated Aug 18, 2025 • 7.59k • 13
aochongoliverli/math8k-sft-QwQ-32B-16k-reasoning-traces-limo1500 Viewer • Updated Aug 17, 2025 • 1.46k • 8
aochongoliverli/math8k-sft-QwQ-32B-16k-reasoning-traces-limo600 Viewer • Updated Aug 17, 2025 • 600 • 5
aochongoliverli/Qwen2.5-3B-math8k-QwQ-400steps-dapo-5epochs-8rollouts-16384max-len-rollouts Viewer • Updated Aug 16, 2025 • 7.59k • 21
aochongoliverli/math8k-sft-AM-Distill-Qwen-32B-16k-reasoning-traces Viewer • Updated Jul 31, 2025 • 8.08k • 11
aochongoliverli/Qwen2.5-3B-math8k-coldstart-100steps-dapo-20epochs-8rollouts-8192max-len-rollouts Viewer • Updated Jul 31, 2025 • 5.29k • 11
aochongoliverli/Qwen2.5-3B-math8k-coldstart-100steps-dapo-10epochs-8rollouts-8192max-len-rollouts Viewer • Updated Jul 27, 2025 • 1.28k • 6
aochongoliverli/Qwen2.5-3B-math8k-sft-distill-150steps-dapo-10epochs-8rollouts-8192max-len-rollouts Viewer • Updated Jul 24, 2025 • 6.74k • 7
aochongoliverli/Qwen2.5-3B-math8k-dapo-20epochs-8rollouts-8192max-len-rollouts Viewer • Updated Jul 22, 2025 • 6.74k • 4