aochongoliverli/Qwen2.5-1.5B-math8k-AM-5epochs-5e-5lr-step400-dapo-5epochs-8rollouts-16384max-len-rollouts Viewer • Updated Sep 24 • 7.59k • 14
aochongoliverli/Qwen2.5-3B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-2e-5lr-step500 Text Generation • 3B • Updated Sep 22 • 3
aochongoliverli/Qwen2.5-3B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-2e-5lr-step500 Text Generation • 3B • Updated Sep 22 • 3
aochongoliverli/Qwen2.5-3B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-2e-5lr-step400 Text Generation • 3B • Updated Sep 22 • 6
aochongoliverli/Qwen2.5-3B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-2e-5lr-step400 Text Generation • 3B • Updated Sep 22 • 6
aochongoliverli/Qwen2.5-1.5B-math8k-AM-5epochs-5e-5lr-step400-dapo-5epochs-8rollouts-16384max-len-rollouts Viewer • Updated Sep 24 • 7.59k • 14
aochongoliverli/Qwen2.5-1.5B-math8k-AM-10epochs-2e-5lr-step400-dapo-5epochs-8rollouts-16384max-len-rollouts Viewer • Updated Sep 21 • 1.28k • 3
aochongoliverli/Qwen2.5-1.5B-math8k-AM-10epochs-2e-5lr-step400-dapo-5epochs-8rollouts-16384max-len-rollouts Viewer • Updated Sep 21 • 1.28k • 3
aochongoliverli/Qwen2.5-0.5B-math8k-AM-400steps-dapo-5epochs-8rollouts-16384max-len-rollouts Viewer • Updated Sep 20 • 7.59k • 30
aochongoliverli/Qwen2.5-0.5B-math8k-AM-400steps-dapo-5epochs-8rollouts-16384max-len-rollouts Viewer • Updated Sep 20 • 7.59k • 30
aochongoliverli/Qwen2.5-1.5B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-2e-5lr-step600 Text Generation • 2B • Updated Sep 19 • 2
aochongoliverli/Qwen2.5-1.5B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-2e-5lr-step600 Text Generation • 2B • Updated Sep 19 • 2
aochongoliverli/Qwen2.5-0.5B-math8k-AM-400steps-dapo-5epochs-8rollouts-16384max-len-rollouts Viewer • Updated Sep 20 • 7.59k • 30
aochongoliverli/Qwen2.5-0.5B-math8k-AM-400steps-dapo-5epochs-8rollouts-16384max-len-rollouts Viewer • Updated Sep 20 • 7.59k • 30
aochongoliverli/Qwen2.5-0.5B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-5e-5lr-step500 Text Generation • 0.5B • Updated Sep 18 • 4
aochongoliverli/Qwen2.5-0.5B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-5e-5lr-step500 Text Generation • 0.5B • Updated Sep 18 • 4
aochongoliverli/Qwen2.5-0.5B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-5e-5lr-step400 Text Generation • 0.5B • Updated Sep 18 • 4
aochongoliverli/Qwen2.5-0.5B-math8k-distill-AM-Distill-Qwen-32B-16k-5epochs-5e-5lr-step400 Text Generation • 0.5B • Updated Sep 18 • 4
aochongoliverli/Qwen2.5-1.5B-math8k-AM-400steps-dapo-5epochs-8rollouts-16384max-len-rollouts Viewer • Updated Sep 16 • 7.59k • 4