hdong0/Qwen3-1.7B-base-Open-R1-GRPO_deepscaler_acc_8192_nokl Text Generation • 2B • Updated Oct 7 • 6
hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_2048_to_16384_nokl Text Generation • 8B • Updated Oct 12 • 12
hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_4096_to_16384_nokl Text Generation • 8B • Updated Oct 14 • 5
hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_8192_to_16384_nokl Text Generation • 8B • Updated Oct 15 • 7