hdong0
/

Qwen2.5-Math-1.5B-GRPO_deepscaler_temp1_prompt1

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

Qwen2.5-Math-1.5B-GRPO_deepscaler_temp1_prompt1 / trainer_state.json

hdong0's picture

Model save

b045c9c verified 4 months ago

496 kB

File too large to display, you can check the raw version instead.