Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
hdong0
/
Qwen2.5-Math-1.5B-GRPO_deepscaler_temp1_prompt1
like
0
Text Generation
Transformers
Safetensors
agentica-org/DeepScaleR-Preview-Dataset
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Qwen2.5-Math-1.5B-GRPO_deepscaler_temp1_prompt1
Commit History
End of training
f65eafe
verified
hdong0
commited on
Aug 7
Model save
b045c9c
verified
hdong0
commited on
Aug 7
Training in progress, step 1000
39eb9da
verified
hdong0
commited on
Aug 7
Training in progress, step 950
9afe4de
verified
hdong0
commited on
Aug 7
Training in progress, step 900
b897f8f
verified
hdong0
commited on
Aug 7
Training in progress, step 850
55b96af
verified
hdong0
commited on
Aug 7
Training in progress, step 800
4660fcf
verified
hdong0
commited on
Aug 7
Training in progress, step 750
76df2ab
verified
hdong0
commited on
Aug 7
Training in progress, step 700
2174c8e
verified
hdong0
commited on
Aug 7
Training in progress, step 650
a7bf129
verified
hdong0
commited on
Aug 7
Training in progress, step 600
110fb75
verified
hdong0
commited on
Aug 7
Training in progress, step 550
bc831e8
verified
hdong0
commited on
Aug 7
Training in progress, step 500
42a7dd9
verified
hdong0
commited on
Aug 7
Training in progress, step 450
ff784af
verified
hdong0
commited on
Aug 7
Training in progress, step 400
33d02e5
verified
hdong0
commited on
Aug 7
Training in progress, step 350
c0776e6
verified
hdong0
commited on
Aug 7
Training in progress, step 300
133a491
verified
hdong0
commited on
Aug 7
Training in progress, step 250
02b334e
verified
hdong0
commited on
Aug 7
Training in progress, step 200
b87797f
verified
hdong0
commited on
Aug 7
Training in progress, step 150
cd24bb8
verified
hdong0
commited on
Aug 7
Training in progress, step 100
b5a24af
verified
hdong0
commited on
Aug 7
Training in progress, step 50
e4063d3
verified
hdong0
commited on
Aug 7
initial commit
b27aa7f
verified
hdong0
commited on
Aug 7