Godheritage/Qwen2.5-14B-Instruct-BesiegeField-CatapultRL • Reinforcement Learning • 15B • Updated 26 days ago • 11
BesiegeField/Qwen2.5-14B-Instruct-BesiegeField-CarRL • Reinforcement Learning • 15B • Updated 25 days ago • 4
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_294_FlashRL_G4-L1024 • Reinforcement Learning • 2B • Updated 25 days ago • 13
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_588_FlashRL_G4-L1024 • Reinforcement Learning • 2B • Updated 25 days ago • 22
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_882_FlashRL_G4-L1024 • Reinforcement Learning • 2B • Updated 25 days ago • 25
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L1024 • Reinforcement Learning • 2B • Updated 25 days ago • 163
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_294_FlashRL_G4-L2048_new • Reinforcement Learning • 2B • Updated 24 days ago • 498
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_588_FlashRL_G4-L2048_new • Reinforcement Learning • 2B • Updated 24 days ago • 357
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_882_FlashRL_G4-L2048_new • Reinforcement Learning • 2B • Updated 23 days ago • 355
AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L2048_new • Reinforcement Learning • 2B • Updated 23 days ago • 503
mradermacher/Qwen3-0.6B-Dakota-Grammar-RL-GGUF • Reinforcement Learning • 0.8B • Updated 6 days ago • 399