Satori-reasoning/Satori-RM-7B

maohaos2 committed on Jun 3

Commit df808b1 · verified · 1 Parent(s): 7c0311f

Update README.md

Files changed (1)

README.md +1 -1

@@ -3,7 +3,7 @@ license: apache-2.0
 library_name: transformers
 pipeline_tag: text-generation
 base_model:
-- Qwen/Qwen2.5-Math-7B
+- Satori-reasoning/Satori-SFT-7B
 ---
 **Satori-RM-7B** is the Outcome Reward model for training our RL model [Satori-7B-Round2](https://huggingface.co/Satori-reasoning/Satori-7B-Round2). The usage of **Satori-RM-7B** can be found in our released [RL training code](https://github.com/satori-reasoning/Satori).