Kwai-Klear
/

Klear-Reasoner-8B

Model card Files Files and versions

Suu commited on Aug 12

Commit

25257b0

·

verified ·

1 Parent(s): 38c8e08

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -76,7 +76,7 @@ git clone https://github.com/suu990901/Klear_Reasoner
 cd Klear_Reasoner
 pip install -r requirements.txt
 ```
-For the code, we use [Firejail](https://github.com/netblue30/firejail) for the **sandbox** environment. Additionally, we implemented multi-process control based on [Pebble](https://github.com/noxdafox/pebble), which allows user to reclaim all resources allocated to a task when execution times out. For mathematics, we use [math_verify](https://github.com/huggingface/Math-Verify) for judging.
 ### Using Ray for Multi-Node Training
 For multi-node training, ensure all nodes are started and connected via Ray before executing the training script. Below is a brief setup guide for Ray across multiple machines:

 cd Klear_Reasoner
 pip install -r requirements.txt
 ```
+For the code, we use [Firejail](https://github.com/netblue30/firejail) for the **sandbox** environment. Additionally, we implemented multi-process control based on [Pebble](https://github.com/noxdafox/pebble), enabling automatic resource reclamation upon task timeout. For mathematics, we use [math_verify](https://github.com/huggingface/Math-Verify) for judging.
 ### Using Ray for Multi-Node Training
 For multi-node training, ensure all nodes are started and connected via Ray before executing the training script. Below is a brief setup guide for Ray across multiple machines: