Safetensors
English
qwen3
Suu commited on
Commit
25257b0
·
verified ·
1 Parent(s): 38c8e08

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -76,7 +76,7 @@ git clone https://github.com/suu990901/Klear_Reasoner
76
  cd Klear_Reasoner
77
  pip install -r requirements.txt
78
  ```
79
- For the code, we use [Firejail](https://github.com/netblue30/firejail) for the **sandbox** environment. Additionally, we implemented multi-process control based on [Pebble](https://github.com/noxdafox/pebble), which allows user to reclaim all resources allocated to a task when execution times out. For mathematics, we use [math_verify](https://github.com/huggingface/Math-Verify) for judging.
80
 
81
  ### Using Ray for Multi-Node Training
82
  For multi-node training​​, ensure ​​all nodes are started and connected via Ray​​ before executing the training script. Below is a brief setup guide for Ray across multiple machines:
 
76
  cd Klear_Reasoner
77
  pip install -r requirements.txt
78
  ```
79
+ For the code, we use [Firejail](https://github.com/netblue30/firejail) for the **sandbox** environment. Additionally, we implemented multi-process control based on [Pebble](https://github.com/noxdafox/pebble), enabling automatic resource reclamation upon task timeout. For mathematics, we use [math_verify](https://github.com/huggingface/Math-Verify) for judging.
80
 
81
  ### Using Ray for Multi-Node Training
82
  For multi-node training​​, ensure ​​all nodes are started and connected via Ray​​ before executing the training script. Below is a brief setup guide for Ray across multiple machines: