Update README.md
Browse files
README.md
CHANGED
|
@@ -76,7 +76,7 @@ git clone https://github.com/suu990901/Klear_Reasoner
|
|
| 76 |
cd Klear_Reasoner
|
| 77 |
pip install -r requirements.txt
|
| 78 |
```
|
| 79 |
-
For the code, we use [Firejail](https://github.com/netblue30/firejail) for the **sandbox** environment. Additionally, we implemented multi-process control based on [Pebble](https://github.com/noxdafox/pebble),
|
| 80 |
|
| 81 |
### Using Ray for Multi-Node Training
|
| 82 |
For multi-node training, ensure all nodes are started and connected via Ray before executing the training script. Below is a brief setup guide for Ray across multiple machines:
|
|
|
|
| 76 |
cd Klear_Reasoner
|
| 77 |
pip install -r requirements.txt
|
| 78 |
```
|
| 79 |
+
For the code, we use [Firejail](https://github.com/netblue30/firejail) for the **sandbox** environment. Additionally, we implemented multi-process control based on [Pebble](https://github.com/noxdafox/pebble), enabling automatic resource reclamation upon task timeout. For mathematics, we use [math_verify](https://github.com/huggingface/Math-Verify) for judging.
|
| 80 |
|
| 81 |
### Using Ray for Multi-Node Training
|
| 82 |
For multi-node training, ensure all nodes are started and connected via Ray before executing the training script. Below is a brief setup guide for Ray across multiple machines:
|