Commit 59b6e0f
1 Parent(s): 56405c9
misc
README.md CHANGED
@@ -162,11 +162,11 @@ RLVE_Gymenv = RlveGymEnv.from_docker_image(
 
 ### Observation
 **RlveGymObservation**:
-- `problem_input` (Optional[str]) - The input of the problem; if it is `None`, it means that the problem generation has not been run
-- `verifier_result` (Optional[dict]) - Contains `reward` as the raw reward, `accuracy` as the 0/1 correctness, and `format_score` as the 0/1 format correctness.
-- `success` (bool) - `True` or `False` indicates whether the operation
+- `problem_input` (Optional[str]) - The input of the problem; if it is `None`, it means that the problem generation has not been run or has failed.
+- `verifier_result` (Optional[dict]) - Contains `reward` as the raw reward, `accuracy` as the 0/1 correctness, and `format_score` as the 0/1 format correctness; if it is `None`, it means that the verification has failed.
+- `success` (bool) - `True` or `False` indicates whether the operation succeeded.
 - `message` (str) - The explanation of `success`.
-- `reward` (Optional[float]) - The value is `verifier_result["reward"]
+- `reward` (Optional[float]) - The value is `verifier_result["reward"]` when `verifier_result` is not `None` (otherwise, `reward` is also `None`).
 
 ## Advanced Usage
 
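
The `None` handling described in the updated field list can be sketched as follows. This is a minimal illustration, not the project's implementation: `ObservationSketch`, `derive_reward`, and `summarize` are hypothetical names introduced here, and the dataclass only mirrors the documented fields rather than reproducing the real `RlveGymObservation` class.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ObservationSketch:
    """Hypothetical stand-in mirroring the documented RlveGymObservation fields."""
    problem_input: Optional[str]
    verifier_result: Optional[dict]
    success: bool
    message: str
    reward: Optional[float]


def derive_reward(verifier_result: Optional[dict]) -> Optional[float]:
    """Documented rule: reward is verifier_result["reward"], or None when there is no result."""
    return None if verifier_result is None else verifier_result["reward"]


def summarize(obs: ObservationSketch) -> str:
    """Check the optional fields before using them (assumed consumer-side logic)."""
    if not obs.success:
        return f"operation failed: {obs.message}"
    if obs.problem_input is None:
        return "no problem input (generation not run or failed)"
    if obs.verifier_result is None:
        return "no verifier result (verification failed); reward is None"
    result = obs.verifier_result
    return (
        f"reward={obs.reward}, accuracy={result['accuracy']}, "
        f"format_score={result['format_score']}"
    )
```

A consumer written this way never indexes into `verifier_result` without first checking it, which is the case the new wording for `reward` makes explicit.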