ZhiyuanZeng commited on
Commit
59b6e0f
·
1 Parent(s): 56405c9
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -162,11 +162,11 @@ RLVE_Gymenv = RlveGymEnv.from_docker_image(
162
 
163
  ### Observation
164
  **RlveGymObservation**:
165
- - `problem_input` (Optional[str]) - The input of the problem; if it is `None`, it means that the problem generation has not been run, or it failed.
166
- - `verifier_result` (Optional[dict]) - Contains `reward` as the raw reward, `accuracy` as the 0/1 correctness, and `format_score` as the 0/1 format correctness.
167
- - `success` (bool) - `True` or `False` indicates whether the operation succeeds.
168
  - `message` (str) - The explanation of `success`.
169
- - `reward` (Optional[float]) - The value is `verifier_result["reward"]`.
170
 
171
  ## Advanced Usage
172
 
 
162
 
163
  ### Observation
164
  **RlveGymObservation**:
165
+ - `problem_input` (Optional[str]) - The input of the problem; if it is `None`, it means that the problem generation has not been run or has failed.
166
+ - `verifier_result` (Optional[dict]) - Contains `reward` as the raw reward, `accuracy` as the 0/1 correctness, and `format_score` as the 0/1 format correctness; if it is `None`, it means that the verification has failed.
167
+ - `success` (bool) - `True` or `False` indicates whether the operation succeeded.
168
  - `message` (str) - The explanation of `success`.
169
+ - `reward` (Optional[float]) - The value is `verifier_result["reward"]` when `verifier_result` is not `None` (otherwise, `reward` is also `None`).
170
 
171
  ## Advanced Usage
172