Update README.md
Browse files
README.md
CHANGED
|
@@ -59,24 +59,6 @@ print(f"Correctness score: {score:.3f}")
|
|
| 59 |
print(f"Prediction: {'Correct' if score > 0.5 else 'Incorrect'}")
|
| 60 |
```
|
| 61 |
|
| 62 |
-
## Training Details
|
| 63 |
-
|
| 64 |
-
This model was trained using the [Weaver distillation pipeline](https://github.com/HazyResearch/scaling-verification). For training your own distilled models, see the [distillation README](https://github.com/ScalingIntelligence/scaling-verification/blob/main/distillation/README.md).
|
| 65 |
-
|
| 66 |
-
## Evaluation
|
| 67 |
-
|
| 68 |
-
Evaluate this model using:
|
| 69 |
-
|
| 70 |
-
```bash
|
| 71 |
-
python evaluate_crossencoder.py \
|
| 72 |
-
--model_name "answerdotai/ModernBERT-large" \
|
| 73 |
-
--checkpoint_path "hazyresearch/Weaver_Distilled_for_MATH500" \
|
| 74 |
-
--dataset_path "hazyresearch/MATH500_with_Llama_3.1_70B_Instruct_v1" \
|
| 75 |
-
--dataset_split "data" \
|
| 76 |
-
--max_length 4096 \
|
| 77 |
-
--batch_size 64
|
| 78 |
-
```
|
| 79 |
-
|
| 80 |
## Citation
|
| 81 |
|
| 82 |
```bibtex
|
|
|
|
| 59 |
print(f"Prediction: {'Correct' if score > 0.5 else 'Incorrect'}")
|
| 60 |
```
|
| 61 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 62 |
## Citation
|
| 63 |
|
| 64 |
```bibtex
|