Update README.md
Browse files
README.md
CHANGED
|
@@ -19,7 +19,7 @@ For each document, we calculated a combined educational quality score by taking
|
|
| 19 |
|
| 20 |
We trained Aleph-Alpha-GermanWeb-Quality-Classifier-BERT using up to 75,000 documents from each class. We used 95% of this dataset for training to predict the one to five scores. The model achieved an overall accuracy of 42% and a macro-average accuracy of 46% when evaluated on the remaining 5% of the data, which served as the validation set.
|
| 21 |
|
| 22 |
-
Further details, including our LLM judging prompt, can be found in our accompanying paper
|
| 23 |
|
| 24 |
## Example Snippet
|
| 25 |
|
|
|
|
| 19 |
|
| 20 |
We trained Aleph-Alpha-GermanWeb-Quality-Classifier-BERT using up to 75,000 documents from each class. We used 95% of this dataset for training to predict the one to five scores. The model achieved an overall accuracy of 42% and a macro-average accuracy of 46% when evaluated on the remaining 5% of the data, which served as the validation set.
|
| 21 |
|
| 22 |
+
Further details, including our LLM judging prompt, can be found in our [accompanying paper](https://arxiv.org/abs/2505.00022).
|
| 23 |
|
| 24 |
## Example Snippet
|
| 25 |
|