Spaces:
Sleeping
Sleeping
Commit
·
667cc50
1
Parent(s):
29d8de2
gradio markdown edit
Browse files- demo_watermark.py +3 -0
demo_watermark.py
CHANGED
|
@@ -461,6 +461,9 @@ def run_gradio(args, model=None, device=None, tokenizer=None):
|
|
| 461 |
and as the bias becomes very large the watermark transitions from "soft" to "hard".
|
| 462 |
For a hard watermark, nearly all tokens are green, but this can have a detrimental effect on
|
| 463 |
generation quality, especially when there is not a lot of flexibility in the distribution.
|
|
|
|
|
|
|
|
|
|
| 464 |
- z-score threshold : the z-score cuttoff for the hypothesis test. Higher thresholds (such as 4.0) make
|
| 465 |
_false positives_ (predicting that human/unwatermarked text is watermarked) very unlikely
|
| 466 |
as a genuine human text with a significant number of tokens will almost never achieve
|
|
|
|
| 461 |
and as the bias becomes very large the watermark transitions from "soft" to "hard".
|
| 462 |
For a hard watermark, nearly all tokens are green, but this can have a detrimental effect on
|
| 463 |
generation quality, especially when there is not a lot of flexibility in the distribution.
|
| 464 |
+
|
| 465 |
+
#### Detector Parameters:
|
| 466 |
+
|
| 467 |
- z-score threshold : the z-score cuttoff for the hypothesis test. Higher thresholds (such as 4.0) make
|
| 468 |
_false positives_ (predicting that human/unwatermarked text is watermarked) very unlikely
|
| 469 |
as a genuine human text with a significant number of tokens will almost never achieve
|