OpenSafetyLab/MD-Judge-v0.1

Foreshhh committed on Mar 13, 2024

Commit 43efe12 · verified · 1 Parent(s): 4a9afb2

Update README.md

Files changed (1)

README.md +10 -9


@@ -17,6 +17,8 @@ tags:
 - mistral
 - salad-bench
 - evluation
 ---
 # MD-Judge for Salad-Bench
@@ -25,16 +27,16 @@ tags:
 MD-Judge is a LLM-based safetyguard, fine-tund on top of [Mistral-7B](https://huggingface.co/mistralai/Mistral-7B-v0.1). MD-Judge serves as a classifier to evaluate the safety of QA pairs.
-MD-Judge was born to study the safety of different LLMs serving as an general evaluation tool, which is proposed under the [SALAD-Bench paper](https://arxiv.org/abs/2402.02416)
-- **Developed by:** The SALAD-Bench Team
-- **Model type:** An auto-regressive language model based on the transformer architecture.
 ## Model Sources
-- **Repository:** [SALAD-Bench Github](https://github.com/OpenSafetyLab/SALAD-BENCH)
-- **Paper:** [SALAD-BENCH](https://arxiv.org/abs/2402.02416)
 ## Model Performance
 Compare our MD-Judge model with other methods on different public safety testsets using QA format. All the model-based methods are evaluated using the same safety proxy template.
@@ -122,5 +124,4 @@ Please refer to our [Github](https://github.com/OpenSafetyLab/SALAD-BENCH) for m
       archivePrefix={arXiv},
       primaryClass={cs.CL}
 }
-```

 - mistral
 - salad-bench
 - evluation
+- judge
+pipeline_tag: text-generation
 ---
 # MD-Judge for Salad-Bench
 MD-Judge is a LLM-based safetyguard, fine-tund on top of [Mistral-7B](https://huggingface.co/mistralai/Mistral-7B-v0.1). MD-Judge serves as a classifier to evaluate the safety of QA pairs.
+MD-Judge was born to study the safety of different LLMs serving as an general evaluation tool, which is proposed under the 🥗SALAD-Bench. You can check the following source for more information:
+- [**Paper**](https://arxiv.org/abs/2402.02416)
+- [**Code**](https://github.com/OpenSafetyLab/SALAD-BENCH)
+- [**Data**](https://huggingface.co/datasets/OpenSafetyLab/Salad-Data)
+- [**Project Page**](https://adwardlee.github.io/salad_bench/)
 ## Model Sources
+- **Repository:** [SALAD-Bench Github]()
+- **Paper:** [SALAD-BENCH]()
 ## Model Performance
 Compare our MD-Judge model with other methods on different public safety testsets using QA format. All the model-based methods are evaluated using the same safety proxy template.
       archivePrefix={arXiv},
       primaryClass={cs.CL}
 }
+```