OpinionGPT: Modelling Explicit Biases in Instruction-Tuned LLMs Paper • 2309.03876 • Published Sep 7, 2023 • 3
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity Paper • 2401.17072 • Published Jan 30, 2024 • 25
LM-PUB-QUIZ: A Comprehensive Framework for Zero-Shot Evaluation of Relational Knowledge in Language Models Paper • 2408.15729 • Published Aug 28, 2024 • 1
What Layers When: Learning to Skip Compute in LLMs with Residual Gates Paper • 2510.13876 • Published Oct 13 • 11
LM-PUB-QUIZ: A Comprehensive Framework for Zero-Shot Evaluation of Relational Knowledge in Language Models Paper • 2408.15729 • Published Aug 28, 2024 • 1
BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models Paper • 2404.04113 • Published Apr 5, 2024 • 3
BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models Paper • 2404.04113 • Published Apr 5, 2024 • 3