SPY Lab - ETH Zurich

https://spylab.ai

ethz-spylab

AI & ML interests

Security, privacy, and trustworthiness of machine learning systems.

Recent Activity

nkristina authored a paper about 2 months ago

Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM

dpaleka authored a paper 6 months ago

Pitfalls in Evaluating Language Model Forecasters

jayzhang-ethz updated a dataset 6 months ago

ethz-spylab/RealMath

View all activity

ethz-spylab 's models 32

ethz-spylab/rlhf-7b-harmless

Text Generation • 7B • Updated Feb 7, 2024 • 1

ethz-spylab/poisoned-reward-7b-SUDO-10

7B • Updated Feb 7, 2024 • 6