SPY Lab - ETH Zurich
https://spylab.ai
ethz-spylab
AI & ML interests
Security, privacy, and trustworthiness of machine learning systems.
Recent Activity
nkristina authored a paper about 2 months ago: Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM
dpaleka authored a paper 6 months ago: Pitfalls in Evaluating Language Model Forecasters
jayzhang-ethz updated a dataset 6 months ago: ethz-spylab/RealMath
Team members: 7
ethz-spylab's models: 32 (sorted by most recently updated)
ethz-spylab/rlhf-7b-harmless • Text Generation • 7B • Updated Feb 7, 2024 • 1
ethz-spylab/poisoned-reward-7b-SUDO-10 • 7B • Updated Feb 7, 2024 • 6