Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

SPY Lab - ETH Zurich

https://spylab.ai
ethz-spylab
Activity Feed

AI & ML interests

Security, privacy, and trustworthiness of machine learning systems.

Recent Activity

nkristina  authored a paper about 2 months ago
Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM
dpaleka  authored a paper 6 months ago
Pitfalls in Evaluating Language Model Forecasters
jayzhang-ethz  updated a dataset 6 months ago
ethz-spylab/RealMath
View all activity

Daniel Paleka's profile picture Javier Rando's profile picture Edoardo Debenedetti's profile picture JayZhang's profile picture Michael Aerni's profile picture Thomas Baumann's profile picture Kristina Nikolic's profile picture

ethz-spylab 's models 32

ethz-spylab/rlhf-7b-harmless

Text Generation • 7B • Updated Feb 7, 2024 • 1

ethz-spylab/poisoned-reward-7b-SUDO-10

7B • Updated Feb 7, 2024 • 6
  • Previous
  • 1
  • 2
  • Next
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs