passing2961's Collections

Multi-Turn Evaluation Benchmarks

updated 15 days ago

A collection of benchmarks for evaluating language models (LMs) and vision-language models (VLMs) in multi-turn interactions.

  • passing2961/MultiVerse

    Viewer • Updated 26 days ago • 647 • 123 • 1

  • passing2961/photochat_plus

    Viewer • Updated Dec 3, 2024 • 968 • 38 • 4

  • RefineBench/RefineBench

    Viewer • Updated 23 days ago • 1k • 258 • 2