FlagEval

non-profit
https://flageval.baai.ac.cn/
Recent Activity

philokey  updated a dataset 4 days ago
FlagEval/coco_val2014_sampled
philokey  authored a paper 7 days ago
Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench
philokey  updated a dataset 8 days ago
FlagEval/MeasureBench

Team members: Richeng Xuan, Xuannan Liu, llvvvv, Sherlock, Gray, makarov, Zheqi He, jingshu, daiteng01, lixuejing, HelloGitHub

spaces 2

FlagEval-Arena 🐢

Running • 6 • Arena • Mar 18

FlagEval-Debate 🐠

Running • 12 • Display a debate interface • Mar 17

models 1

FlagEval/flageval_judgemodel

Text Generation • 33B • Updated Dec 30, 2024 • 1 • 1

datasets 12

FlagEval/coco_val2014_sampled

Viewer • Updated 4 days ago • 1k • 39

FlagEval/MeasureBench

Viewer • Updated 8 days ago • 2.44k • 117

FlagEval/EmbodiedVerse-Bench

Viewer • Updated Jun 25 • 2.04k • 215

FlagEval/Where2Place

Viewer • Updated May 29 • 100 • 208

FlagEval/SAT

Viewer • Updated May 6 • 150 • 49

FlagEval/HMMT_2025

Viewer • Updated May 6 • 30 • 46

FlagEval/ERQA

Viewer • Updated Apr 22 • 400 • 364 • 2

FlagEval/sub_spatial

Viewer • Updated Apr 21 • 690 • 71

FlagEval/EmbSpatial-Bench

Viewer • Updated Apr 21 • 3.64k • 170 • 2

FlagEval/documentation-images

Viewer • Updated Nov 13, 2024 • 3 • 201