Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
11
12
16
LLM-Leaderboard
StarscreamDeceptions
Follow
thomwolf's profile picture
binwang's profile picture
21world's profile picture
4 followers
ยท
1 following
AI & ML interests
None yet
Recent Activity
liked
a dataset
8 days ago
LLM-Tuning-Safety/HEx-PHI
upvoted
a
paper
about 1 month ago
DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking
upvoted
a
paper
about 1 month ago
HSCodeComp: A Realistic and Expert-level Benchmark for Deep Search Agents in Hierarchical Rule Application
View all activity
Organizations
StarscreamDeceptions
's Spaces
1
Sort:ย Recently updated
pinned
Running
22
๐ Multilingual MMLU Benchmark Leaderboard
๐
View and submit LLM benchmarks