LLM-Leaderboard
StarscreamDeceptions
4 followers · 1 following
AI & ML interests
None yet
Recent Activity
Liked a dataset 5 days ago: LLM-Tuning-Safety/HEx-PHI
Upvoted a paper 28 days ago: DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking
Upvoted a paper 28 days ago: HSCodeComp: A Realistic and Expert-level Benchmark for Deep Search Agents in Hierarchical Rule Application
Organizations
spaces
1
pinned
Running
22
Multilingual MMLU Benchmark Leaderboard
View and submit LLM benchmarks
models
0
None public yet
datasets
2
StarscreamDeceptions/results • Viewer • Updated Nov 13, 2024 • 17 • 17
StarscreamDeceptions/requests • Preview • Updated Nov 13, 2024 • 34