Global PIQA A physical commonsense reasoning benchmark for 100+ languages, written in collaboration with 300+ researchers from 65 countries. Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published 18 days ago • 10 mrlbenchmarks/global-piqa-nonparallel Viewer • Updated 17 days ago • 11.6k • 3.05k • 22
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published 18 days ago • 10
Multilingual Leaderboards Leaderboards for languages other than English Running on CPU Upgrade 74 74 La Leaderboard 🌸 Evaluate open LLMs in the languages of LATAM and Spain. Running on CPU Upgrade 122 122 Open Chinese LLM Leaderboard 🏆 Explore and submit LLM benchmarks Running on CPU Upgrade 169 169 Open Arabic LLM Leaderboard 🏆 Track, rank and evaluate open Arabic LLMs and chatbots Running 40 40 OpenLLM French leaderboard 🇫🇷 🥇 Explore and submit LLM benchmarks
Running on CPU Upgrade 74 74 La Leaderboard 🌸 Evaluate open LLMs in the languages of LATAM and Spain.
Running on CPU Upgrade 169 169 Open Arabic LLM Leaderboard 🏆 Track, rank and evaluate open Arabic LLMs and chatbots
Global PIQA A physical commonsense reasoning benchmark for 100+ languages, written in collaboration with 300+ researchers from 65 countries. Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published 18 days ago • 10 mrlbenchmarks/global-piqa-nonparallel Viewer • Updated 17 days ago • 11.6k • 3.05k • 22
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published 18 days ago • 10
Multilingual Leaderboards Leaderboards for languages other than English Running on CPU Upgrade 74 74 La Leaderboard 🌸 Evaluate open LLMs in the languages of LATAM and Spain. Running on CPU Upgrade 122 122 Open Chinese LLM Leaderboard 🏆 Explore and submit LLM benchmarks Running on CPU Upgrade 169 169 Open Arabic LLM Leaderboard 🏆 Track, rank and evaluate open Arabic LLMs and chatbots Running 40 40 OpenLLM French leaderboard 🇫🇷 🥇 Explore and submit LLM benchmarks
Running on CPU Upgrade 74 74 La Leaderboard 🌸 Evaluate open LLMs in the languages of LATAM and Spain.
Running on CPU Upgrade 169 169 Open Arabic LLM Leaderboard 🏆 Track, rank and evaluate open Arabic LLMs and chatbots