nvidia/Nemotron-RL-instruction_following-structured_outputs Viewer • Updated 5 days ago • 9.44k • 219 • 11
HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages Paper • 2505.11475 • Published May 16 • 4
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy Paper • 2507.01352 • Published Jul 2 • 55
Running 125 Open-LLM performances are plateauing, let’s make the leaderboard steep again 🏔 125 Explore and compare advanced language models on a new leaderboard