view article Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix 16 days ago • 42
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents Paper • 2510.14967 • Published Oct 16 • 32
papluca/xlm-roberta-base-language-detection Text Classification • 0.3B • Updated Dec 28, 2023 • 1.33M • • 363
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 381