Step-level Verifier-guided Hybrid Test-Time Scaling for Large Language Models • Paper • 2507.15512 • Published Jul 21 • 3
Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs • Paper • 2511.07003 • Published 9 days ago • 31
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning • Paper • 2509.07980 • Published Sep 9 • 99
Learning to Reason via Mixture-of-Thought for Logical Reasoning • Paper • 2505.15817 • Published May 21 • 18