RiddleBench: A New Generative Reasoning Benchmark for LLMs • Paper 2510.24932 • Published 22 days ago