OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models • Paper • 2511.14582 • Published 2 days ago • 14
MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation • Paper • 2511.09611 • Published 8 days ago • 61
P1: Mastering Physics Olympiads with Reinforcement Learning • Paper • 2511.13612 • Published 3 days ago • 122
Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data • Paper • 2511.12609 • Published 4 days ago • 94
Depth Anything 3: Recovering the Visual Space from Any Views • Paper • 2511.10647 • Published 7 days ago • 75
Reinforcement Learning Improves Traversal of Hierarchical Knowledge in LLMs • Paper • 2511.05933 • Published 13 days ago • 7
Music Flamingo: Scaling Music Understanding in Audio Language Models • Paper • 2511.10289 • Published 8 days ago • 9
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds • Paper • 2511.08892 • Published 9 days ago • 172
The Path Not Taken: RLVR Provably Learns Off the Principals • Paper • 2511.08567 • Published 9 days ago • 27
Long Grounded Thoughts: Distilling Compositional Visual Reasoning Chains at Scale • Paper • 2511.05705 • Published 13 days ago • 6
Shorter but not Worse: Frugal Reasoning via Easy Samples as Length Regularizers in Math RLVR • Paper • 2511.01937 • Published 18 days ago • 11
SAIL-RL: Guiding MLLMs in When and How to Think via Dual-Reward RL Tuning • Paper • 2511.02280 • Published 17 days ago • 3
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm • Paper • 2511.04570 • Published 14 days ago • 196
MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity • Paper • 2511.03146 • Published 16 days ago • 7
LiveTradeBench: Seeking Real-World Alpha with Large Language Models • Paper • 2511.03628 • Published 15 days ago • 11