Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning Paper • 2511.14460 • Published 2 days ago • 15
Back to Basics: Let Denoising Generative Models Denoise Paper • 2511.13720 • Published 3 days ago • 26
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance Paper • 2511.13254 • Published 4 days ago • 117
Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data Paper • 2511.12609 • Published 4 days ago • 94
Depth Anything 3: Recovering the Visual Space from Any Views Paper • 2511.10647 • Published 7 days ago • 75
TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models Paper • 2511.13704 • Published 3 days ago • 40
WebVIA: A Web-based Vision-Language Agentic Framework for Interactive and Verifiable UI-to-Code Generation Paper • 2511.06251 • Published 12 days ago • 12
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models Paper • 2511.10629 • Published 7 days ago • 107
The Path Not Taken: RLVR Provably Learns Off the Principals Paper • 2511.08567 • Published 9 days ago • 27
Black-Box On-Policy Distillation of Large Language Models Paper • 2511.10643 • Published 7 days ago • 39
WMPO: World Model-based Policy Optimization for Vision-Language-Action Models Paper • 2511.09515 • Published 8 days ago • 15
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning Paper • 2511.06805 • Published 11 days ago • 11
DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation Paper • 2511.06307 • Published 12 days ago • 50
World Simulation with Video Foundation Models for Physical AI Paper • 2511.00062 • Published 23 days ago • 39
The Strong Lottery Ticket Hypothesis for Multi-Head Attention Mechanisms Paper • 2511.04217 • Published 15 days ago • 15
view reply yes, just stream it, it's a no brainer in your case which won't fill your disk at all just make sure your networking infra is fast enough