P1: Mastering Physics Olympiads with Reinforcement Learning Paper • 2511.13612 • Published 1 day ago • 103
Depth Anything 3: Recovering the Visual Space from Any Views Paper • 2511.10647 • Published 5 days ago • 58
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models Paper • 2511.10629 • Published 5 days ago • 103
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation Paper • 2511.09057 • Published 7 days ago • 66
VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation Paper • 2510.14902 • Published Oct 16 • 13
OmniRetarget: Interaction-Preserving Data Generation for Humanoid Whole-Body Loco-Manipulation and Scene Interaction Paper • 2509.26633 • Published Sep 30 • 5
SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer Paper • 2509.24695 • Published Sep 29 • 44