Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations Paper • 2510.23607 • Published 13 days ago • 172
Token Reduction Should Go Beyond Efficiency in Generative Models -- From Vision, Language to Multimodality Paper • 2505.18227 • Published May 23 • 15