Cedric Perauer's picture

45 9

Cedric Perauer

CedricPerauer

·

AI & ML interests

2D/3D Vision

Recent Activity

liked a model 1 day ago

black-forest-labs/FLUX.2-dev

liked a dataset 4 days ago

p-doom/openai-minecraft-dataset

upvoted a paper 5 days ago

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

View all activity

Organizations

None yet

upvoted a paper 5 days ago

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published 8 days ago • 205

upvoted 3 papers 10 days ago

Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising

Paper • 2511.08633 • Published 17 days ago • 53

PAN: A World Model for General, Interactable, and Long-Horizon World Simulation

Paper • 2511.09057 • Published 15 days ago • 71

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published 15 days ago • 181

upvoted a paper 30 days ago

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published about 1 month ago • 172

upvoted 4 papers about 1 month ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13 • 162

DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training

Paper • 2510.11712 • Published Oct 13 • 30

FlashWorld: High-quality 3D Scene Generation within Seconds

Paper • 2510.13678 • Published Oct 15 • 70

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Paper • 2510.08673 • Published Oct 9 • 122

upvoted 2 papers about 2 months ago

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2 • 93

DA^2: Depth Anything in Any Direction

Paper • 2509.26618 • Published Sep 30 • 25

upvoted 5 papers 2 months ago

VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models

Paper • 2509.17985 • Published Sep 22 • 25

Hyper-Bagel: A Unified Acceleration Framework for Multimodal Understanding and Generation

Paper • 2509.18824 • Published Sep 23 • 22

VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space

Paper • 2508.19247 • Published Aug 26 • 41

SPATIALGEN: Layout-guided 3D Indoor Scene Generation

Paper • 2509.14981 • Published Sep 18 • 27

InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis

Paper • 2509.10441 • Published Sep 12 • 30

upvoted 4 papers 3 months ago

LuxDiT: Lighting Estimation with Video Diffusion Transformer

Paper • 2509.03680 • Published Sep 3 • 16

From Editor to Dense Geometry Estimator

Paper • 2509.04338 • Published Sep 4 • 91

DINOv3

Paper • 2508.10104 • Published Aug 13 • 283

Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models

Paper • 2508.09968 • Published Aug 13 • 15