DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance Paper • 2503.03689 • Published Mar 5
Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception Paper • 2503.13587 • Published Mar 17
MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving Paper • 2504.00379 • Published Apr 1
The Role of World Models in Shaping Autonomous Driving: A Comprehensive Survey Paper • 2502.10498 • Published Feb 14
Learning Multiple Probabilistic Decisions from Latent World Model in Autonomous Driving Paper • 2409.15730 • Published Sep 24, 2024
Revisiting MLLMs: An In-Depth Analysis of Image Classification Abilities Paper • 2412.16418 • Published Dec 21, 2024
NeRF-DetS: Enhanced Adaptive Spatial-wise Sampling and View-wise Fusion Strategies for NeRF-based Indoor Multi-view 3D Object Detection Paper • 2404.13921 • Published Apr 22, 2024
Vision Remember: Alleviating Visual Forgetting in Efficient MLLM with Vision Feature Resample Paper • 2506.03928 • Published Jun 4
DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model Paper • 2310.07771 • Published Oct 11, 2023
Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting Paper • 2501.18672 • Published Jan 30
DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment Paper • 2504.18576 • Published Apr 22
U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration Paper • 2507.04503 • Published Jul 6
BEVWorld: A Multimodal World Simulator for Autonomous Driving via Scene-Level BEV Latents Paper • 2407.05679 • Published Jul 8, 2024