Li

Shalfunnn

AI & ML interests

3DV&more

authored 8 papers 21 days ago

DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance

Paper • 2503.03689 • Published Mar 5

Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception

Paper • 2503.13587 • Published Mar 17

MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving

Paper • 2504.00379 • Published Apr 1

The Role of World Models in Shaping Autonomous Driving: A Comprehensive Survey

Paper • 2502.10498 • Published Feb 14

Learning Multiple Probabilistic Decisions from Latent World Model in Autonomous Driving

Paper • 2409.15730 • Published Sep 24, 2024

Revisiting MLLMs: An In-Depth Analysis of Image Classification Abilities

Paper • 2412.16418 • Published Dec 21, 2024

NeRF-DetS: Enhanced Adaptive Spatial-wise Sampling and View-wise Fusion Strategies for NeRF-based Indoor Multi-view 3D Object Detection

Paper • 2404.13921 • Published Apr 22, 2024

Vision Remember: Alleviating Visual Forgetting in Efficient MLLM with Vision Feature Resample

Paper • 2506.03928 • Published Jun 4

authored 7 papers 22 days ago

DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model

Paper • 2310.07771 • Published Oct 11, 2023

Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting

Paper • 2501.18672 • Published Jan 30

Igniting VLMs toward the Embodied Space

Paper • 2509.11766 • Published Sep 15

DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment

Paper • 2504.18576 • Published Apr 22

U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration

Paper • 2507.04503 • Published Jul 6

BEVWorld: A Multimodal World Simulator for Autonomous Driving via Scene-Level BEV Latents

Paper • 2407.05679 • Published Jul 8, 2024
