Runsen Xu

RunsenXu

https://runsenxu.com/

AI & ML interests

Large Language Models, Multi-modal Learning, 3D Perception and Understanding, Self-supervised Learning

Recent Activity

updated a dataset 28 days ago

RunsenXu/MMSI-Bench

upvoted a paper about 2 months ago

Seedream 4.0: Toward Next-generation Multimodal Image Generation

updated a dataset 2 months ago

RunsenXu/MyDatasets

Organizations

None yet

upvoted a paper about 2 months ago

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24 • 77

upvoted a paper 2 months ago

SpatialVID: A Large-Scale Video Dataset with Spatial Annotations

Paper • 2509.09676 • Published Sep 11 • 31

upvoted 3 papers 4 months ago

ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

Paper • 2507.22827 • Published Jul 30 • 98

OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

Paper • 2507.07984 • Published Jul 10 • 42

StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling

Paper • 2507.05240 • Published Jul 7 • 47

upvoted 2 papers 6 months ago

MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

Paper • 2505.23764 • Published May 29 • 3

Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models

Paper • 2505.17015 • Published May 22 • 9

upvoted a paper about 1 year ago

PointLLM: Empowering Large Language Models to Understand Point Clouds

Paper • 2308.16911 • Published Aug 31, 2023 • 1