MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs • arXiv 2511.07250 • Published Nov 2025
Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation • arXiv 2510.24821 • Published Oct 2025
MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues • arXiv 2510.17722 • Published Oct 20, 2025
IF-VidCap: Can Video Caption Models Follow Instructions? • arXiv 2510.18726 • Published Oct 2025
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs • arXiv 2510.18876 • Published Oct 2025
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM • arXiv 2510.15870 • Published Oct 17, 2025
COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes • arXiv 2510.14763 • Published Oct 16, 2025
VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning • arXiv 2510.10518 • Published Oct 12, 2025
BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions • arXiv 2510.10666 • Published Oct 12, 2025
ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems • arXiv 2510.11652 • Published Oct 13, 2025
ReLook: Vision-Grounded RL with a Multimodal LLM Critic for Agentic Web Coding • arXiv 2510.11498 • Published Oct 13, 2025
OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs • arXiv 2510.10689 • Published Oct 12, 2025
StreamingVLM: Real-Time Understanding for Infinite Video Streams • arXiv 2510.09608 • Published Oct 10, 2025
R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth? • arXiv 2510.08189 • Published Oct 9, 2025
UniVideo: Unified Understanding, Generation, and Editing for Videos • arXiv 2510.08377 • Published Oct 9, 2025
Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution • arXiv 2509.25301 • Published Sep 29, 2025