MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs Paper • 2511.07250 • Published 9 days ago • 17
Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation Paper • 2510.24821 • Published 22 days ago • 34
MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues Paper • 2510.17722 • Published about 1 month ago • 19
IF-VidCap: Can Video Caption Models Follow Instructions? Paper • 2510.18726 • Published 29 days ago • 24
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs Paper • 2510.18876 • Published 29 days ago • 35
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM Paper • 2510.15870 • Published Oct 17 • 87
COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes Paper • 2510.14763 • Published Oct 16 • 13
VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning Paper • 2510.10518 • Published Oct 12 • 17
BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions Paper • 2510.10666 • Published Oct 12 • 27
ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems Paper • 2510.11652 • Published Oct 13 • 28
ReLook: Vision-Grounded RL with a Multimodal LLM Critic for Agentic Web Coding Paper • 2510.11498 • Published Oct 13 • 10