arxiv:2504.05979
Jianzong Wu PRO
jianzongwu
AI & ML interests
Multimodal Learning
Recent Activity
upvoted
a
paper
27 days ago
Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal
Evidence
upvoted
a
paper
29 days ago
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for
Multimodal LLMs
Organizations
None yet