Xiaohan Wang
nicholswang
AI & ML interests
Video Understanding, Vision-Language Models
Recent Activity
authored a paper 18 days ago
Closing the Modality Gap for Mixed Modality Search
authored a paper 18 days ago
SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models
authored a paper 18 days ago
FineVision: Open Data Is All You Need