Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Haoji Zhang's picture
2 1 4

Haoji Zhang

zhang9302002
21world's profile picture LighterDarkness's profile picture
·
https://zhang9302002.github.io/
  • zhang9302002

AI & ML interests

None yet

Recent Activity

authored a paper 15 days ago
Ponder & Press: Advancing Visual GUI Agent towards General Computer Control
authored a paper 15 days ago
UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning
authored a paper 15 days ago
Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning
View all activity

Organizations

AGI Workshop @ Tsinghua's profile picture

authored 4 papers 15 days ago

Ponder & Press: Advancing Visual GUI Agent towards General Computer Control

Paper • 2412.01268 • Published Dec 2, 2024 • 1

UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning

Paper • 2505.14231 • Published May 20 • 52

Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning

Paper • 2508.04416 • Published Aug 6 • 1

Flash-VStream: Efficient Real-Time Understanding for Long Video Streams

Paper • 2506.23825 • Published Jun 30
authored a paper over 1 year ago

Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams

Paper • 2406.08085 • Published Jun 12, 2024 • 17
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs