Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

zhibinlan
/
UME-R1-2B

Image-Text-to-Text
Transformers
Safetensors
English
qwen2_vl
image-to-text
Sentence Similarity
Embedding
zero-shot-image-classification
video-text-to-text
conversational
text-generation-inference
Model card Files Files and versions
xet
Community
UME-R1-2B / figures
565 kB
  • 2 contributors
History: 1 commit
zhibinlan
update readme
e0110ca about 1 month ago
  • main_result.png
    271 kB
    xet
    update readme about 1 month ago
  • scaling.png
    295 kB
    xet
    update readme about 1 month ago