The official model weight for MetaCaptioner-8B.
MetaCaptioner
π Introduction
We used a data engine built with Capflow-72B to caption multi-source data. This data was then used to train Qwen3-8B, resulting in MetaCaptioner-8B. MetaCaptioner-8B demonstrates outstanding image description capabilities, excelling at generating comprehensive descriptions that incorporate visual perception and understanding. Furthermore, MetaCaptioner-8B outperforms InternVL3.5-8B-Instruct on multiple multimodal understanding and reasoning benchmarks.
π οΈ Usage
See more usage details in MetaCaptioner
- Downloads last month
- 7
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support