tue-mps
/

ade_semantic_eomt_large_512_dinov3

Image Segmentation

Model card Files Files and versions

neikos00 commited on Sep 18

Commit

4384f0d

·

verified ·

1 Parent(s): 8ca1e29

Update README.md

Files changed (1) hide show

README.md +15 -1

README.md CHANGED Viewed

@@ -16,6 +16,20 @@ by Tommie Kerssies, Niccolò Cavagnero, Alexander Hermans, Narges Norouzi, Giuse
 > **Key Insight**: Given sufficient scale and pretraining, a plain ViT along with additional few params can perform segmentation without the need for task-specific decoders or pixel fusion modules. The same model backbone supports semantic, instance, and panoptic segmentation with different post-processing 🤗
-The original implementation can be found in this [repository](https://github.com/tue-mps/eomt)
 ---

 > **Key Insight**: Given sufficient scale and pretraining, a plain ViT along with additional few params can perform segmentation without the need for task-specific decoders or pixel fusion modules. The same model backbone supports semantic, instance, and panoptic segmentation with different post-processing 🤗
+The original implementation can be found in this [repository](https://github.com/tue-mps/eomt).
+The HuggingFace model page is available at this [link](https://huggingface.co/papers/2503.19108).
 ---
+## Citation
+If you find our work useful, please consider citing us as:
+```bibtex
+@inproceedings{kerssies2025eomt,
+  author    = {Kerssies, Tommie and Cavagnero, Niccolò and Hermans, Alexander and Norouzi, Narges and Averta, Giuseppe and Leibe, Bastian and Dubbelman, Gijs and de Geus, Daan},
+  title     = {Your ViT is Secretly an Image Segmentation Model},
+  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
+  year      = {2025},
+}
+```