Update README.md
README.md
CHANGED
|
@@ -16,6 +16,20 @@ by Tommie Kerssies, Niccolò Cavagnero, Alexander Hermans, Narges Norouzi, Giuse
|
|
| 16 |
|
| 17 |
> **Key Insight**: Given sufficient scale and pretraining, a plain ViT with only a few additional parameters can perform segmentation without task-specific decoders or pixel fusion modules. The same model backbone supports semantic, instance, and panoptic segmentation with different post-processing 🤗
|
| 18 |
|
| 19 |
-
The original implementation can be found in this [repository](https://github.com/tue-mps/eomt)
|
| 20 |
|
| 21 |
---
|
| 16 |
|
| 17 |
> **Key Insight**: Given sufficient scale and pretraining, a plain ViT with only a few additional parameters can perform segmentation without task-specific decoders or pixel fusion modules. The same model backbone supports semantic, instance, and panoptic segmentation with different post-processing 🤗
|
| 18 |
|
| 19 |
+
The original implementation can be found in this [repository](https://github.com/tue-mps/eomt).
|
| 20 |
+
|
| 21 |
+
The Hugging Face paper page is available at this [link](https://huggingface.co/papers/2503.19108).
|
| 22 |
|
| 23 |
---
|
| 24 |
+
|
| 25 |
+
## Citation
|
| 26 |
+
If you find our work useful, please consider citing us as:
|
| 27 |
+
```bibtex
|
| 28 |
+
@inproceedings{kerssies2025eomt,
|
| 29 |
+
author = {Kerssies, Tommie and Cavagnero, Niccolò and Hermans, Alexander and Norouzi, Narges and Averta, Giuseppe and Leibe, Bastian and Dubbelman, Gijs and de Geus, Daan},
|
| 30 |
+
title = {Your ViT is Secretly an Image Segmentation Model},
|
| 31 |
+
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
|
| 32 |
+
year = {2025},
|
| 33 |
+
}
|
| 34 |
+
```
|
| 35 |
+
|