CoVT Checkpoint (Segmentation, Depth, DINO, and Edge Aligned)

Model Description

This CoVT checkpoint is aligned with 8 Segmentation tokens, 4 Depth tokens, 4 DINO tokens, and 4 Edge tokens.
These task-specific tokens are integrated into the model’s embedding space to enhance 2D-awareness, 3D-awareness, patch-level feature representations, and structure-awareness.

Downloads last month
28
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including Wakals/CoVT-7B-seg_depth_dino_edge