| license: apache-2.0 | |
| base_model: dhkim2810/MobileSAM | |
| tags: | |
| - mask-generation | |
| - vision.cpp | |
| pipeline_tag: image-segmentation | |
| # GGUF models for MobileSAM | |
| MobileSAM is a model for image segmentation. It generates object masks from | |
| point or box prompts. The weights in this repository are converted for | |
| lightweight inference on consumer hardware with | |
| [vision.cpp](https://github.com/Acly/vision.cpp). | |
| * Original repository: [ChaoningZhang/MobileSAM (Github)](https://github.com/ChaoningZhang/MobileSAM) | |
| * Original weights: [dhkim2810/MobileSAM (HuggingFace)](https://huggingface.co/dhkim2810/MobileSAM) | |
| ## Run | |
| Example inference with [vision.cpp](https://github.com/Acly/vision.cpp): | |
| ```sh | |
| vision-cli sam -m MobileSAM-F16.gguf -i input.png -p 256 480 -o mask.png --composite output.png | |
| ``` | |
| ## Models | |
| | Model | Description | | |
| | ---------------------------------------- | ---------------------------------------------------------------- | | |
| | [MobileSAM-F16.gguf](MobileSAM-F16.gguf) | Encoder + decoder, fused batch norm, NHWC memory layout, float16 | | |