Prithiv Sakthi PRO

prithivMLmods

https://linktr.ee/prithivsakthi

AI & ML interests

computer vision, nlp, multimodality - HuggingFace Fellow🤗

Recent Activity

liked a Space 26 minutes ago

prithivMLmods/Qwen-Image-Edit-2509-LoRAs-Fast-Fusion

published a Space 26 minutes ago

prithivMLmods/Qwen-Image-Edit-2509-LoRAs-Fast-Fusion

updated a Space 29 minutes ago

prithivMLmods/Qwen-Image-Edit-2509-LoRAs-Fast

liked a Space 26 minutes ago

Qwen Image Edit 2509 LoRAs Fast Fusion

Demo of the Qwen Image Editing Fusion Collection

published a Space 26 minutes ago

Qwen Image Edit 2509 LoRAs Fast Fusion

Demo of the Qwen Image Editing Fusion Collection

updated a Space 29 minutes ago

Qwen Image Edit 2509 LoRAs Fast

Demo of the Collection of Qwen Image Editing LoRAs

updated a Space 35 minutes ago

Qwen Image Edit 2509 LoRAs Fast Fusion

Demo of the Qwen Image Editing Fusion Collection

New activity in prithivMLmods/Qwen-Image-Edit-2509-LoRAs-Fast about 3 hours ago

can this run on 5090 - im getting oom

#3 opened about 3 hours ago by

upvoted 2 papers about 23 hours ago

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published Oct 16 • 93

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published 5 days ago • 131

upvoted 2 articles about 23 hours ago

Text-to-image Architectural Experiments

Article • 6 days ago • 30

We’re open-sourcing our text-to-image model and the process behind it

Article • 7 days ago • 62

reacted to their post with 🚀❤️🤗🔥 about 23 hours ago

Post

1547

Made a demo Space for multimodal understanding with Qwen3-VL, covering tasks including point annotation, detection, captioning, guided text inference, and more. Find the demo link below. 🤗↗️

⮞ Space[Demo]: prithivMLmods/Qwen3-VL-HF-Demo
⮞ Model Used: Qwen/Qwen3-VL-4B-Instruct
⮞ Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
⮞ GitHub: https://github.com/PRITHIVSAKTHIUR/Qwen-3VL-Multimodal-Understanding

To learn more, visit the app page or the respective model page!

posted an update about 23 hours ago

updated 2 Spaces about 24 hours ago

Qwen3 VL HF Demo

object detection, visual grounding, keypoint detection

Qwen3 VL HF Demo

object detection, visual grounding, keypoint detection

updated a Space 1 day ago

Qwen Image Edit 2509 LoRAs Fast Fusion

Demo of the Qwen Image Editing Fusion Collection

liked a model 1 day ago

XenArcAI/SparkEmbedding-300m

Sentence Similarity • 0.3B • Updated 3 days ago • 91 • 6

updated a model 1 day ago

prithivMLmods/Gliese-4B-OSS-0410

Text Generation • 4B • Updated 1 day ago • 21 • 2

updated a Space 1 day ago

Qwen3 VL HF Demo

object detection, visual grounding, keypoint detection