Ougrid Dumdang

Ougrid-D

ougrid

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

Generative AI for Recommendation Systems: A Guide to Tokenizing User Interaction Data

Article • Mar 26 • 6

upvoted a paper 12 days ago

ARGenSeg: Image Segmentation with Autoregressive Image Generation Model

Paper • 2510.20803 • Published 17 days ago • 9

upvoted a paper 13 days ago

Unified Reinforcement and Imitation Learning for Vision-Language Models

Paper • 2510.19307 • Published 18 days ago • 26

upvoted a paper 23 days ago

RAG-Anything: All-in-One RAG Framework

Paper • 2510.12323 • Published 26 days ago • 47

upvoted a paper about 2 months ago

LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence

Paper • 2509.12203 • Published Sep 15 • 19

liked a model 2 months ago

loolootech/no-name-ner-th

Token Classification • 0.3B • Updated Aug 20 • 39 • 5

upvoted 2 papers 3 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 237

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 255

liked a Space 3 months ago

Chat with Kimi-VL-A3B-Thinking-2506

Space • Chat with images, videos, or PDFs to generate text • 174

upvoted 2 papers 3 months ago

A Survey on Diffusion Language Models

Paper • 2508.10875 • Published Aug 14 • 34

Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents

Paper • 2508.05954 • Published Aug 8 • 6

liked 2 models 3 months ago

kpsss34/Stable-Diffusion-3.5-Small-Preview1

Text-to-Image • Updated Aug 13 • 955 • 38

Qwen/Qwen3-4B-Thinking-2507

Text Generation • 4B • Updated Aug 6 • 242k • 450

upvoted an article 3 months ago

Welcome GPT OSS, the new open-source model family from OpenAI!

Article • Aug 5 • 503

liked a model 3 months ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18 • 206k • 2.19k

upvoted 5 articles 4 months ago

TimeScope: How Long Can Your Video Large Multimodal Model Go?

Article • Jul 23 • 46

Five Big Improvements to Gradio MCP Servers

Article • Jul 17 • 24

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Article • Jul 9 • 705

Asynchronous Robot Inference: Decoupling Action Prediction and Execution

Article • Jul 10 • 44

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Article • Jun 3 • 277
