hjkim

hojie11

hojie11

AI & ML interests

Computer Vision, 3D Vision, Anomaly Detection

Recent Activity

upvoted a paper 4 days ago

SAM 3D: 3Dfy Anything in Images

upvoted a paper 4 days ago

First Frame Is the Place to Go for Video Content Customization

upvoted a paper 6 days ago

SAM 2: Segment Anything in Images and Videos

View all activity

Organizations

None yet

upvoted 2 papers 4 days ago

SAM 3D: 3Dfy Anything in Images

Paper • 2511.16624 • Published 5 days ago • 89

First Frame Is the Place to Go for Video Content Customization

Paper • 2511.15700 • Published 6 days ago • 50

upvoted 2 papers 6 days ago

SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1, 2024 • 119

A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space

Paper • 2511.10555 • Published 12 days ago • 56

upvoted a paper 7 days ago

MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation

Paper • 2511.09611 • Published 13 days ago • 64

upvoted 2 papers 9 days ago

One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models

Paper • 2511.10629 • Published 12 days ago • 111

Depth Anything 3: Recovering the Visual Space from Any Views

Paper • 2511.10647 • Published 12 days ago • 84

upvoted 2 papers 12 days ago

Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising

Paper • 2511.08633 • Published 16 days ago • 52

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published 13 days ago • 100

upvoted a paper 13 days ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published 16 days ago • 117

upvoted 2 papers 15 days ago

Visual Spatial Tuning

Paper • 2511.05491 • Published 18 days ago • 49

V-Thinker: Interactive Thinking with Images

Paper • 2511.04460 • Published 19 days ago • 95

upvoted a paper 16 days ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published 20 days ago • 77

upvoted a paper 18 days ago

NVIDIA Nemotron Nano V2 VL

Paper • 2511.03929 • Published 20 days ago • 26

upvoted 6 papers 19 days ago

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published Oct 16 • 95

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5 • 120

hjkim

AI & ML interests

Recent Activity

Organizations

hojie11's activity