Haokun Lin's picture

3 16 9

Haokun Lin

Felix1023

·

https://felixmessi.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper 13 days ago

LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Image and Video Generation

upvoted a paper 13 days ago

LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Image and Video Generation

upvoted a paper 13 days ago

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

View all activity

Organizations

upvoted 2 papers 13 days ago

LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Image and Video Generation

Paper • 2508.03485 • Published Aug 5 • 2

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Paper • 2510.19871 • Published 18 days ago • 29

upvoted a collection 14 days ago

Video-As-Prompt

The model zoo for "Video-As-Prompt: Unified Semantic Control for Video Generation" • 3 items • Updated 14 days ago • 11

upvoted a paper 14 days ago

Video-As-Prompt: Unified Semantic Control for Video Generation

Paper • 2510.20888 • Published 17 days ago • 44

upvoted a paper about 2 months ago

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Paper • 2509.08519 • Published Sep 10 • 127

upvoted a paper 2 months ago

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Paper • 2508.20088 • Published Aug 27 • 20

upvoted 3 papers 3 months ago

Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs

Paper • 2508.14896 • Published Aug 20 • 22

ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing

Paper • 2508.10881 • Published Aug 14 • 52

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts

Paper • 2507.20939 • Published Jul 28 • 56

upvoted a paper 4 months ago

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2 • 130

upvoted 6 papers 5 months ago

GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning

Paper • 2506.16141 • Published Jun 19 • 27

PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models

Paper • 2506.16054 • Published Jun 19 • 60

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Paper • 2506.01713 • Published Jun 2 • 48

Through the Valley: Path to Effective Long CoT Training for Small Language Models

Paper • 2506.07712 • Published Jun 9 • 18

Aligning Latent Spaces with Flow Priors

Paper • 2506.05240 • Published Jun 5 • 27

TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

Paper • 2505.05422 • Published May 8 • 8