Abstract
Video generation models implicitly use the first frame as a conceptual memory buffer, enabling robust video content customization with minimal training examples.
What role does the first frame play in video generation models? Traditionally, it is viewed as the spatio-temporal starting point of a video, merely a seed for subsequent animation. In this work, we reveal a fundamentally different perspective: video models implicitly treat the first frame as a conceptual memory buffer that stores visual entities for later reuse during generation. Leveraging this insight, we show that robust and generalized video content customization can be achieved in diverse scenarios using only 20-50 training examples, without architectural changes or large-scale finetuning. This highlights a powerful, previously overlooked capability of video generation models for reference-based video customization.
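To make the "first frame as a conceptual memory buffer" idea concrete, the sketch below composites a set of reference entity images into a single first frame and conditions an off-the-shelf image-to-video pipeline on it. This is only an illustration of the underlying intuition, not the paper's training or finetuning recipe; the pipeline choice (Stable Video Diffusion via diffusers), the grid layout helper, and the file paths are assumptions.

```python
# Illustrative sketch only: pack reference entities into the first frame
# ("memory buffer") and condition an image-to-video model on it.
# The pipeline and layout are assumptions, not the paper's actual method.
import torch
from PIL import Image
from diffusers import StableVideoDiffusionPipeline
from diffusers.utils import export_to_video


def build_first_frame(reference_paths, size=(1024, 576)):
    """Tile reference entity images side by side into one conditioning frame."""
    canvas = Image.new("RGB", size, "white")
    cell_w = size[0] // len(reference_paths)
    for i, path in enumerate(reference_paths):
        ref = Image.open(path).convert("RGB").resize((cell_w, size[1]))
        canvas.paste(ref, (i * cell_w, 0))
    return canvas


# Hypothetical reference images of the entities to be reused in the video.
first_frame = build_first_frame(["subject.png", "background.png"])

pipe = StableVideoDiffusionPipeline.from_pretrained(
    "stabilityai/stable-video-diffusion-img2vid-xt",
    torch_dtype=torch.float16,
    variant="fp16",
).to("cuda")

# The model animates from the composed first frame; the hypothesis is that
# entities "stored" in it can be reused in later frames of the generation.
frames = pipe(first_frame, decode_chunk_size=8).frames[0]
export_to_video(frames, "customized.mp4", fps=7)
```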
Community
GitHub: https://github.com/zli12321/FFGO-Video-Customization
Project Page: http://firstframego.github.io
The following papers were recommended by the Semantic Scholar API:
- Video-As-Prompt: Unified Semantic Control for Video Generation (2025)
- ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation (2025)
- Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset (2025)
- Video Text Preservation with Synthetic Text-Rich Videos (2025)
- TempoMaster: Efficient Long Video Generation via Next-Frame-Rate Prediction (2025)
- ID-Composer: Multi-Subject Video Synthesis with Hierarchical Identity Preservation (2025)
- Virtually Being: Customizing Camera-Controllable Video Diffusion Models with Multi-View Performance Captures (2025)