Antoine Yang's picture

1

Antoine Yang

antoineyang

·

https://antoyang.github.io/

AI & ML interests

Vision and language, video understanding

Organizations

None yet

authored a paper 7 months ago

Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs

Paper • 2504.00072 • Published Mar 31 • 6

authored 9 papers about 2 years ago

CoVR: Learning Composed Video Retrieval from Web Video Captions

Paper • 2308.14746 • Published Aug 28, 2023 • 2

VidChapters-7M: Video Chapters at Scale

Paper • 2309.13952 • Published Sep 25, 2023 • 11

Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning

Paper • 2302.14115 • Published Feb 27, 2023

Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

Paper • 2206.08155 • Published Jun 16, 2022

TubeDETR: Spatio-Temporal Video Grounding with Transformers

Paper • 2203.16434 • Published Mar 30, 2022

Learning to Answer Visual Questions from Web Videos

Paper • 2205.05019 • Published May 10, 2022

MANAS: Multi-Agent Neural Architecture Search

Paper • 1909.01051 • Published Sep 3, 2019

Just Ask: Learning to Answer Questions from Millions of Narrated Videos

Paper • 2012.00451 • Published Dec 1, 2020

NAS evaluation is frustratingly hard

Paper • 1912.12522 • Published Dec 28, 2019