VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs Paper β’ 2406.07476 β’ Published Jun 11, 2024 β’ 37
DAMO-NLP-SG/VideoLLaMA2.1-7B-AV Visual Question Answering β’ 9B β’ Updated Oct 25, 2024 β’ 3.71k β’ 14
DAMO-NLP-SG/VideoLLaMA2-7B Visual Question Answering β’ 8B β’ Updated Aug 13, 2024 β’ 1.31k β’ 42
DAMO-NLP-SG/VideoLLaMA2-7B-16F Visual Question Answering β’ 8B β’ Updated Aug 13, 2024 β’ 35 β’ 15
DAMO-NLP-SG/VideoLLaMA2.1-7B-16F-Base Visual Question Answering β’ Updated Oct 21, 2024 β’ 21 β’ 1