[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Updated Jun 7, 2024 - Python
A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.
Video Foundation Models & Data for Multimodal Understanding
FreeVA: Offline MLLM as Training-Free Video Assistant
Official code for MiniGPT4-video
Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)
[NAACL 2024] Official Implementation of paper "Self-Adaptive Sampling for Efficient Video Question Answering on Image-Text Models"
Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023)
Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 2023)
MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)
[CVPR 2023 Highlight] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)
[TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
A PyTorch implementation of EmpiricalMVM
A PyTorch implementation of VIOLET
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)
[ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer
Part of my work for my Bachelor's Thesis Project on Counterfactual Reasoning for Videos.