Video captioning using SCN-LSTM models with S2VT baseline
Updated Mar 15, 2023 - Python
An encoder-decoder deep learning model (with or without an attention mechanism) that takes an Arabic sign-language video as input and outputs its translation as text.
This project automatically generates contextually relevant captions for videos by extracting spatial and temporal features and applying Gaussian attention to focus on important regions. The captions support video indexing, retrieval, and accessibility for visually impaired individuals.
MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian (Bahasa Indonesia).
Data collection and automatic labeling for dense video captioning models
🔍 Shotluck Holmes: A family of small-scale LLVMs for shot-level video understanding
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
Multimodal Video Captioning project for the Natural Language Processing course at Tsinghua University, spring 2021
Master Thesis on Multimodal Video Captioning, done at Huawei's Research Center in Amsterdam.
Video Captioning using Scene Change Detection and Image Captioning
Video captioning | Video2Description
Upload a video and have captions generated automatically. From api.video (https://api.video)
(PRCV'2022) CLIP Meets Video Captioning: Concept-Aware Representation Learning Does Matter
(TIP) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information
Visio Text is a real-time video captioning project that leverages the capabilities of artificial intelligence to provide dynamic text captions for videos.
Video captioning with LSTM RNN and Transformer networks on MSVD and MSR-VTT using attributes and SVOS
Second-place solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2022 workshop)
AI-based Video summarizer along with captioning.
[Asilomar 2022] Contextual Explainable Video Representation: Human Perception-based Understanding
AI-based Video summarizer along with captioning.