FrVD: French Video Description dataset
-
Updated
Jun 22, 2023
FrVD: French Video Description dataset
Leveraging Self-Supervised Training for Unintentional Action Recognition (ECCVW 2022)
Pre-trained and Reproduced Deep Learning Models (『飞桨』官方模型库,包含多种学术前沿和工业场景验证的深度学习模型)
Code for the Paper: Quasi-Online Detection of Take and Release Actions from Egocentric Videos. International Conference on Image Analysis and Processing 2023.
Tool employed to visualize synchronized FrVD metadata and videos simultaneously.
Undergraduate Thesis @ Department of Automation, Tsinghua -- Understanding Few-shot Video with Pretrained Image-Text Models
[IJCNN 2024] Unifying Global and Local Scene Entities Modelling for Precise Action Spotting
The code for 3DTDS-Net with Pytorch
Video understanding with C3D
The code for FSTA-Net with Pytorch
The code for L3AM loss with Pytorch
[ICCV 2021] On the hidden treasure of dialog in video question answering
📚 Paper Notes (Computer vision)
Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation""
[CVPR 2018] Non-local Neural Networks
Source code for "Visually aligned sound generation via sound-producing motion parsing" (Published at Neurocomputing)
The code for PB-Net with Pytorch
Temporal Context Awareness for Cluster-based Video Summarization
Add a description, image, and links to the video-understanding topic page so that developers can more easily learn about it.
To associate your repository with the video-understanding topic, visit your repo's landing page and select "manage topics."