[FG 2024] "Audio-Visual Person Verification based on Recursive Fusion of Joint Cross-Attention"
-
Updated
May 14, 2024 - Python
[FG 2024] "Audio-Visual Person Verification based on Recursive Fusion of Joint Cross-Attention"
Unified Audio-Visual Perception for Multi-Task Video Localization
Enhanced sound event localization and detection in real 360-degree audio-visual soundscapes (DCASE task3 format)
A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)
Audio-Visual Generalized Zero-Shot Learning using Large Pre-Trained Models
The official repository of the paper EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)
The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.
ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"
IEEE T-BIOM : "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention"
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
ABAW3 (CVPRW): A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition
Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".
[ECCV 2022] Official implementation of the paper: Audio-Visual Segmentation
FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition
Official implementation for CIGN
Co-Separating Sounds of Visual Objects (ICCV 2019)
[CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception
Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)
Related papers about Weakly-supervised Audio-Visual Video Parsing (AVVP) & Audio-Visual Event Localization (AVE)
Add a description, image, and links to the audio-visual-learning topic page so that developers can more easily learn about it.
To associate your repository with the audio-visual-learning topic, visit your repo's landing page and select "manage topics."