A list of research papers on knowledge-enhanced multimodal learning
Updated Dec 8, 2022
[AAAI 2023] Hierarchical ConViT with Attention-based Relational Reasoner for Visual Analogical Reasoning
PyTorch implementation of the paper "A simple neural network module for relational reasoning", also known as Relational Networks, for visual reasoning.
Visual reasoning modular memory network
[ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers.
Reproducibility Challenge - The Neuro-Symbolic Concept Learner
Convert RGB images of Visual-Genome dataset to Depth Maps.
An unofficial implementation of Relational Network [A. Santoro et al., 2017] (PyTorch)
Multimodal Learning and Reasoning for Visual Question Answering
ACRE: Abstract Causal REasoning Beyond Covariation
An alternative EQA paradigm and informative benchmark + models (BMVC 2019, ViGIL 2019 spotlight)
Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning
Implementation of the VQA model from my MSc project
📄 A curated list of visual reasoning papers.
Source code for my honours thesis: "Graph Attention Networks for Compositional Visual Question Answering"
LaTeX files for my honours thesis: "Graph Attention Networks for Compositional Visual Question Answering"
[ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning
Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution
Learning Perceptual Inference by Contrasting