#

multimodal-deep-learning

Here are 349 public repositories matching this topic...

SRM-IST-KTR / disturbance-detection-in-messaging-apps-using-machine-learning-e5d7h9m7

A Fully Deployable React-Native mobile app that seeks to classify incoming messages in messaging apps into important or disturbing categories. using a Multi-Modal Machine Learning Architecture to achieve Text classification, Image classification and YouTube Video Link classification.

graphql react-native aws-amplify multimodal-deep-learning

Updated May 10, 2022
Jupyter Notebook

ishitab1310 / HateFilter

Analyzing Hateful Memes/ (Resources:- Hateful Memes Challenge)

multimodal-deep-learning hateful-memes-challenge

Updated Feb 18, 2024
Jupyter Notebook

macabdul9 / torchmm

PyTorch Data loaders and abstraction for multi-modal data.

python natural-language-processing computer-vision pytorch speech-processing multimodal-deep-learning

Updated Dec 27, 2022
Jupyter Notebook

yubin1219 / deep_learning_music

Deep Learning for Music & Audio - Multi modal project

music image-generation audio-processing multimodal-deep-learning

Updated May 16, 2022
Jupyter Notebook

SheezaShabbir / Multimodel_huggingFace-Swin-Transformer

A multimodal that uses both text and Images to tells what will be the expected emotion of the viewer of the news.

python pytorch transformer classification multimodal-deep-learning huggingface-transformers

Updated Aug 12, 2022
Jupyter Notebook

Etienne-bobo / Skimlit-Nlp

The purpose of this project is to build an NLP model to make reading medical abtracts easier.

nlp keras-tensorflow multimodal-deep-learning tensorflow2

Updated Aug 20, 2023
Jupyter Notebook

Yuan-ManX / ai-multimodal-timeline

Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Audio, Image, Video, Music and 3D content. 🔥

ai multi-modal deeplearning-ai multimodal multimodal-deep-learning llm

Updated May 29, 2024

talipucar / talipucar.github.io_old

Showcases ongoing, and completed projects within various research themes.

domain-adaptation self-supervised multimodal-learning multimodal-deep-learning self-supervised-learning domain-translation

Updated Dec 28, 2022

MishraCo / Forecasting-of-Solar-Irradiance

Leveraging Meteorological data and All-Sky Images to create a multimodal model for better forecasting of Solar Irradiance parameters.

transfer-learning multimodal-deep-learning timeseries-forecasting

Updated Dec 24, 2023
Jupyter Notebook

Danesed / Ducho

Accepted at The Web Conference 2024.

deep-learning artificial-intelligence feature-extraction recommender-system multimodal multimodal-deep-learning

Updated Feb 6, 2024
Python

multilearning / multilearning.github.io

Multi-learning

multilingual multitasking multimodality multimodal-learning multimodal multimodal-deep-learning multitask multitask-learning multitask-classification

Updated Sep 11, 2021

mobled37 / utils

Deeplearning utils for multimodal research

finetuning multimodal-deep-learning

Updated Jul 28, 2023
Python

a-tabaza / binding_music

Code and Models for Binding Text, Images, Graphs, and Audio for Music Representation Learning

music-information-retrieval multimodal-deep-learning joint-embedding

Updated May 18, 2024
Python

ShowMeModel / transformers-multimodal-example

Example of a multimodal (end-to-end) deep learning model with transformers architecture

deep-learning transformers end-to-end-machine-learning multimodal-deep-learning

Updated Feb 5, 2023

vijayvee / text-to-image-synthesis

Project to transform a natural language description into an image using Generative Adversarial Networks.

generative-adversarial-networks text-to-image multimodal-deep-learning

Updated Dec 9, 2017
Python

kritika-gupta / multi-modal-music-genre-classification

Final project for CS 7643 : Deep Learning (Fall 2022, Georgia Tech)

music deep-learning convolutional-neural-networks bert multimodal-deep-learning

Updated Jan 9, 2023
Jupyter Notebook

isevr / TVEmotion

A novel multimodal approach for emotion recognition deploying early fusion based on graph-captured embeddings

graph-convolutional-networks emotion-recognition multimodal-deep-learning

Updated Jan 3, 2024
Jupyter Notebook

marcomoldovan / cross-modal-speech-segment-retrieval

Learning a common representation space from speech and text for cross-modal retrieval given textual queries and speech files.

natural-language-processing deep-learning speech transformer speech-recognition spoken-language-understanding multimodal-deep-learning self-supervised-learning

Updated Apr 27, 2023
Python

deeplsd / Syncnet_Analysis

This code is part of the paper: "A Deep Dive Into Neural Synchrony Evaluation for Audio-visual Translation" published at ACM ICMI 2022.

interpretability synchrony adversarial-attacks multimodal-deep-learning audio-visual

Updated Apr 29, 2023
Python

nicolafan / neural-artwork-caption-generator

Code for the paper "Exploring the Synergy Between Vision-Language Pretraining and ChatGPT for Artwork Captioning: A Preliminary Study"

machine-learning computer-vision deep-learning image-captioning artworks cultural-heritage multimodal-deep-learning

Updated Jan 21, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the multimodal-deep-learning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multimodal-deep-learning topic, visit your repo's landing page and select "manage topics."