multimodal-large-language-models

Personal Project: MPP-Qwen14B(Multimodal Pipeline Parallel-Qwen14B). Don't let the poverty limit your imagination! Train your own 14B LLaVA-like MLLM on RTX3090/4090 24GB.

fine-tuning pipeline-parallelism pretraining model-parallel deepspeed mllm multimodal-large-language-models qwen

Updated Jun 7, 2024
Jupyter Notebook

burglarhobbit / Awesome-Medical-Large-Language-Models

Star

Curated papers on Large Language Models in Healthcare and Medical domain

large-language-models large-vision-language-models multimodal-large-language-models

Updated May 20, 2024

X-PLUG / Youku-mPLUG

Star

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks

benchmark video dataset chinese youku multimodal video-retrieval video-question-answering multimodal-pretraining mllm multimodal-large-language-models

Updated Jan 8, 2024
Python

YingqingHe / Awesome-LLMs-meet-Multimodal-Generation

Star

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

text-to-speech multimodality text-to-image text-to-audio text-to-video text-to-music multimodal-models aigc large-language-models text-to-3d multimodal-generation text-to-sound large-vision-language-models multimodal-large-language-models

Updated Jun 3, 2024
HTML

AviSoori1x / seemore

Star

From scratch implementation of a vision language model in pure PyTorch

deep-learning pytorch artificial-intelligence neural-networks multimodal-learning multimodal pytorch-implementation large-language-models llm vision-language-model llava multimodal-large-language-models

Updated May 6, 2024
Jupyter Notebook

zjukg / KoPA

Star

[Paper][Preprint 2023] Making Large Language Models Perform Better in Knowledge Graph Completion

knowledge-graph knowledge-graph-completion multi-modal knowledge-graph-embeddings large-language-models instruction-tuning multimodal-large-language-models

Updated Feb 10, 2024
Python

vincentlux / Awesome-Multimodal-LLM

Star

Reading list for Multimodal Large Language Models

machine-learning natural-language-processing computer-vision awesome-list paper-list multimodal-machine-learning large-language-models vision-language-model multimodal-large-language-models

Updated Aug 17, 2023

Improve this page

Add a description, image, and links to the multimodal-large-language-models topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multimodal-large-language-models topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

multimodal-large-language-models

Here are 52 public repositories matching this topic...

BradyFU / Awesome-Multimodal-Large-Language-Models

modelscope / modelscope-agent

X-PLUG / MobileAgent

YangLing0818 / RPG-DiffusionMaster

X-PLUG / mPLUG-DocOwl

BAAI-DCAI / Bunny

LLaVA-VL / LLaVA-Plus-Codebase

richard-peng-xia / awesome-multimodal-in-medical-imaging

rese1f / MovieChat

X-LANCE / SLAM-LLM

tsujuifu / pytorch_mgie

BradyFU / Woodpecker

HenryHZY / Awesome-Multimodal-LLM

Coobiw / MiniGPT4Qwen

burglarhobbit / Awesome-Medical-Large-Language-Models

X-PLUG / Youku-mPLUG

YingqingHe / Awesome-LLMs-meet-Multimodal-Generation

AviSoori1x / seemore

zjukg / KoPA

vincentlux / Awesome-Multimodal-LLM

Improve this page

Add this topic to your repo