multimodal-large-language-models

Personal Project: MPP-Qwen14B (Multimodal Pipeline Parallel-Qwen14B). Don't let poverty limit your imagination! Train your own 14B LLaVA-like MLLM on an RTX 3090/4090 with 24GB.

fine-tuning pipeline-parallelism pretraining model-parallel deepspeed mllm multimodal-large-language-models qwen

Updated Jun 7, 2024
Jupyter Notebook

BradyFU / Video-MME

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

video mme large-language-models large-vision-language-models multimodal-large-language-models video-mme

Updated Jun 7, 2024
Python

IrohXu / Awesome-Multimodal-LLM-Autonomous-Driving

[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving

autonomous-car autonomous-driving autonomous-vehicles self-driving multimodal vision-transformer foundation-models large-language-models vision-language-model multimodal-large-language-models

Updated Mar 14, 2024

YingqingHe / Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLM-based multimodal generation (image, video, 3D, and audio).

text-to-speech multimodality text-to-image text-to-audio text-to-video text-to-music multimodal-models aigc large-language-models text-to-3d multimodal-generation text-to-sound large-vision-language-models multimodal-large-language-models

Updated Jun 3, 2024
HTML

burglarhobbit / Awesome-Medical-Large-Language-Models

Curated papers on Large Language Models in the healthcare and medical domain

large-language-models large-vision-language-models multimodal-large-language-models

Updated May 20, 2024

AviSoori1x / seemore

From-scratch implementation of a vision language model in pure PyTorch

deep-learning pytorch artificial-intelligence neural-networks multimodal-learning multimodal pytorch-implementation large-language-models llm vision-language-model llava multimodal-large-language-models

Updated May 6, 2024
Jupyter Notebook

multimodal-large-language-models

Here are 52 public repositories matching this topic...

BradyFU / Awesome-Multimodal-Large-Language-Models

modelscope / modelscope-agent

X-PLUG / MobileAgent

YangLing0818 / RPG-DiffusionMaster

X-PLUG / mPLUG-DocOwl

BAAI-DCAI / Bunny

LLaVA-VL / LLaVA-Plus-Codebase

BradyFU / Woodpecker

rese1f / MovieChat

X-LANCE / SLAM-LLM

richard-peng-xia / awesome-multimodal-in-medical-imaging

HenryHZY / Awesome-Multimodal-LLM

tsujuifu / pytorch_mgie

X-PLUG / Youku-mPLUG

Coobiw / MiniGPT4Qwen

BradyFU / Video-MME

IrohXu / Awesome-Multimodal-LLM-Autonomous-Driving

YingqingHe / Awesome-LLMs-meet-Multimodal-Generation

burglarhobbit / Awesome-Medical-Large-Language-Models

AviSoori1x / seemore
