multi-modal

Language Modeling Research Hub, a comprehensive compendium for enthusiasts and scholars delving into the fascinating realm of language models (LMs), with a particular focus on large language models (LLMs)

open-source api-wrapper accelerate multi-modal pretraining large-language-models llm rlhf instruction-tuning

Updated Jun 7, 2024
Python

docarray / docarray

Star

Represent, send, store and search multimodal data

elasticsearch machine-learning deep-learning protobuf pytorch data-structures nearest-neighbor-search cross-modal multi-modal semantic-search multimodal nested-data weaviate dataclass pydantic fastapi neural-search qdrant docarray

Updated Jun 6, 2024
Python

vercel / modelfusion

Star

The TypeScript library for building AI applications.

Updated Jun 6, 2024
TypeScript

colurw / temporal_CNN

Star

Time-series forecasting of market price data using a multi-modal Convolutional Neural Network

numpy pandas multi-modal time-series-forecasting tensorflow2

Updated Jun 6, 2024
Jupyter Notebook

modelscope / modelscope

Star

ModelScope: bring the notion of Model-as-a-Service to life.

python nlp science machine-learning deep-learning cv speech multi-modal

Updated Jun 6, 2024
Python

modelscope / data-juicer

Star

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据！

Updated Jun 6, 2024
Python

OpenGVLab / InternVL

Star

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

image-classification gpt multi-modal semantic-segmentation video-classification mme image-text-retrieval llm vision-language-model gpt-4v vit-6b vit-22b gpt-4o

Updated Jun 6, 2024
Python

zjysteven / VLM-Visualizer

Star

Visualizing the attention of vision-language models

attention multi-modal attention-mechanism vision-language vision-language-model llava

Updated Jun 6, 2024
Jupyter Notebook

howard-hou / VisualRWKV

Star

VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.

multi-modal large-language-models rwkv

Updated Jun 4, 2024
Python

Yuan-ManX / ai-multimodal-timeline

Star

Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D content. 🔥

ai multi-modal ai-agents deeplearning-ai multimodal multimodal-deep-learning llm

Updated Jun 4, 2024

Improve this page

Add a description, image, and links to the multi-modal topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multi-modal topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

multi-modal

Here are 272 public repositories matching this topic...

kyegomez / zeta

SciSharp / LLamaSharp

modelscope / agentscope

open-compass / VLMEvalKit

valhalla / valhalla

Lizhecheng02 / MultiModal

OpenBMB / MiniCPM-V

THUDM / CogVLM2

deep-symbolic-mathematics / Multimodal-Math-Pretraining

marqo-ai / marqo

patrick-tssn / LM-Research-Hub

docarray / docarray

vercel / modelfusion

colurw / temporal_CNN

modelscope / modelscope

modelscope / data-juicer

OpenGVLab / InternVL

zjysteven / VLM-Visualizer

howard-hou / VisualRWKV

Yuan-ManX / ai-multimodal-timeline

Improve this page

Add this topic to your repo