Build high-performance AI models with modular building blocks
-
Updated
Jun 9, 2024 - Python
Build high-performance AI models with modular building blocks
Start building LLM-empowered multi-agent applications in an easier way.
Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 50+ HF models, 20+ benchmarks
Open Source Routing Engine for OpenStreetMap
Basic implementation code for multimodal models and some applications or fine-tuning tasks based on them.
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
GPT4V-level open-source multi-modal model based on Llama3-8B
[ICLR 2024 Spotlight] This is the official code for the paper "SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-training"
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Language Modeling Research Hub, a comprehensive compendium for enthusiasts and scholars delving into the fascinating realm of language models (LMs), with a particular focus on large language models (LLMs)
Represent, send, store and search multimodal data
The TypeScript library for building AI applications.
Time-series forecasting of market price data using a multi-modal Convolutional Neural Network
ModelScope: bring the notion of Model-as-a-Service to life.
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
Visualizing the attention of vision-language models
VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.
Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D content. 🔥
Add a description, image, and links to the multi-modal topic page so that developers can more easily learn about it.
To associate your repository with the multi-modal topic, visit your repo's landing page and select "manage topics."