gpt-vision
Here are 23 public repositories matching this topic...
A web-based tool that utilizes GPT-4's vision capabilities to analyze and describe system architecture diagrams, providing instant insights and detailed breakdowns in an interactive chat interface.
Updated Nov 9, 2023 - JavaScript
Create AWS infrastructure from architecture diagrams and natural language, interpreted by the OpenAI GPT model.
Updated Nov 18, 2023 - Python
Kani extension for supporting vision-language models (VLMs). Comes with model-agnostic support for GPT-Vision and LLaVA.
Updated Nov 22, 2023 - Python
Updated Dec 3, 2023 - Python
Interface for using the OpenAI API (GPT-4, DALL-E 3, ...)
Updated Dec 7, 2023 - TypeScript
A powerful AI package (built with TypeScript), inspired by @rizzlogy/bardie, for interacting with the Google Bard API without needing to set your own cookie.
Updated Jan 1, 2024 - TypeScript
autoPDFtagger is a Python tool designed for efficient home-office organization, focusing on digitizing and organizing both digital and paper-based documents. By automating the tagging of PDF files, including image-rich documents and scans of varying quality, it aims to streamline the organization of digital archives.
Updated Jan 1, 2024 - Python
Create interactive polls directly from whiteboard content. Built on top of the tldraw make-real template and live audio-video by 100ms, it uses OpenAI's GPT Vision to generate an appropriate question with answer options, so a poll can be launched instantly to engage the audience.
Updated Mar 14, 2024 - TypeScript
Project submission for hack it sapiens hackathon.
Updated Mar 30, 2024 - Python
A one-stop system integrating GPT question answering, Midjourney image generation, and other services.
Updated Apr 3, 2024 - Vue
A completely private, locally operated AI assistant/chatbot/sub-agent framework with realistic long-term memory and thought formation, using open-source LLMs. Qdrant is used for the vector DB.
Updated Apr 9, 2024 - Python
Auto-caption images for training in Stable Diffusion.
Updated Apr 9, 2024 - Python
Convert PDF to Markdown via an OpenAI multimodal text/vision model.
Updated May 3, 2024 - Python
A simple Matrix bot that supports image generation and chatting using ChatGPT and LangChain.
Updated May 8, 2024 - Python
Java client library for the OpenAI API. Full support for all OpenAI API models, including Completions, Chat, Edits, Embeddings, Audio, Files, Assistants-v2, Images, Moderations, Batch, and Fine-tuning.
Updated May 24, 2024 - Java