gpt4v
Here are 33 public repositories matching this topic...
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
-
Updated
Apr 3, 2024 - Python
Vision utilities for web interaction agents 👀
-
Updated
May 21, 2024 - Jupyter Notebook
Early Alpha Release: Chat with Your Image - Leveraging GPT-4 Vision and Function Calls for AI-Powered Image Analysis and Description
-
Updated
Nov 29, 2023 - TypeScript
The ultimate sketch to code app made using GPT4 vision. Choose your desired framework (React, Next, React Native, Flutter) for your app. It will instantly generate code and preview (sandbox) from a simple hand drawn sketch on paper captured from webcam
-
Updated
May 3, 2024
Control Any Computer Using LLMs
-
Updated
May 12, 2024 - Python
Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.
-
Updated
Oct 25, 2023
Convert different model APIs into the OpenAI API format out of the box.
-
Updated
Feb 21, 2024 - Go
AI Voiceover with GPT4V
-
Updated
May 10, 2024 - Jupyter Notebook
Monitor the performance of OpenAI's GPT-4V model over time.
-
Updated
May 21, 2024 - HTML
Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta
-
Updated
May 17, 2024 - Python
Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2
-
Updated
Mar 12, 2024 - Python
I GAVE GPT-4 EYES!
-
Updated
Jan 24, 2024 - JavaScript
-
Updated
Nov 16, 2023
Chain of Images for Intuitively Reasoning
-
Updated
Nov 29, 2023 - Python
Improve this page
Add a description, image, and links to the gpt4v topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the gpt4v topic, visit your repo's landing page and select "manage topics."