Control Any Computer Using LLMs
Updated May 12, 2024 - Python
Convert different model APIs into the OpenAI API format out of the box.
"Improving Mathematical Reasoning with Process Supervision" by OpenAI
A tool that uses GPT-4 Vision to operate your computer.
Towards Explainable Metrics for Conditional Image Synthesis Evaluation (ACL 2024)
Capture images with HoloLens and receive descriptive responses from OpenAI's GPT-4V(ision).
Your own personal Ruskin.
Digital Artificial Intelligence Agent
VisionQuery GPT-4v combines screenshot-based queries with OpenAI's GPT-4V. Users capture a screen, ask a question about it, and receive an answer from the model.
Efficient, easy-to-use utility functions for common tasks. Includes functions to fetch JPG files from a folder, sort them by modification time, and preprocess images in batches for the GPT-4o vision API. Also includes optimized directory operations, file handling, image processing, and JSON manipulation with caching and multithreading.
An AI-powered camera for the web.