GPT-4 Vision for Home Assistant

Image Analyzer for Home Assistant using GPT Vision

🌟 Features · 📖 Resources · ⬇️ Installation · ▶️ Usage · 🪲 How to report Bugs

ha-gpt4vision creates the gpt4vision.image_analyzer service in Home Assistant. This service sends an image to an AI provider and returns the output as a response variable for easy use in automations. Supported providers are OpenAI, LocalAI and Ollama.

Features

  • Multimodal conversation with AI models
  • Compatible with OpenAI's API, LocalAI and Ollama
  • Images can be downscaled for faster processing
  • Can be installed and updated through HACS and can be set up in the Home Assistant UI

Resources

Check the 📖 wiki for examples on how you can integrate gpt4vision into your Home Assistant or join the 🗨️ discussion in the Home Assistant Community.

Installation

Installation via HACS (recommended)

Open the repository in the Home Assistant Community Store (HACS) on your Home Assistant instance and install it. Then:

  1. Search for GPT-4 Vision in Home Assistant Settings/Devices & services
  2. Select your provider
  3. Follow the instructions to complete setup

Manual Installation

  1. Download and copy the gpt4vision folder into your custom_components folder.
  2. Add the integration in Home Assistant under Settings/Devices & services
  3. Provide your API key or the IP address and port of your LocalAI server

Provider specific setup

OpenAI

Simply obtain an API key from OpenAI and enter it in the Home Assistant UI during setup.
A pricing calculator is available here: https://openai.com/api/pricing/.

LocalAI

To use LocalAI, you need to have a LocalAI server running. You can find the installation instructions here. During setup you'll need to provide the IP address of your machine and the port on which LocalAI is running (default is 8000).
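
If you don't have a LocalAI server yet, a Docker Compose file along the lines of the sketch below can start one. The image tag and the port mapping are assumptions; check the LocalAI documentation for the image variant that matches your hardware, and use the same host port during the integration setup.

# Minimal sketch of a LocalAI server via Docker Compose.
# Image tag and internal port (8080) are assumptions; adjust to your setup.
services:
  localai:
    image: localai/localai:latest-aio-cpu   # assumed CPU-only all-in-one image
    ports:
      - "8000:8080"   # host port used during integration setup : LocalAI's internal port
    restart: unless-stopped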

Ollama


To use Ollama, you first need to install Ollama on your machine. You can download it from here. Once installed, run the following command to download the llava model:

ollama run llava

If your Home Assistant is not running on the same computer as Ollama, you need to set the OLLAMA_HOST environment variable.

On Linux:

  1. Edit the systemd service by running systemctl edit ollama.service. This will open an editor.
  2. For each environment variable, add an Environment line under the [Service] section:
[Service]
Environment="OLLAMA_HOST=0.0.0.0"
  3. Save and close the editor.
  4. Reload systemd and restart Ollama:
systemctl daemon-reload
systemctl restart ollama

On Windows:

  1. Quit Ollama from the system tray
  2. Open File Explorer
  3. Right click on This PC and select Properties
  4. Click on Advanced system settings
  5. Select Environment Variables
  6. Under User variables click New
  7. For variable name enter OLLAMA_HOST and for value enter 0.0.0.0
  8. Click OK and start Ollama again from the Start Menu

On macOS:

  1. Open Terminal
  2. Run the following command:
launchctl setenv OLLAMA_HOST "0.0.0.0"
  3. Restart Ollama
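
With the llava model downloaded and OLLAMA_HOST set, a service call against Ollama looks much like the OpenAI example in the following section. This is a minimal sketch: the model name assumes you pulled llava as shown above, and the image path is only an example.

service: gpt4vision.image_analyzer
data:
  provider: Ollama
  model: llava   # the model pulled with `ollama run llava`
  message: Describe what you see
  max_tokens: 100
  image_file: /config/www/tmp/example.jpg
  temperature: 0.5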

Service call and usage

After restarting, the gpt4vision.image_analyzer service will be available. You can test it in the Developer Tools section of Home Assistant. To get GPT's analysis of a local image, use the following service call:

service: gpt4vision.image_analyzer
data:
  provider: OpenAI
  message: Describe what you see
  max_tokens: 100
  model: gpt-4o
  image_file: |-
    /config/www/tmp/example.jpg
    /config/www/tmp/example2.jpg
  target_width: 1280
  detail: low
  temperature: 0.5

The parameters provider, message, max_tokens, image_file and temperature are required. You can send multiple images per service call. Note that each path must be on a new line.

Optionally, the model, target_width and detail properties can be set.

  • For available models check these pages: supported models for OpenAI and LocalAI model gallery.
  • The target_width is an integer between 512 and 3840 representing the image width in pixels. It is used to downscale the image before encoding it.
  • The detail parameter can be set to low or high. If it is not set, it defaults to auto and OpenAI uses the image size to determine the detail level. For more information, check the OpenAI documentation.
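
Because the service returns its output as a response variable, it can be wired directly into an automation. The sketch below takes a camera snapshot, analyzes it and sends the result as a notification. The entity IDs, the notify service and the response key (response_text) are assumptions; check the actual response in Developer Tools and adjust accordingly.

alias: Describe doorbell snapshot
trigger:
  - platform: state
    entity_id: binary_sensor.doorbell   # assumed trigger entity
    to: "on"
action:
  - service: camera.snapshot
    target:
      entity_id: camera.front_door      # assumed camera entity
    data:
      filename: /config/www/tmp/doorbell.jpg
  - service: gpt4vision.image_analyzer
    data:
      provider: OpenAI
      message: Describe the person at the door
      max_tokens: 100
      model: gpt-4o
      image_file: /config/www/tmp/doorbell.jpg
      temperature: 0.5
    response_variable: analysis
  - service: notify.mobile_app_phone    # assumed notification target
    data:
      message: "{{ analysis.response_text }}"   # assumed response key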

How to report a bug or request a feature

Note

Bugs: If you encounter any bugs and have followed the instructions carefully, feel free to file a bug report.
Feature Requests: If you have an idea for a feature, create a feature request.