Ollama preload keep alive #1100

frangin2003 · 2024-05-14T01:07:08Z

Issue

N/A

Change

In order to ensure that Ollama model is ready to go at startup, I have introduced an optional preload boolean keyword argument (default set to False) to the Ollama LLM, allowing the model to be preloaded into memory during the initialization phase. This feature includes a preload_model method to trigger preloading at a later time if desired. The preloading process leverages the Ollama FAQ's recommended approach of calling generate with an empty prompt (see how-can-i-pre-load-a-model-to-get-faster-response-times).
Additionally, the keep_alive parameter is now usable to be passed and tell for how long the model stays in memory.

General checklist

There are no breaking changes
I have added unit and integration tests for my change
I have manually run all the unit and integration tests in the module I have added/changed, and they are all green
I have manually run all the unit and integration tests in the core and main modules, and they are all green

I have added/updated the documentation
I have added an example in the examples repo (only for "big" features)

frangin2003 added 3 commits May 14, 2024 00:21

Added keep_alive and preload to Ollama

5110b42

Fixed tests

6d5c205

Updated the documentation

f95f715

langchain4j added the P3 Medium priority label May 14, 2024

Merge branch 'main' into ollama-preload-keep_alive

06cd212

frangin2003 marked this pull request as ready for review May 14, 2024 13:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ollama preload keep alive #1100

Ollama preload keep alive #1100

frangin2003 commented May 14, 2024

Ollama preload keep alive #1100

Are you sure you want to change the base?

Ollama preload keep alive #1100

Conversation

frangin2003 commented May 14, 2024

Issue

Change

General checklist