
Feat: Add OLLAMA_LOAD_TIMEOUT env variable #4123

Open
wants to merge 3 commits into main

Conversation

dcfidalgo

Closes #3940

For certain hardware setups and models, offloading to the GPU can take a long time and the user can hit a timeout. This PR makes the timeout configurable via the OLLAMA_LOAD_TIMEOUT environment variable, given in seconds.

@dhiltgen I added a subsection in the FAQ, since I was not sure where to document the env variable. Let me know if this is the right place.

llm/server.go
	}
}
// Be generous with the timeout; large models can take a while to load.
expiresAt := time.Now().Add(time.Duration(timeout) * time.Second)
ticker := time.NewTicker(50 * time.Millisecond)
Contributor

Should we print a "loading the model" message on each tick?
Without a message or a spinner, the process looks stuck for 10 minutes.

@dcfidalgo dcfidalgo requested a review from dhiltgen May 6, 2024 07:00
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

GPU offloading with little CPU RAM
3 participants