
run llama3-70B-q8_0 error #4226

Open
leoHostProject opened this issue May 7, 2024 · 0 comments
Labels
bug (Something isn't working) · gpu · nvidia (Issues relating to Nvidia GPUs and CUDA)

Comments

@leoHostProject

What is the issue?

API call error. Message:
{"error":{"message":"timed out waiting for llama runner to start:CUDA error:uncorrectable ECC error encountered
n current device:0,in function ggml cuda_compute_forward at /go/src/github.com/ollama/ollama/11m/1lama.cpp/ggml
-cuda.cu:2300\n err\nGGML_ASSERT:/go/src/github.com/ollama/ollama/11m/1lama.cpp/ggml-cuda.cu:60:!"CUDA error"
""type":"api_error","param"null,"code":null}}

OS

Linux, Docker

GPU

Nvidia

CPU

Intel

Ollama version

No response

@leoHostProject added the bug (Something isn't working) label on May 7, 2024
@BruceMacD added the nvidia (Issues relating to Nvidia GPUs and CUDA) label on May 7, 2024
@pdevine added the gpu label on May 7, 2024