
Unfinished sentences when setting num_predict parameter #4230

Closed
mariomorvan opened this issue May 7, 2024 · 2 comments
Labels
bug Something isn't working

Comments

@mariomorvan

What is the issue?

I have tried multiple values of num_predict between 30 and 100 and two models (llama2 and llava).
In all cases the last sentence is often cut short, making it quite inconvenient to use in applications.
Not entirely sure if it is a bug or just expected behaviour that should be handled otherwise (prompt engineering, perplexity, postprocessing...).

The problem has already been mentioned by several people in this issue langgenius/dify#2461 (comment)
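For context, a minimal sketch of how the option gets set (the prompt, model tag, and cap value here are illustrative, not the exact original reproduction):

```python
import requests

# Minimal reproduction sketch: request a completion with a hard token
# cap via the num_predict option. Prompt and values are illustrative.
response = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "llama2",
        "prompt": "Explain what a hash table is.",
        "stream": False,
        "options": {"num_predict": 50},  # cap the output at ~50 tokens
    },
)
# With a low cap, the text frequently stops mid-sentence.
print(response.json()["response"])
```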

OS

macOS

GPU

Intel

CPU

Intel

Ollama version

0.1.33

@mariomorvan mariomorvan added the bug Something isn't working label May 7, 2024
@jmorganca
Member

Hi @mariomorvan, thanks for the issue. This is expected behavior: num_predict decides how many tokens (roughly, words) will be output. You'd want to leave enough headroom for at least one complete sentence.

That said, I totally understand you'd like to limit the length and receive a complete answer. This is something we'll consider in the future! A good tip for this is to mention the desired length of the response in the prompt. For example, "answer this question in a single sentence of no more than 10 words" - the language model will often oblige :)
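A sketch of that workaround against a local Ollama endpoint (the prompt wording and the generous num_predict value are illustrative assumptions, not a confirmed recipe):

```python
import requests

# Workaround sketch: state the desired length in the prompt itself and
# keep num_predict high enough that the model can finish on its own.
response = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "llama2",
        "prompt": (
            "Answer this question in a single sentence of no more than "
            "10 words: what is a hash table?"
        ),
        "stream": False,
        # Generous headroom: the model should hit its natural stop
        # token well before the cap, so the sentence is not cut off.
        "options": {"num_predict": 100},
    },
)
print(response.json()["response"])
```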

@mariomorvan
Author

Thanks - looks like a useful and often effective workaround

