Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LLama 2 and Code LLama support? #73

Open
Ananderz opened this issue Sep 4, 2023 · 2 comments
Open

LLama 2 and Code LLama support? #73

Ananderz opened this issue Sep 4, 2023 · 2 comments

Comments

@Ananderz
Copy link

Ananderz commented Sep 4, 2023

I have been trying to get llama 2 models to function correctly. They start off ok but then all of them goes into a loop with repetitions or gibberish.

I haven't tried setting model_type:llama to something else, could it be that we need to add llama2 here instead?

model_type: llama

Possible to get any of the code llms to support this ?

@Ananderz
Copy link
Author

Ananderz commented Sep 4, 2023

I tried with llama-2 and llama2 and read the ctransformers documentation and realized its just llama.

The answer gets into a loop when using llama2 models:

The telecom industry is not not not not not not not not not not not not not not not

Like that, I read somewhere that it could be related to something RoPE but don't know how to set that!

@Ananderz
Copy link
Author

Fixed it by implementing prompt template!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant