Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing the params.json #140

Open
littlewwwhite opened this issue Mar 19, 2024 · 2 comments
Open

Missing the params.json #140

littlewwwhite opened this issue Mar 19, 2024 · 2 comments

Comments

@littlewwwhite
Copy link

FileNotFoundError: [Errno 2] No such file or directory: 'Mistral-7B-Instruct-v0.2/params.json'

and I replace the params.json to config.json but still lack of several params such like kv_cache dim etc.

@kolinko
Copy link

kolinko commented Apr 10, 2024

Same here. Also, the model on huggingface has different names for w1/w2/w3 weights (up_proj, down_proj, gate_proj), and it's not documented anywhere. I guess the model from HF works with the Transformers library, but both this and MLX impl need fixing to use it.

@nathan-pugh-fsn
Copy link

Just add the different ones to your params.json file for w1/w2/w3 etc…

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants