StableLM #8
Comments
CC @davidkoski this is the issue I was referring to! Any help here much appreciated 👍
I see a slightly different issue, but likely related:
This is coming from Attention -> Linear. The problem stems from the shape of
which produces a shape of
At the top level:
y is an empty array:
and it looks like that is what we are getting out of the tokenizer (I am using the default
I will continue looking at this later.
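An empty array coming out of the tokenizer is easier to diagnose at the point where it is produced than later, as a shape error inside Attention -> Linear. Below is a minimal sketch of such a guard, written in Python for illustration only. The name `check_tokens` is hypothetical and not part of any library mentioned in this thread; the same check could be ported to the app's own language.

```python
from typing import Sequence


def check_tokens(tokens: Sequence[int], prompt: str) -> Sequence[int]:
    """Raise early if the tokenizer produced nothing for the given prompt.

    `check_tokens` is a hypothetical helper, not an existing API.
    """
    if len(tokens) == 0:
        # Fail here with a readable message instead of letting an empty
        # array reach the model and surface as a shape mismatch.
        raise ValueError(
            f"tokenizer returned no tokens for a prompt of length {len(prompt)}"
        )
    return tokens
```

Calling it right after encoding, before the tokens are turned into a model input, keeps the error message next to its cause.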
Here are some ideas on how to debug the model behavior:
Notice the augmentation of the prompt -- this is done using Python code in the tokenizer configuration. We can't run that, so you may need some configuration to help with this. For example, in the example repo:
Simple, but probably helpful.
Given the working Python version you can do a few things:
Good luck and ask if you have questions!
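One way to use a working Python version as a reference is to dump the token IDs it produces for a prompt and compare them with the IDs from the port. The sketch below assumes both sides can export their IDs as plain integer lists. `first_divergence` is a hypothetical helper, and the IDs in the usage note are made up for illustration.

```python
from itertools import zip_longest
from typing import Optional, Sequence


def first_divergence(
    reference: Sequence[int], candidate: Sequence[int]
) -> Optional[int]:
    """Return the first index where the two sequences differ, or None.

    A length mismatch counts as a difference at the first missing position.
    """
    # zip_longest pads the shorter sequence with None, which never equals
    # an integer token id, so a missing token is reported as a mismatch.
    for i, (r, c) in enumerate(zip_longest(reference, candidate)):
        if r != c:
            return i
    return None
```

For example, `first_divergence([1, 15043, 3186], [15043, 3186])` returns `0`, which would suggest that the candidate is missing a leading token, such as one added by the prompt augmentation.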
@davidkoski thank you so much for the detailed breakdown of what went on here, this made a ton of sense! The other takeaway for me is that we need to improve debuggability and implement sanity checks, and probably also expose verification as a visible parameter that can be toggled on/off. I'm going to think on it a bit and add some UI for it; let me know if anything comes to mind here!
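A rough sketch of what verification as a switchable parameter might look like, in Python for illustration only. `GenerationOptions`, `run_checks`, and `prepare` are hypothetical names, and the two checks shown are assumptions about what would be worth verifying, not the project's actual design.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class GenerationOptions:
    # Hypothetical flag; a UI toggle could bind to it.
    verify: bool = True


def run_checks(tokens: Sequence[int], vocab_size: int) -> List[str]:
    """Collect readable problems rather than stopping at the first one."""
    problems = []
    if len(tokens) == 0:
        problems.append("prompt encoded to zero tokens")
    out_of_range = [t for t in tokens if not 0 <= t < vocab_size]
    if out_of_range:
        problems.append(
            f"{len(out_of_range)} token id(s) outside [0, {vocab_size})"
        )
    return problems


def prepare(
    tokens: Sequence[int], vocab_size: int, options: GenerationOptions
) -> List[int]:
    """Return the tokens as a list, verifying them first if enabled."""
    if options.verify:
        problems = run_checks(tokens, vocab_size)
        if problems:
            raise ValueError("; ".join(problems))
    return list(tokens)
```

Returning a list of problems from `run_checks` means a UI could display all of them at once, while `prepare` decides whether they are fatal.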