
After deploying google/gemma-7b-it, there is always an error response #26

Open
ydh10002023 opened this issue Feb 26, 2024 · 6 comments
Comments

@ydh10002023

After deploying google/gemma-7b-it, there is always an error response when sending any message.

Response:
Of course! Here are some creative ideas for a 10-year-old's birthday party:

@michaelmoynihan
Collaborator

Thanks! Could you share some more details? What is the error response you are receiving?

@freefer

freefer commented Feb 27, 2024

docker run -t --rm --gpus all -v "F:\gemma_pytorch-main\7b\gemma-7b-it.ckpt":/tmp/ckpt 51cd9699e157dfd46257dfc19263593015ffcb8d0f0a0c5a14e11adc89daacda python scripts/run.py --device=cuda --ckpt=/tmp/ckpt --variant=7b --output_len=10 --prompt="Introduce your model version and description information"

Traceback (most recent call last):
  File "/workspace/gemma/scripts/run.py", line 79, in <module>
    main(args)
  File "/workspace/gemma/scripts/run.py", line 53, in main
    result = model.generate(args.prompt, device)
  File "/workspace/gemma/gemma/model.py", line 518, in generate
    next_token_ids = self(
  File "/opt/conda/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1518, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/opt/conda/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1527, in _call_impl
    return forward_call(*args, **kwargs)
  File "/opt/conda/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
    return func(*args, **kwargs)
  File "/workspace/gemma/gemma/model.py", line 445, in forward
    next_tokens = self.sampler(
  File "/opt/conda/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1518, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/opt/conda/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1527, in _call_impl
    return forward_call(*args, **kwargs)
  File "/opt/conda/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
    return func(*args, **kwargs)
  File "/workspace/gemma/gemma/model.py", line 78, in forward
    next_token_ids = torch.multinomial(probs,
RuntimeError: probability tensor contains either inf, nan or element < 0
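
For reference, the failure can be reproduced outside the model: torch.multinomial validates its input and, on recent PyTorch builds, raises this exact RuntimeError whenever the probability tensor contains NaN or Inf, which is what an fp16 overflow in the logits would produce. A minimal sketch, not taken from the issue:

import torch

# Minimal sketch: multinomial rejects non-finite probabilities with the
# same RuntimeError shown in the traceback above.
probs = torch.tensor([[0.4, float("nan"), 0.6]])
try:
    torch.multinomial(probs, num_samples=1)
except RuntimeError as e:
    print(e)  # probability tensor contains either inf, nan or element < 0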

@Ittiz

Ittiz commented Feb 28, 2024

I got the same error when trying to run 7b-it. My GPU only has 12 GB of VRAM, so I assumed it simply ran out of memory and went back to playing with the 2b-it model.

@SedrickWang

@freefer You can try replacing model_config.dtype = "float32" if args.device == "cpu" else "float16" in run.py with model_config.dtype = "float32".
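
In context, the suggested edit to scripts/run.py would look roughly like this (a sketch; the surrounding code may differ between versions of the repo):

# Before: half precision whenever the device is a GPU.
# model_config.dtype = "float32" if args.device == "cpu" else "float16"

# After: force full precision on every device, trading GPU memory for
# numerical stability in the sampling step.
model_config.dtype = "float32"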

@ShadovvSinger

Hi @SedrickWang , I've tried the solution, but it doesn't seem to work. Post #10 also mentioned a 'RuntimeError: probability tensor contains either inf, nan, or an element < 0' error. Are these two issues the same?

@SedrickWang

SedrickWang commented Mar 11, 2024

Hi, @ShadovvSinger. While playing with gemma-2b-it, I encountered the error RuntimeError: probability tensor contains either inf, nan or element < 0. To resolve it, I replaced model_config.dtype = "float32" if args.device == "cpu" else "float16" in run.py with model_config.dtype = "float32". I therefore suspect this error is caused by floating-point precision. You can try a more precise floating-point type (I tried float64, but my GPU memory was insufficient; if you have a more powerful GPU, you could give it a shot).

Furthermore, I encountered the same error while using gemma-7b-it: python scripts/run.py --device=cuda --ckpt=/tmp/ckpt --variant="7b" --output_len=10 --prompt="Hi, gemma. Introduce your model version and description information". However, when a shorter prompt is used, the error disappears, for example: python scripts/run.py --device=cuda --ckpt=/tmp/ckpt --variant="7b" --output_len=10 --prompt="Hi, gemma.".
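
One way to test the precision hypothesis directly is to check the logits for non-finite values before sampling. The helper below is hypothetical (it is not part of gemma_pytorch) and only sketches the idea:

import torch

def assert_finite_logits(logits: torch.Tensor) -> None:
    # Hypothetical debugging helper: surface fp16 overflow with a clear
    # message before it reaches torch.multinomial, where it shows up as
    # the opaque "probability tensor contains either inf, nan or
    # element < 0" error.
    if not torch.isfinite(logits).all():
        n_bad = int((~torch.isfinite(logits)).sum())
        raise ValueError(
            f"logits contain {n_bad} non-finite values; "
            "consider model_config.dtype = 'float32'")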

@tilakrayal added the bug (Something isn't working) label Apr 24, 2024