[Bug] [chatglm3-6b] QUANTIZE_8bit works with the chatglm3-6b model, but QUANTIZE_4bit has no effect and fails with an out-of-memory (OOM) error
#1481 · Open · 3 of 15 tasks
zyf950120 opened this issue on Apr 30, 2024 · 0 comments
Aries-ckt changed the title from the original Chinese to its English translation on Apr 30, 2024
Search before asking
Operating system information
Linux
Python version information
3.10
DB-GPT version
main
Related scenes
Installation Information
Installation From Source
Docker Installation
Docker Compose Installation
Cluster Installation
AutoDL Image
Other
Device information
GPU: 3060 12G
Models information
LLM: chatglm3-6b
What happened
8-bit quantization works: VRAM usage is around 9000 MB, as shown in the screenshot below.
4-bit quantization, however, has no effect. The model appears to load at its original precision and then fails with an out-of-memory error.
With another model, codellama-7b, both 8-bit and 4-bit quantization succeed.
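For reference, a rough back-of-envelope estimate suggests 4-bit weights should fit comfortably in 12 GB. This is only a sketch: the ~6.2B parameter count is an assumption, and it counts weights alone, ignoring activations, the CUDA context, and the KV cache (which is why the measured 8-bit usage of ~9000 MB is higher than the weight size).

```python
# Rough VRAM estimate for the chatglm3-6b weights at different precisions.
# Assumption: ~6.2e9 parameters; runtime overhead (activations, CUDA
# context, KV cache) is ignored, so real usage is higher.
PARAMS = 6.2e9

def weight_gib(bits_per_weight: float) -> float:
    """Size of the model weights alone, in GiB."""
    return PARAMS * bits_per_weight / 8 / 2**30

for bits in (16, 8, 4):
    print(f"{bits:>2}-bit weights: {weight_gib(bits):.1f} GiB")
# 16-bit weights: 11.5 GiB
#  8-bit weights:  5.8 GiB
#  4-bit weights:  2.9 GiB
```

So if 4-bit quantization were actually applied, the weights alone would take roughly half of what 8-bit needs, well under the 12 GB on a 3060. An OOM at 4-bit is consistent with the quantization setting being silently ignored.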
What you expected to happen
The warning message says that chatglm and chatglm2 do not support quantization.
But according to the chatglm3 GitHub repository, chatglm3 does support 4-bit quantization.
How to reproduce
none
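Although I marked "none" above, the relevant configuration can be sketched. In DB-GPT the quantization mode is normally selected through the .env file; the key names below are assumptions based on my reading of the template and may differ between versions, so please verify against your own .env.template:

```shell
# Hypothetical excerpt of DB-GPT's .env (key names assumed).
LLM_MODEL=chatglm3-6b
QUANTIZE_8bit=False   # this mode works, ~9000 MB VRAM
QUANTIZE_4bit=True    # expected to load the model in 4-bit, but OOMs instead
```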
Additional context
No response
Are you willing to submit PR?