We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
If we compare with the asym quantization logic with AWQ, there are some differences, a major distinction is whether the range of min-max values should include zero. In AWQ, zero is not included in the range, as depicted in https://github.com/mit-han-lab/llm-awq/blob/main/awq/quantize/quantizer.py#L74, whereas GPTQ does include zero, as demonstrated in https://github.com/AutoGPTQ/AutoGPTQ/blob/main/auto_gptq/quantization/quantizer.py#L64.
Since intel/auto-round also follows AutoGPTQ, and the way used by AWQ is better I think
So my questions is does the kernel of AutoGPTQ support not including zero or AutoGPTQ will support this?
The text was updated successfully, but these errors were encountered:
No branches or pull requests
If we compare with the asym quantization logic with AWQ, there are some differences, a major distinction is whether the range of min-max values should include zero.
In AWQ, zero is not included in the range, as depicted in https://github.com/mit-han-lab/llm-awq/blob/main/awq/quantize/quantizer.py#L74,
whereas GPTQ does include zero, as demonstrated in https://github.com/AutoGPTQ/AutoGPTQ/blob/main/auto_gptq/quantization/quantizer.py#L64.
Since intel/auto-round also follows AutoGPTQ, and the way used by AWQ is better I think
So my questions is does the kernel of AutoGPTQ support not including zero or AutoGPTQ will support this?
The text was updated successfully, but these errors were encountered: