
Conversation

@wenhuach21 wenhuach21 commented Aug 15, 2025

Add a torch version restriction for autogptq
Refine the int_sym code
Optimize zero-point (zp) packing for symmetric quantization (a sketch follows below)
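
The last item is the most algorithm-heavy change, so here is a minimal sketch of what a symmetric zero-point packing shortcut could look like. The function name, tensor shapes, and the choice of storing zp rather than zp-1 are illustrative assumptions, not code from this PR:

```python
import torch


def pack_sym_qzeros(n_groups: int, out_features: int, bits: int = 4) -> torch.Tensor:
    # With symmetric quantization every group shares the same zero point
    # (e.g. 8 for 4-bit), so the packed qzeros tensor can be filled with one
    # precomputed 32-bit word instead of packing per-group values in a loop.
    zp = 1 << (bits - 1)            # constant symmetric zero point (assumed convention)
    vals_per_word = 32 // bits      # how many zero points fit into one int32
    word = 0
    for i in range(vals_per_word):
        word |= zp << (bits * i)
    if word >= 1 << 31:             # reinterpret the bit pattern as a signed int32
        word -= 1 << 32
    return torch.full(
        (n_groups, out_features // vals_per_word), word, dtype=torch.int32
    )
```

The idea of the optimization is that a single constant fill replaces element-wise packing of identical zero points, which matters when the weight matrix is large.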

@wenhuach21 (Contributor Author) commented:
transformers is OK, as we have kernels to back up the autogptq kernels. For vllm, only 4 bits and 8 bits are fine.
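
A minimal sketch of the kind of bit-width guard this comment implies for the vLLM export path; the function name and error message are assumptions, not code from this repository:

```python
def check_vllm_bits(bits: int) -> None:
    # Hypothetical guard: vLLM's GPTQ kernels only cover 4-bit and 8-bit,
    # while the Transformers backend can fall back to other kernels.
    if bits not in (4, 8):
        raise ValueError(
            f"vLLM GPTQ kernels support only 4-bit and 8-bit, got {bits}-bit; "
            "use the Transformers backend instead."
        )
```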

@wenhuach21 wenhuach21 closed this Aug 21, 2025