Support for 4bit or 8bit tensor #5
Comments
I think this will come, but there are a few prerequisites. Initially the core library doesn't include specific shaders/kernels; I want to build up to that gradually, only including them when they're clearly useful in multiple contexts, so we don't end up with a large surface area of kernels to support continually. This is relevant because there are no built-in 4bit/8bit types, so they would have to be closely coupled with the various dequant implementations that would be part of the library. A rough outline of the sequence might look like:
As of Chrome 123, WGSL supports a dot product with accumulate over packed int8 vec4s (the dot4I8Packed and dot4U8Packed built-ins): https://developer.chrome.com/blog/new-in-webgpu-123. So 8bit certainly makes sense to support, to the extent that other precisions are supported by this library.
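For anyone unfamiliar with the packed dot product, here is a minimal CPU-side sketch in Python of what the signed variant computes: four signed 8-bit lanes stored in one 32-bit word, multiplied lane by lane and summed into a wider integer. The helper names (`pack4_i8`, `unpack4_i8`, `dot4_i8_packed`) are made up for illustration and are not part of this library or of WGSL. The lane order (lowest byte first) is an assumption of this sketch, though it does not affect the dot product as long as both operands are packed the same way.

```python
import struct


def pack4_i8(values):
    """Pack four signed 8-bit integers into one unsigned 32-bit word.

    Lane 0 goes into the lowest byte (an assumption of this sketch).
    struct raises an error if a value is outside [-128, 127].
    """
    if len(values) != 4:
        raise ValueError("expected exactly four values")
    return struct.unpack("<I", struct.pack("<4b", *values))[0]


def unpack4_i8(word):
    """Split an unsigned 32-bit word back into four signed 8-bit lanes."""
    return list(struct.unpack("<4b", struct.pack("<I", word)))


def dot4_i8_packed(a, b):
    """Reference for a packed int8 dot product: sum of lane-wise products."""
    return sum(x * y for x, y in zip(unpack4_i8(a), unpack4_i8(b)))


a = pack4_i8([1, -2, 3, -4])
b = pack4_i8([5, 6, -7, 8])
print(hex(a), dot4_i8_packed(a, b))
```

The point for this issue is that each 32-bit load carries four int8 operands, so a quantized matmul inner loop reads a quarter of the bytes an f32 version would.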
Although it may be out of scope, it would be nice to have an example of computing with 4bit and 8bit tensors, to save memory bandwidth.
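To make the request concrete, here is a rough Python sketch of the idea: quantize floats to signed 4-bit integers with one shared scale, store two values per byte, and dequantize on read. This is only an illustration of the storage format and the dequant step, not this library's API. The function names, the symmetric single-scale scheme, and the low-nibble-first layout are all assumptions of the sketch; real formats typically use per-block scales.

```python
def quantize_q4(values):
    """Symmetric 4-bit quantization with one shared scale (assumed scheme)."""
    max_abs = max((abs(v) for v in values), default=0.0)
    scale = max_abs / 7.0 if max_abs > 0 else 1.0
    # Clamp to the signed 4-bit range [-8, 7].
    q = [max(-8, min(7, round(v / scale))) for v in values]
    return q, scale


def pack_q4(q):
    """Pack signed 4-bit integers two per byte, low nibble first."""
    q = list(q)
    if len(q) % 2:
        q.append(0)  # pad to an even count
    out = bytearray()
    for lo, hi in zip(q[0::2], q[1::2]):
        out.append((lo & 0xF) | ((hi & 0xF) << 4))
    return bytes(out)


def unpack_q4(data, count):
    """Recover `count` signed 4-bit integers from packed bytes."""
    out = []
    for byte in data:
        for nibble in (byte & 0xF, byte >> 4):
            # Sign-extend: nibbles 8..15 represent -8..-1.
            out.append(nibble - 16 if nibble >= 8 else nibble)
    return out[:count]


def dequantize_q4(data, scale, count):
    """Dequantize packed 4-bit values back to floats."""
    return [x * scale for x in unpack_q4(data, count)]


weights = [3.5, -3.5, 1.0, 0.0, -2.0]
q, scale = quantize_q4(weights)
packed = pack_q4(q)
print(len(packed), dequantize_q4(packed, scale, len(weights)))
```

Here five f32 values (20 bytes) are stored in 3 bytes plus one scale, which is where the memory bandwidth saving comes from. On the GPU, the unpack and multiply-by-scale steps would live inside the kernel that consumes the tensor, which is why the types end up coupled to the dequant implementation.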