wanghz18

Follow

wanghz18

Follow

Popular repositories Loading

resposity resposity Public

Python
xv6 xv6 Public
exllama exllama Public

Forked from turboderp/exllama

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Python
GPTQ-triton GPTQ-triton Public

Forked from fpgaminer/GPTQ-triton

GPTQ inference Triton kernel

Jupyter Notebook