Support for IPEX/Intel GPUs #718
@cgruver had joy with this container image, though I'm not sure if it's the same type of acceleration: https://github.com/containers/ramalama/blob/main/container-images/intel-gpu/Containerfile We'd need someone from the community with the hardware and expertise to test it and open a PR.

@hanthor I've tested this on an Intel 155H with an Arc GPU. It's actually pretty snappy. Also, JFYI, that particular image uses OpenCL. I have a refactor on the way that will use Level Zero, which is a bit faster than OpenCL.
This worked great! Now I'm wondering about these NPUs... maybe running Whisper on one? I still haven't come up with a decent application for them.
The proper bits for supporting Intel GPUs for running ollama and others exist here: https://github.com/intel/ipex-llm/tree/main/docker/llm
We just need someone with the expertise to add the right pieces to support it.
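For anyone with the hardware who wants to try the intel-gpu image mentioned above, here is a minimal sketch of building and running it with podman. The local image tag is a placeholder, and the exact entrypoint and flags may differ from what ramalama itself uses; the key part is passing the `/dev/dri` render node through so the OpenCL (or, later, Level Zero) runtime inside the container can see the GPU:

```shell
# From a checkout of https://github.com/containers/ramalama
# Build the Intel GPU image (the tag "ramalama-intel-gpu" is a local placeholder).
podman build -t ramalama-intel-gpu \
    -f container-images/intel-gpu/Containerfile .

# Run with the DRI devices passed through so the compute runtime can find the GPU.
# On most distros the host user needs to be in the "render" (or "video") group.
podman run --rm -it \
    --device /dev/dri \
    ramalama-intel-gpu
```

Whether the workload actually lands on the GPU can be checked from inside the container with a tool like `clinfo` (for OpenCL) if the image includes it.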