Skip to content

AMD 7600XT - llama off loads to GPU but low TpS #707

Discussion options

You must be logged in to vote

You may need to adjust the GGML_VK_FORCE_MAX_ALLOCATION_SIZE size due to some bugs in the driver on arm64. See: geerlingguy/ollama-benchmark#1 and have a read through my notes towards the bottom of that original issue post.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@frankmtl-git
Comment options

Answer selected by frankmtl-git
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #706 on February 08, 2025 16:13.