cavusmustafa
Collaborator

@cavusmustafa commented Oct 1, 2025

  • Dynamic context size support for stateful execution
  • Translation updates to support KVCacheFusion in the GPU plugin
  • 4D inputs for SDPA, which will enable further optimizations on GPU
  • For now, these changes take effect only for stateful models.
  • Performance improvements on CPU and GPU
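
For readers unfamiliar with the 4D SDPA shape mentioned above, here is a minimal pure-Python sketch of scaled dot-product attention over 4D inputs. It is an illustration only, not the code in this PR: the `[batch, heads, seq_len, head_dim]` layout, the helper names `sdpa_4d` and `unsqueeze_batch`, and the absence of a mask are all assumptions. The key/value sequence length is read from the inputs at call time, which is roughly what a dynamic context size means for this operation.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def unsqueeze_batch(x_3d):
    """Hypothetical helper: [H][L][D] -> [1][H][L][D] (adds a batch axis)."""
    return [x_3d]


def sdpa_4d(q, k, v):
    """Unmasked SDPA on nested lists.

    Assumed layout: q is [B][H][Lq][D]; k and v are [B][H][Lkv][D].
    Lkv is not fixed in advance; it is whatever the caller passes in.
    """
    out = []
    for qb, kb, vb in zip(q, k, v):
        out_b = []
        for qh, kh, vh in zip(qb, kb, vb):
            scale = 1.0 / math.sqrt(len(qh[0]))  # 1 / sqrt(head_dim)
            out_h = []
            for q_row in qh:
                scores = [scale * sum(a * b for a, b in zip(q_row, k_row))
                          for k_row in kh]
                weights = softmax(scores)
                out_h.append([sum(w * v_row[d] for w, v_row in zip(weights, vh))
                              for d in range(len(vh[0]))])
            out_b.append(out_h)
        out.append(out_b)
    return out


# One head, one query token, context of two cached tokens with equal keys.
q = unsqueeze_batch([[[1.0, 0.0]]])
k = unsqueeze_batch([[[1.0, 0.0], [1.0, 0.0]]])
v = unsqueeze_batch([[[1.0, 2.0], [3.0, 4.0]]])
result = sdpa_4d(q, k, v)  # equal keys give equal weights, so V is averaged
```

The same function accepts a longer `k`/`v` on the next call without any change, which is the property the sketch is meant to show.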

@github-actions bot added the ggml label Oct 1, 2025
@cavusmustafa changed the title from "kvcachefusion support" to "Performance Optimizations for CPU&GPU" Oct 2, 2025
@cavusmustafa marked this pull request as ready for review October 2, 2025 20:26
@cavusmustafa requested a review from wine99 October 6, 2025 17:19
@wine99 merged commit e727c65 into ravi9:dev_backend_openvino Oct 10, 2025
1 of 2 checks passed
2 participants