CVS-175734- [OVEP GPU] add GQA in support list for GPU backend #830

Kotomi-Du · 2025-10-10T22:28:14Z

Description:

GQA is originally supported by OV starting from 2025.1. This PR is to align with OV support. Will go to New ABI as well.

If feature goes to new ABI?

Yes

Jira Ticket :

https://jira.devtools.intel.com/browse/CVS-175734

ankitm3k · 2025-10-13T05:35:35Z

onnxruntime/core/providers/openvino/backends/basic_backend.h

      "beam_idx",
      "past_key_values",
      "present",
+      "total_seq_len",


@Kotomi-Du Does the stateful model post translation into OVIR comprise of total_seq_len input always? Is this a general case for all LLMs now (since which OV toolkit version this was added)?

it is the input name from Msft generic model (specifically Phisilica model), not the Epctx OVIR model OV toolkit generated

onnxruntime/core/providers/openvino/ov_versions/data_ops.cc

ankitm3k

LGTM

Kotomi-Du · 2025-10-17T17:57:48Z

removed CPU support and added Jira ticket

MayureshV1

LGTM !

support GQA

5708bda

Kotomi-Du requested review from RyanMetcalfeInt8, ankitm3k and preetha-intel October 10, 2025 22:28

Kotomi-Du mentioned this pull request Oct 11, 2025

CVS-175736-[OVEP] Enable stateful mode for Phi-silica models #821

Merged

ankitm3k force-pushed the support_GQA_GPU branch from 5708bda to c4f7cdd Compare October 13, 2025 05:08

ankitm3k reviewed Oct 13, 2025

View reviewed changes

onnxruntime/core/providers/openvino/ov_versions/data_ops.cc Outdated Show resolved Hide resolved

ankitm3k approved these changes Oct 13, 2025

View reviewed changes

remove CPU/NPU support

bf8939b

remove useless code

d73ef49

Kotomi-Du force-pushed the support_GQA_GPU branch from 5234a6c to d73ef49 Compare October 27, 2025 20:13

Merge branch 'ovep-develop' into support_GQA_GPU

d90129b

MayureshV1 approved these changes Oct 27, 2025

View reviewed changes

MayureshV1 changed the title ~~[OVEP GPU] add GQA in support list for GPU backend~~ CVS-175734- [OVEP GPU] add GQA in support list for GPU backend Oct 27, 2025

MayureshV1 merged commit eff6cac into intel:ovep-develop Oct 27, 2025
3 of 5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CVS-175734- [OVEP GPU] add GQA in support list for GPU backend #830

CVS-175734- [OVEP GPU] add GQA in support list for GPU backend #830

Uh oh!

Kotomi-Du commented Oct 10, 2025 •

edited

Loading

Uh oh!

ankitm3k Oct 13, 2025

Uh oh!

Kotomi-Du Oct 14, 2025

Uh oh!

Uh oh!

ankitm3k left a comment

Uh oh!

Kotomi-Du commented Oct 17, 2025

Uh oh!

MayureshV1 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

CVS-175734- [OVEP GPU] add GQA in support list for GPU backend #830

CVS-175734- [OVEP GPU] add GQA in support list for GPU backend #830

Uh oh!

Conversation

Kotomi-Du commented Oct 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description:

If feature goes to new ABI?

Jira Ticket :

Uh oh!

ankitm3k Oct 13, 2025

Choose a reason for hiding this comment

Uh oh!

Kotomi-Du Oct 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ankitm3k left a comment

Choose a reason for hiding this comment

Uh oh!

Kotomi-Du commented Oct 17, 2025

Uh oh!

MayureshV1 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Kotomi-Du commented Oct 10, 2025 •

edited

Loading