Initial Inference Extension Plugin #10684
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
Adds initial support for an inference extension endpoint picker plugin. The plugin will:
API changes
N/A
Code changes
CI changes
N/A
Docs changes
Godocs added throughout. User docs will be added in a future PR.
Context
Supports #10411
Interesting decisions
To keep the PR small, this is the first of multiple PRs to implement the endpoint picker plugin. This PR _does not create the ext-prc cluster nor does it add the ext-proc filter to the listeners filter chain.
Testing steps
Unit tests were added. e2e tests are still required and not included here due to the size of the PR.
Notes for reviewers
Refer to the upstream docs for add'l context.
Checklist: