Feature request description
The current inference engine spec assumes the container entry point is a shell, and that the "binary" field holds the command line the shell should run.
However, in many cases (e.g. the upstream llama.cpp images) the entry point of the container image is the binary itself, so arguments are appended directly rather than passed through a shell.
The inference engine spec should allow both scenarios to be defined.
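As a rough illustration only (the field names here are assumptions, not part of the current spec), one way to express the two scenarios might be a flag indicating whether the image's entry point is the binary:

```yaml
# Scenario 1: entry point is a shell; "binary" is the full command line run by the shell
engine:
  binary: "llama-server --model /models/model.gguf --port 8080"
  entrypoint_is_binary: false   # hypothetical field, not in the current spec

# Scenario 2: entry point is the binary itself (e.g. upstream llama.cpp images);
# only the arguments are supplied
engine:
  args: ["--model", "/models/model.gguf", "--port", "8080"]
  entrypoint_is_binary: true    # hypothetical field, not in the current spec
```

This mirrors the Docker/OCI distinction between ENTRYPOINT and CMD, where arguments are appended to the entry point rather than interpreted by a shell.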
Suggest potential solution
No response
Have you considered any alternatives?
No response
Additional context
No response