It would be nice to have a simple example of this proxying to Ollama instead of OpenAI. Ollama itself could run as a pod or as an external service (e.g. in k3s, pointing back to the Docker host). If the example serves the model itself, we could use a small model like qwen2.5:0.5b to keep inference lag down.
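As a rough sketch of what the client side of such an example could look like: Ollama exposes an OpenAI-compatible API under `/v1`, so the same request shape works whether the proxy forwards to OpenAI or to Ollama. The base URL here is an assumption — in-cluster it would be the Service DNS name, and from k3s back to the Docker host it might be something like `http://host.docker.internal:11434`.

```python
import json
import urllib.request

# Assumed endpoint; swap in the cluster Service name or the Docker
# host address depending on how Ollama is deployed.
OLLAMA_BASE_URL = "http://localhost:11434"


def build_chat_request(prompt: str, model: str = "qwen2.5:0.5b"):
    """Build an OpenAI-compatible chat completion request for Ollama.

    Returns the target URL and the JSON-encoded request body.
    """
    url = f"{OLLAMA_BASE_URL}/v1/chat/completions"
    payload = {
        "model": model,  # a small model keeps inference lag down
        "messages": [{"role": "user", "content": prompt}],
    }
    return url, json.dumps(payload).encode("utf-8")


def send_chat_request(prompt: str):
    """Send the request (requires a running Ollama instance)."""
    url, body = build_chat_request(prompt)
    req = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

Because the request shape is the standard OpenAI chat-completions payload, an existing example built against OpenAI should only need its base URL (and model name) changed to target Ollama.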