
How can I switch to a local LLM engine? #43

@oppokui

Description


In the Chat UI, there is a long list of LLM models. The default is GPT-3.5 Turbo, which I assume is OpenAI.
I configured the OpenAI API key in .env, and it seems to be used, since the answers come back very fast.

When I try to switch to Llama 7B, it reports:

An error occurred while generating text: Model llama-7b-GGML is currently booting.

I set up another LLM engine, vLLM, serving the llama-2-7b-chat model, and exposed it on port 3000; it is compatible with the OpenAI API.
How can I configure Chat UI to use this new engine?
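
Since vLLM is OpenAI-API compatible, something like the sketch below should reach it directly by overriding the client's base URL (the URL, model name, and placeholder key are assumptions from my own setup, not anything this project documents). I'm looking for the equivalent configuration inside Chat UI:

```python
# Minimal sketch: vLLM serves an OpenAI-compatible API, so a standard
# OpenAI client can talk to it once the base URL is overridden.
# The URL, model name, and dummy key are assumptions from my setup.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:3000/v1",  # the vLLM server exposed on port 3000
    api_key="EMPTY",                      # vLLM does not check the key by default
)

response = client.chat.completions.create(
    model="llama-2-7b-chat",  # must match the model name the vLLM server reports
    messages=[{"role": "user", "content": "Hello, are you running locally?"}],
)
print(response.choices[0].message.content)
```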
