-
Notifications
You must be signed in to change notification settings - Fork 51
Open
Description
In Chat UI, there is a long list of LLM model. The default one is GPT 3.5 Turbo, which is openAI as I guess.
I configure openAI api key in .env, so it should be used, as the answer is very fast.
When I try to switch it to Llama 7B, it report:
An error occurred while generating text: Model llama-7b-GGML is currently booting.
I setup another llm engine "vllm" based on llama-2-7b-chat model, and expose in port 3000, it is compatible with openAI API.
how can I configure it to use this new engine?
Metadata
Metadata
Assignees
Labels
No labels