File tree Expand file tree Collapse file tree
demos/agent_orchestration/travel_agents Expand file tree Collapse file tree Original file line number Diff line number Diff line change @@ -141,7 +141,7 @@ vllm serve katanemo/Plano-Orchestrator-4B \
141141 --gpu-memory-utilization 0.3 \
142142 --tokenizer katanemo/Plano-Orchestrator-4B \
143143 --chat-template chat_template.jinja \
144- --served-model-name Plano-Orchestrator \
144+ --served-model-name katanemo/ Plano-Orchestrator-4B \
145145 --enable-prefix-caching
146146```
147147
Original file line number Diff line number Diff line change 11version : v0.3.0
22
33overrides :
4- orchestrator_model : plano/Plano-Orchestrator
4+ orchestrator_model : plano/katanemo/ Plano-Orchestrator-4B
55
66agents :
77 - id : weather_agent
@@ -10,7 +10,7 @@ agents:
1010 url : http://localhost:10520
1111
1212model_providers :
13- - model : plano/Plano-Orchestrator
13+ - model : plano/katanemo/ Plano-Orchestrator-4B
1414 base_url : http://localhost:8000
1515
1616 - model : openai/gpt-5.2
Original file line number Diff line number Diff line change @@ -379,7 +379,7 @@ Using vLLM
379379 --gpu-memory-utilization 0.3 \
380380 --tokenizer katanemo/Plano-Orchestrator-4B \
381381 --chat-template chat_template.jinja \
382- --served-model-name Plano-Orchestrator \
382+ --served-model-name katanemo/ Plano-Orchestrator-4B \
383383 --enable-prefix-caching
384384
385385 For the 30B-A3B-FP8 model (production):
@@ -394,18 +394,20 @@ Using vLLM
394394 --tokenizer katanemo/Plano-Orchestrator-30B-A3B-FP8 \
395395 --chat-template chat_template.jinja \
396396 --max-model-len 32768 \
397- --served-model-name Plano-Orchestrator \
397+ --served-model-name katanemo/ Plano-Orchestrator-30B-A3B-FP8 \
398398 --enable-prefix-caching
399399
400400 4. **Configure Plano to use the local orchestrator **
401401
402+ Use the model name matching your ``--served-model-name ``:
403+
402404 .. code-block :: yaml
403405
404406 overrides :
405- orchestrator_model : plano/Plano-Orchestrator
407+ orchestrator_model : plano/katanemo/ Plano-Orchestrator-4B
406408
407409 model_providers :
408- - model : plano/Plano-Orchestrator
410+ - model : plano/katanemo/ Plano-Orchestrator-4B
409411 base_url : http://<your-server-ip>:8000
410412
411413 5. **Verify the server is running **
You can’t perform that action at this time.
0 commit comments