Commit 680dee6

update orchestrator model name
1 parent 48bf83f commit 680dee6

3 files changed

Lines changed: 9 additions & 7 deletions

demos/agent_orchestration/travel_agents/README.md

Lines changed: 1 addition & 1 deletion
@@ -141,7 +141,7 @@ vllm serve katanemo/Plano-Orchestrator-4B \
  --gpu-memory-utilization 0.3 \
  --tokenizer katanemo/Plano-Orchestrator-4B \
  --chat-template chat_template.jinja \
- --served-model-name Plano-Orchestrator \
+ --served-model-name katanemo/Plano-Orchestrator-4B \
  --enable-prefix-caching
  ```

demos/agent_orchestration/travel_agents/config_local_orchestrator.yaml

Lines changed: 2 additions & 2 deletions
@@ -1,7 +1,7 @@
  version: v0.3.0

  overrides:
-   orchestrator_model: plano/Plano-Orchestrator
+   orchestrator_model: plano/katanemo/Plano-Orchestrator-4B

  agents:
    - id: weather_agent
@@ -10,7 +10,7 @@ agents:
      url: http://localhost:10520

  model_providers:
-   - model: plano/Plano-Orchestrator
+   - model: plano/katanemo/Plano-Orchestrator-4B
      base_url: http://localhost:8000

    - model: openai/gpt-5.2

docs/source/guides/orchestration.rst

Lines changed: 6 additions & 4 deletions
@@ -379,7 +379,7 @@ Using vLLM
  --gpu-memory-utilization 0.3 \
  --tokenizer katanemo/Plano-Orchestrator-4B \
  --chat-template chat_template.jinja \
- --served-model-name Plano-Orchestrator \
+ --served-model-name katanemo/Plano-Orchestrator-4B \
  --enable-prefix-caching

  For the 30B-A3B-FP8 model (production):
@@ -394,18 +394,20 @@ Using vLLM
  --tokenizer katanemo/Plano-Orchestrator-30B-A3B-FP8 \
  --chat-template chat_template.jinja \
  --max-model-len 32768 \
- --served-model-name Plano-Orchestrator \
+ --served-model-name katanemo/Plano-Orchestrator-30B-A3B-FP8 \
  --enable-prefix-caching

  4. **Configure Plano to use the local orchestrator**

+ Use the model name matching your ``--served-model-name``:
+
  .. code-block:: yaml

  overrides:
-   orchestrator_model: plano/Plano-Orchestrator
+   orchestrator_model: plano/katanemo/Plano-Orchestrator-4B

  model_providers:
-   - model: plano/Plano-Orchestrator
+   - model: plano/katanemo/Plano-Orchestrator-4B
      base_url: http://<your-server-ip>:8000

  5. **Verify the server is running**
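
The change above only works if the name in the Plano config and the name vLLM serves stay in sync. Below is a minimal sketch of that check. It assumes the `plano/` prefix in the config is a provider prefix, so the remainder must equal `--served-model-name`. That mapping is inferred from this diff, not from documented Plano behavior. The helper names are hypothetical. The `/v1/models` endpoint is the model listing route of vLLM's OpenAI-compatible server.

```python
import json
import urllib.request

# Assumption: "plano/" is a provider prefix that is dropped before the
# name is sent to the server (inferred from the config change above).
PLANO_PREFIX = "plano/"


def expected_served_name(config_model):
    """Return the name the server should advertise for a config entry."""
    if config_model.startswith(PLANO_PREFIX):
        return config_model[len(PLANO_PREFIX):]
    return config_model


def served_model_ids(models_payload):
    """Extract model ids from an OpenAI-style /v1/models response body."""
    return [entry["id"] for entry in models_payload.get("data", [])]


def name_matches(config_model, models_payload):
    """True if the config entry corresponds to a model the server lists."""
    return expected_served_name(config_model) in served_model_ids(models_payload)


def fetch_models(base_url):
    """GET <base_url>/v1/models and decode the JSON body (needs a live server)."""
    url = base_url.rstrip("/") + "/v1/models"
    with urllib.request.urlopen(url, timeout=5) as resp:
        return json.load(resp)
```

With a server running, `name_matches("plano/katanemo/Plano-Orchestrator-4B", fetch_models("http://localhost:8000"))` should hold after this commit, while the old `plano/Plano-Orchestrator` entry would no longer match.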
