Skip to content

Conversation

memoryCoderC
Copy link
Contributor

@memoryCoderC memoryCoderC commented Sep 12, 2025

添加cli命令serve用来启动apiserve

使用方式

使用fastdeploy命令执行相关操作

  1. serve 启动API server
  2. 启动参数与之前python -m fastdeploy.entrypoints.openai.api_server参数一致

接口使用方式

fastdeploy serve  参数

示例:
fastdeploy serve --model=/root/paddlejob/ERNIE-0.3B --port=8490 --engine-worker-queue-port=8491 --metrics-port=8492 --controller-port=8493 --num-gpu-blocks-override=1000 --tensor-parallel-size=1 --max-model-len=8192 --max-num-seqs=128 --timeout-graceful-shutdown=100

参数参考
https://github.com/PaddlePaddle/FastDeploy/blob/develop/docs/zh/parameters.md

Copy link

paddle-bot bot commented Sep 12, 2025

Thanks for your contribution!

@paddle-bot paddle-bot bot added the contributor External developers label Sep 12, 2025
@CLAassistant
Copy link

CLAassistant commented Sep 18, 2025

CLA assistant check
All committers have signed the CLA.

@memoryCoderC memoryCoderC force-pushed the develop branch 2 times, most recently from 97765a4 to ef4b427 Compare September 18, 2025 10:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
contributor External developers
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants