Skip to content

JingliangGao/vllm-edge

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

vllm-edge

The edge-based vLLM framework, which is suitable for devices with limited video memory, low communication bandwidth, and weak computing ability.

install

git clone https://github.com/JingliangGao/vllm-edge.git
cd vllm-edge/
chmod +x ./build-for-debug.sh && ./build-for-debug.sh

download model

modelscope download --model Qwen/Qwen3-0.6B

run

cd vllm_edge/examples/
python3  qwen3_inference.py

About

The edge-based vLLM framework, which is suitable for devices with limited video memory, low communication bandwidth, and weak computing ablity.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors