@@ -62,7 +62,7 @@ For a detailed starting guide, please see the [Quick Tour](https://huggingface.c
 model=tiiuae/falcon-7b-instruct
 volume=$PWD/data # share a volume with the Docker container to avoid downloading weights every run
 
-docker run --gpus all --shm-size 1g -p 8080:80 -v $volume:/data ghcr.io/huggingface/text-generation-inference:1.1.0 --model-id $model
+docker run --gpus all --shm-size 1g -p 8080:80 -v $volume:/data ghcr.io/huggingface/text-generation-inference:1.1.1 --model-id $model
 ```
 
 And then you can make requests like
@@ -104,7 +104,7 @@ model=meta-llama/Llama-2-7b-chat-hf
 volume=$PWD/data # share a volume with the Docker container to avoid downloading weights every run
 token=<your cli READ token>
 
-docker run --gpus all --shm-size 1g -e HUGGING_FACE_HUB_TOKEN=$token -p 8080:80 -v $volume:/data ghcr.io/huggingface/text-generation-inference:1.1.0 --model-id $model
+docker run --gpus all --shm-size 1g -e HUGGING_FACE_HUB_TOKEN=$token -p 8080:80 -v $volume:/data ghcr.io/huggingface/text-generation-inference:1.1.1 --model-id $model
 ```
 
 ### A note on Shared Memory (shm)