Commit 457e72c

v1.1.1
1 parent a5def7c commit 457e72c

1 file changed: +2 -2 lines changed

README.md

Lines changed: 2 additions & 2 deletions
@@ -62,7 +62,7 @@ For a detailed starting guide, please see the [Quick Tour](https://huggingface.c
 model=tiiuae/falcon-7b-instruct
 volume=$PWD/data # share a volume with the Docker container to avoid downloading weights every run
 
-docker run --gpus all --shm-size 1g -p 8080:80 -v $volume:/data ghcr.io/huggingface/text-generation-inference:1.1.0 --model-id $model
+docker run --gpus all --shm-size 1g -p 8080:80 -v $volume:/data ghcr.io/huggingface/text-generation-inference:1.1.1 --model-id $model
 ```
 
 And then you can make requests like
@@ -104,7 +104,7 @@ model=meta-llama/Llama-2-7b-chat-hf
 volume=$PWD/data # share a volume with the Docker container to avoid downloading weights every run
 token=<your cli READ token>
 
-docker run --gpus all --shm-size 1g -e HUGGING_FACE_HUB_TOKEN=$token -p 8080:80 -v $volume:/data ghcr.io/huggingface/text-generation-inference:1.1.0 --model-id $model
+docker run --gpus all --shm-size 1g -e HUGGING_FACE_HUB_TOKEN=$token -p 8080:80 -v $volume:/data ghcr.io/huggingface/text-generation-inference:1.1.1 --model-id $model
 ```
 
 ### A note on Shared Memory (shm)
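
Both hunks make the same change: the image tag on `ghcr.io/huggingface/text-generation-inference` moves from `1.1.0` to `1.1.1`. A bump like this could be scripted roughly as below. This is only a sketch, not the project's actual release tooling: the helper name `bump_tgi_tag` and its argument order are hypothetical, and it assumes a POSIX shell with `sed`.

```shell
# Hypothetical helper (not part of the repository): rewrite the TGI image tag
# in a file from an old version to a new one.
# Usage: bump_tgi_tag FILE OLD_VERSION NEW_VERSION
bump_tgi_tag() {
  file=$1; old=$2; new=$3
  image='ghcr.io/huggingface/text-generation-inference'
  # Escape dots so sed matches the image name and old version literally.
  pattern=$(printf '%s' "${image}:${old}" | sed 's/\./\\./g')
  # Write to a temporary file first, then replace the original on success.
  sed "s|${pattern}|${image}:${new}|g" "$file" > "${file}.tmp" && mv "${file}.tmp" "$file"
}
```

For this commit the call would be `bump_tgi_tag README.md 1.1.0 1.1.1`. Matching on the full image name rather than the bare version string avoids touching unrelated occurrences of `1.1.0` elsewhere in the file.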

0 commit comments
