@@ -440,6 +440,56 @@ More info at:
440440https://github.com/Project-MONAI/monai-deploy-informatics-gateway/blob/develop/docs/api/rest/config.md
441441
442442
443+ ## Deploying in Google Kubernetes Engine
444+
445+ Check you have enough GPU quota in the zone you want to deploy. I have
446+ tested this in europe-west4-a and I had to request to increase the quota
447+ to 1 nvidia-tesla-a100 GPU.
448+
449+ Create a standard cluster and name it monai-deploy-1
450+
451+ > Note you could create an autopilot cluster, so that GPU nodes are
452+ deployed automatically. Be aware that autopilot needs as much quota
453+ as nodes you have multplied by the GPUs you request. With the standard
454+ autopilot cluster configuration, I was getting 11 nodes, and so it was
455+ requesting 11 GPUs, which was way higher than my quota.
456+
457+ Configure your cluster:
458+
459+ ```gcloud container clusters get-credentials monai-deploy-1```
460+
461+ Check your cluster:
462+
463+ ```kubectl cluster-info```
464+
465+ Edit the argo template and add the node selector:
466+
467+ ```
468+ metadata: {}
469+ + nodeSelector:
470+ + cloud.google.com/gke-accelerator: nvidia-tesla-a100
471+ container:
472+ ```
473+
474+ Add a node pool and inside that pool a node with 1 GPU.
475+
476+ Install drivers with:
477+
478+ ```kubectl apply -f https://raw.githubusercontent.com/GoogleCloudPlatform/container-engine-accelerators/master/nvidia-driver-installer/cos/daemonset-preloaded-latest.yaml```
479+
480+ More info at https://cloud.google.com/kubernetes-engine/docs/how-to/gpus#console
481+
482+ Then, you will need a bigger machine for the argo workflows. You need
483+ to create another pool with one node and this time you can select type
484+ e2-standard-2.
485+
486+ Now, you can run all the helm and kubectl commands above.
487+
488+ ```Warning: Using GKE with GPUs will cost you money, be careful with
489+ that. ```
490+
491+
492+
443493[0] https://drive.google.com/file/d/1d8Scm3q-kHTqr_-KfnXH0rPnCgKld2Iy/view?usp=sharing
444494a DICOM dataset that was converted to DICOM from Medical Decathlon
445495training and validation images (see https://github.com/Project-MONAI/monai-deploy/tree/main/deploy/monai-deploy-express#running-a-monai-deploy-workflow)
0 commit comments