Skip to content

ArmDeveloperEcosystem/workshop-ai-gke

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Scaling LLM Deployments with Arm and Google Kubernetes Engine

This repository contains the code and resources for the "Scaling LLM deployments with Arm and Google Kubernetes Engine" workshop, available on Qwiklabs.

Feel free to open issues or pull requests if you have questions or suggestions.

Workshop Overview

In this workshop, you will learn how to deploy and scale Large Language Models (LLMs) using Arm-based nodes on Google Kubernetes Engine (GKE). The hands-on labs guide you through setting up your environment, deploying models, and optimizing for performance and cost.

Workshop Link

Access the workshop on Qwiklabs: Workshop Link

About

A workshop for building out a AI application using Google Kubernetes Engine

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published