GPU-Optimization-for-LLM-Inference

This is a short course covering GPU optimization techniques for LLM inference.