Skip to content
View zianglih's full-sized avatar
🌰
🌰

Block or report zianglih

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
zianglih/README.md

Hi there 👋

I am 栗子昂, an MTS @ humans& ai working on the full-stack of LLM performance engineering. I graduated from University of Michigan, Ann Arbor (BS), after I transfered and spent 2 years at Chinese University of Hong Kong, Shenzhen.

Some of the places I previously worked/interned at:

  • NVIDIA GPU architecture simulation team
  • NVIDIA DevTech Compute team
  • Google Gemini GPU performance team
  • Samsung OpenCL compute team

I used to enjoy cycle-level extreme GPU kernel performance optimization but I no longer consider it an important problem. My work has shfited more into low-precision numerics and model co-design.

Pinned Loading

  1. sgl-project/sglang sgl-project/sglang Public

    SGLang is a high-performance serving framework for large language models and multimodal models.

    Python 25.7k 5.3k

  2. NVIDIA/TransformerEngine NVIDIA/TransformerEngine Public

    A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

    Python 3.3k 691

  3. radixark/miles radixark/miles Public

    Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

    Python 1.1k 151

  4. flashinfer-ai/flashinfer flashinfer-ai/flashinfer Public

    FlashInfer: Kernel Library for LLM Serving

    Python 5.4k 891