Building LLM inference engines, open to work | 正在手搓大模型框架,找工作中
-
找工作中 (Looking for opportunities)
- 深圳 (Shenzhen)
- https://orcid.org/0009-0005-7807-4020
Pinned Loading
-
femto-vllm
femto-vllm PublicPersonal practice repo for understanding LLM inference. Learning vLLM mechanics and CUDA kernels.
Jupyter Notebook
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.