Skip to content
View JensenFire's full-sized avatar
  • MM
  • Beijing

Block or report JensenFire

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. vllm_flash_attn vllm_flash_attn Public

    C++ 1

  2. fast.cu fast.cu Public

    Forked from pranjalssh/fast.cu

    Fastest kernels written from scratch

    Cuda 1

  3. vllm vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python

  4. aotriton aotriton Public

    Forked from ROCm/aotriton

    Ahead of Time (AOT) Triton Math Library

    Python

  5. CUDA-Learn-Notes CUDA-Learn-Notes Public

    Forked from xlite-dev/LeetCUDA

    📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

    Cuda

  6. Self-learning-Computer-Science Self-learning-Computer-Science Public

    Forked from PKUFlyingPig/Self-learning-Computer-Science

    the resources I use to learn computer science in my spare time