JensenFire

Focusing on interesting things

4 followers · 5 following

MM
Beijing

Popular repositories

vllm_flash_attn Public

C++ 1

fast.cu Public

Forked from pranjalssh/fast.cu

Fastest kernels written from scratch

Cuda 1

vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python

aotriton Public

Forked from ROCm/aotriton

Ahead of Time (AOT) Triton Math Library

Python

CUDA-Learn-Notes Public

Forked from xlite-dev/LeetCUDA

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda

Self-learning-Computer-Science Public

Forked from PKUFlyingPig/Self-learning-Computer-Science

The resources I use to learn computer science in my spare time