Johnson
@Johnsonms
flash-attention Maintainer | CUTLASS | SGLang kernel contributor | HPC, C++, CUDA, LLM training & inference
Language Breakdown
Lines of code distribution across 3 owned repositories
I-Shaped Developer
I-shaped specialist: deep expertise in Python
Repos: 42
PRs: 0
Growth: +18%
Top Repositories
My personal repository
A Quirky Assortment of CuTe Kernels
FlashInfer: Kernel Library for LLM Serving
Fast and memory-efficient exact attention
CUDA Templates and Python DSLs for High-Performance Linear Algebra
SGLang is a fast serving framework for large language models and vision language models.
Kubernetes operator for managing the lifecycle of PaddlePaddle job.
A high-throughput and memory-efficient inference and serving engine for LLMs
A PyTorch native platform for training generative AI models
Open Source Impact
Contributions to external projects