PatrykSaffer

PatrykSaffer

Achievements

cuda-learning cuda-learning Public

Forked from Infatoshi/cuda-course

Cuda
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
DeepGEMM DeepGEMM Public

Forked from deepseek-ai/DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda
runai-model-streamer runai-model-streamer Public

Forked from run-ai/runai-model-streamer

C++
flash-attention flash-attention Public

Forked from vllm-project/flash-attention

Fast and memory-efficient exact attention

Python
flashinfer flashinfer Public

Forked from flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Python