Callum Mitchell amd-callumm

Popular repositories

llama.cpp Public

Forked from ggml-org/llama.cpp

LLM inference in C/C++

C++

flash-attention Public

Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python

aotriton Public

Forked from ROCm/aotriton

Ahead of Time (AOT) Triton Math Library

Python