Skip to content
View amd-callumm's full-sized avatar

Block or report amd-callumm

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. llama.cpp llama.cpp Public

    Forked from ggml-org/llama.cpp

    LLM inference in C/C++

    C++

  2. flash-attention flash-attention Public

    Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    Python

  3. aotriton aotriton Public

    Forked from ROCm/aotriton

    Ahead of Time (AOT) Triton Math Library

    Python