Skip to content
View tianmu-li's full-sized avatar

Block or report tianmu-li

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. bitsandbytes bitsandbytes Public

    Forked from bitsandbytes-foundation/bitsandbytes

    8-bit CUDA functions for PyTorch

    Python

  2. vllm-hpu-extension vllm-hpu-extension Public

    Forked from HabanaAI/vllm-hpu-extension

    Python

  3. neural-compressor neural-compressor Public

    Forked from intel/neural-compressor

    SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

    Python

  4. optimum-habana optimum-habana Public

    Forked from huggingface/optimum-habana

    Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)

    Python

  5. vllm-gaudi vllm-gaudi Public

    Forked from vllm-project/vllm-gaudi

    Community maintained hardware plugin for vLLM on Intel Gaudi

    Python

  6. vllm vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python