the-david-oy

Pinned

  1. triton-inference-server Public

    Forked from triton-inference-server/server

    The Triton Inference Server provides a cloud inferencing solution optimized for NVIDIA GPUs.

    C++

  2. ai-dynamo/aiperf Public

    AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solution.

    Python, 418 stars, 121 forks

  3. vllm-project/vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python, 85.2k stars, 18.9k forks

  4. ai-dynamo/dynamo Public

    A Datacenter Scale Distributed Inference Serving Framework

    Rust, 7.4k stars, 1.3k forks