the-david-oy

David Oy the-david-oy

Achievements

triton-inference-server triton-inference-server Public

Forked from triton-inference-server/server

The Triton Inference Server provides a cloud inferencing solution optimized for NVIDIA GPUs.

C++
ai-dynamo/aiperf ai-dynamo/aiperf Public

AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solution.

Python 418 121
vllm-project/vllm vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 85.2k 18.9k
ai-dynamo/dynamo ai-dynamo/dynamo Public

A Datacenter Scale Distributed Inference Serving Framework

Rust 7.4k 1.3k