llm-manifesto Public
Render shareable Kubernetes manifests for LLM deployments
Python · Updated Jul 1, 2026
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
claudectx Public
kubectx for AI coding agents — switch paired Claude Code + Codex CLI contexts (settings, tokens, skills, MCP servers) and translate config between them
Go · Apache License 2.0 · Updated Jun 12, 2026
flashinfer Public
Forked from flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Python · Apache License 2.0 · Updated May 15, 2026
DeepGEMM Public
Forked from deepseek-ai/DeepGEMM
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Cuda · MIT License · Updated May 1, 2026
llm-d Public
Forked from llm-d/llm-d
llm-d is a Kubernetes-native high-performance distributed LLM inference framework
Makefile · Apache License 2.0 · Updated Apr 30, 2026
DeepEP Public
Forked from deepseek-ai/DeepEP
DeepEP: an efficient expert-parallel communication library
Cuda · MIT License · Updated Apr 23, 2026
llm-d-inference-scheduler Public
Forked from llm-d/llm-d-router
Inference scheduler for llm-d
Go · Apache License 2.0 · Updated Apr 12, 2026
llmd-routing-bench Public
Benchmarking tool for the llm-d routing sidecar (P/D disaggregation overhead)
Go · Updated Mar 21, 2026
guidellm Public
Forked from vllm-project/guidellm
Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
Python · Apache License 2.0 · Updated Sep 25, 2025
llm-d-infra Public
Forked from llm-d-incubation/llm-d-infra
llm-d helm charts and deployment examples
Shell · Apache License 2.0 · Updated Aug 25, 2025
llm-d-modelservice Public
Forked from llm-d-incubation/llm-d-modelservice
Smarty · Updated Jul 20, 2025
benchmark-pod-interactive Public
Forked from robertgshaw2-redhat/benchmark-pod-interactive
Pod for benchmarking interactive in llm-d
Dockerfile · Updated Jul 15, 2025
canhazgpu Public
Forked from russellb/canhazgpu
A simple GPU reservation tool for single host shared development systems
Go · Apache License 2.0 · Updated Jul 10, 2025
ci-infra Public
Forked from vllm-project/ci-infra
This repo hosts code for vLLM CI & Performance Benchmark infrastructure.
HCL · Updated Jun 30, 2025




