arpera

Follow

Artem Perevedentsev arpera

Follow

Achievements

Achievements

Popular repositories Loading

vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
flashinfer flashinfer Public

Forked from flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Python
srt-slurm srt-slurm Public

Forked from NVIDIA/srt-slurm

NVIDIA Inference Benchmarks provide recipes in ready-to-use templates for evaluating platform speed. Validate your platform across specific AI use cases across hardware and software combinations.

Python