chfeng-cs

Follow

💬

All In AI

Ethan Feng chfeng-cs

💬

All In AI

Follow

Focused on LLM inference: engines, kernels, and systems. Open to AI Infra opportunities.

9 followers · 19 following

Alibaba
Shanghai Jiao Tong University
15:19 (UTC +08:00)

Achievements

Achievements

chfeng-cs/README.md

Ethan Feng

Infrastructure engineer focused on LLM inference systems.

M.S. Computer Science — Shanghai Jiao Tong University
B.S. Computer Science — Harbin Institute of Technology
2 yrs at Alibaba

Focus Areas: LLM Inference / GPU Performance

Open Source

Currently contributing to vllm — KV cache transfer, scheduler optimization, and hybrid KV cache management (HMA).

See detail at my vllm contributions

Contact

📫 ethan.fengch [at] gmail [dot] com

Pinned Loading

sglang sglang Public

Forked from sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
flashinfer flashinfer Public

Forked from flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Python
vllm-contributions vllm-contributions Public

Python
TensorRT-LLM TensorRT-LLM Public

Forked from NVIDIA/TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python