shen-shanshan/README.md

🤖 About me

I'm currently a senior software engineer at AMD ROCm (previously at Huawei Ascend), building the vLLM inference engine for the GPU/NPU software ecosystem, with a focus on multi-modality, structured output, and out-of-tree (OOT) hardware extensibility.

Pinned

  1. vllm-project/vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python · 85.2k stars · 18.9k forks

  2. vllm-project/vllm-ascend Public

    Community-maintained hardware plugin for vLLM on Ascend

    C++ · 2.3k stars · 1.5k forks

  3. cs-self-learning Public

    This repo archives my notes, code, and materials from CS self-study.

    Jupyter Notebook · 92 stars · 3 forks

  4. vllm-dev-skills Public

    A curated collection of Claude Code agent skills that accelerate the entire vLLM development lifecycle.

    Python · 10 stars · 2 forks