
[DSV4][XPU] Pass gemm1_clamp_limit to XpuFusedMoe#44517

Merged
jikunshang merged 3 commits into vllm-project:main from majian4work:dsv4-pr6-moe-clamp
Jun 22, 2026

@majian4work

Contributor

Summary

Pass quant_config.gemm1_clamp_limit to XpuFusedMoe so that the SwiGLU clamp limit is applied during MoE expert computation on XPU.
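
For readers unfamiliar with the clamp, here is a minimal pure-Python sketch of what a clamped SwiGLU activation could look like between the two expert GEMMs. It is illustrative only: the function name `clamped_swiglu`, the parameter `clamp_limit`, and the exact clamping rule (gate capped from above, up projection clamped symmetrically) are assumptions, not the XPU kernel's actual implementation, which this PR does not show.

```python
import math
from typing import List, Optional, Sequence


def _sigmoid(x: float) -> float:
    """Numerically stable logistic sigmoid."""
    if x >= 0.0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def clamped_swiglu(
    gate: Sequence[float],
    up: Sequence[float],
    clamp_limit: Optional[float] = None,  # hypothetical stand-in for gemm1_clamp_limit
) -> List[float]:
    """Apply SwiGLU to the gate/up halves of a first-GEMM output.

    Assumed clamping rule (not confirmed by the PR): the gate is capped
    at +clamp_limit and the up projection is clamped to
    [-clamp_limit, +clamp_limit]. With clamp_limit=None no clamping occurs.
    """
    if len(gate) != len(up):
        raise ValueError("gate and up must have the same length")
    out = []
    for g, u in zip(gate, up):
        if clamp_limit is not None:
            g = min(g, clamp_limit)
            u = max(-clamp_limit, min(u, clamp_limit))
        out.append(g * _sigmoid(g) * u)  # SiLU(gate) * up
    return out


unclamped = clamped_swiglu([10.0], [10.0])
clamped = clamped_swiglu([10.0], [10.0], clamp_limit=1.0)
```

In this sketch, omitting the limit leaves large activations untouched, which is the kind of mismatch that could arise if a backend ignored the configured value.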

Dependencies

This PR depends on:

@mergify mergify Bot added the intel-gpu (Related to Intel GPU) and v1 labels Jun 4, 2026
@mergify

mergify Bot commented Jun 4, 2026

Contributor

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @majian4work.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

Signed-off-by: Ma Jian <jian1.ma@intel.com>
@mergify mergify Bot removed the needs-rebase label Jun 10, 2026
@majian4work majian4work marked this pull request as ready for review June 22, 2026 02:32
@jikunshang jikunshang added the ready label (ONLY add when PR is ready to merge/full CI is needed) Jun 22, 2026
@jikunshang jikunshang merged commit 9037498 into vllm-project:main Jun 22, 2026
91 checks passed
tunglinwood pushed a commit to tunglinwood/vllm that referenced this pull request Jun 22, 2026
nkzhenhua pushed a commit to nkzhenhua/vllm that referenced this pull request Jun 24, 2026
qli88 pushed a commit to qli88/vllm that referenced this pull request Jun 26, 2026
Signed-off-by: Ma Jian <jian1.ma@intel.com>
Signed-off-by: Qiang Li <qiang.li2@amd.com>

Labels

intel-gpu (Related to Intel GPU), ready (ONLY add when PR is ready to merge/full CI is needed), v1

2 participants