[DSV4][XPU] Add MHC fused_post_pre support by majian4work · Pull Request #44144 · vllm-project/vllm

majian4work · 2026-06-01T00:58:32Z

Summary

Add MHCFusedPostPreOp XPU support for DeepSeek-V4 on Intel XPU, enabling the fused MHC post+pre path in the decoder loop (matching the AMD/CUDA pattern).

Changes

vllm/model_executor/layers/mhc.py: Implement forward_native for MHCFusedPostPreOp (decomposes into mhc_post_torch + mhc_pre_torch); add forward_xpu delegating to forward_native.
vllm/models/deepseek_v4/xpu/model.py: Update decoder loop to use fused MHC path (first layer → standalone hc_pre, middle layers → mhc_fused_post_pre, explicit hc_post after loop). Add weight loading guards for truncated model testing.

Dependencies

⚠️ This PR depends on #42953 being merged first.

PR #42953 introduces the XPU attention decode path (dsv4-pr4-attention-decode) which this PR builds upon.

mergify · 2026-06-04T16:11:26Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @majian4work.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

claude

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

majian4work · 2026-06-08T08:45:15Z

@jikunshang @xinyu-intel @wuxun-zhang Please help to take a review， thanks.

wuxun-zhang

LGTM

mergify · 2026-06-09T06:05:15Z

Hi @majian4work, the pre-commit checks have failed. Please run:

uv pip install pre-commit>=4.5.1
pre-commit install
pre-commit run --all-files

Then, commit the changes and push to your branch.

For future commits, pre-commit will run automatically on changed files before each commit.

- Add forward_xpu to MHCFusedPostPreOp (decomposes into mhc_post_torch + mhc_pre_torch) - Update XPU model forward to use fused MHC path (matching AMD pattern): first layer uses standalone hc_pre, middle layers use mhc_fused_post_pre - Add explicit hc_post after decoder loop Signed-off-by: Ma Jian <jian1.ma@intel.com>

Signed-off-by: Ma Jian <jian1.ma@intel.com> Signed-off-by: Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com>

Signed-off-by: Ma Jian <jian1.ma@intel.com> Signed-off-by: Waqar Ahmed <waqar.ahmed@amd.com>

Signed-off-by: Ma Jian <jian1.ma@intel.com>

Signed-off-by: Ma Jian <jian1.ma@intel.com> Signed-off-by: divineearthly <divineearthly@gmail.com>

Signed-off-by: Ma Jian <jian1.ma@intel.com>

mergify Bot added intel-gpu Related to Intel GPU v1 labels Jun 1, 2026

mergify Bot added the needs-rebase label Jun 4, 2026

zyongye mentioned this pull request Jun 5, 2026

[Bugfix][Kernel] Fix mHC fused-RMSNorm big-fuse miscompile for hidden_size != 4096 #44692

Merged

majian4work force-pushed the dsv4-pr5-mhc-fused-post-pre branch from ed73f8c to 4d2a1c7 Compare June 8, 2026 05:51

majian4work marked this pull request as ready for review June 8, 2026 05:52

majian4work requested a review from zyongye as a code owner June 8, 2026 05:52

claude Bot reviewed Jun 8, 2026

View reviewed changes

mergify Bot removed the needs-rebase label Jun 8, 2026

jikunshang approved these changes Jun 9, 2026

View reviewed changes

jikunshang added the verified Run pre-commit for new contributors without triggering other tests label Jun 9, 2026

wuxun-zhang approved these changes Jun 9, 2026

View reviewed changes

majian4work force-pushed the dsv4-pr5-mhc-fused-post-pre branch from 4d2a1c7 to 1fab1f0 Compare June 9, 2026 06:13

jikunshang added the ready ONLY add when PR is ready to merge/full CI is needed label Jun 9, 2026

jikunshang merged commit 70db148 into vllm-project:main Jun 9, 2026
69 of 70 checks passed

ekagra-ranjan pushed a commit to ekagra-ranjan/vllm that referenced this pull request Jun 9, 2026

[DSV4][XPU] Add MHC fused_post_pre support (vllm-project#44144)

7751a71

Signed-off-by: Ma Jian <jian1.ma@intel.com> Signed-off-by: Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com>

waqahmed-amd-fi pushed a commit to waqahmed-amd-fi/vllm that referenced this pull request Jun 10, 2026

[DSV4][XPU] Add MHC fused_post_pre support (vllm-project#44144)

7e46bee

Signed-off-by: Ma Jian <jian1.ma@intel.com> Signed-off-by: Waqar Ahmed <waqar.ahmed@amd.com>

Saddss pushed a commit to Saddss/vllm that referenced this pull request Jun 14, 2026

[DSV4][XPU] Add MHC fused_post_pre support (vllm-project#44144)

c9d99f7

Signed-off-by: Ma Jian <jian1.ma@intel.com>

vivek8123 pushed a commit to odh-on-pz/vllm-upstream that referenced this pull request Jun 18, 2026

[DSV4][XPU] Add MHC fused_post_pre support (vllm-project#44144)

7470349

Signed-off-by: Ma Jian <jian1.ma@intel.com>

divineearthly pushed a commit to divineearthly/vllm that referenced this pull request Jun 19, 2026

[DSV4][XPU] Add MHC fused_post_pre support (vllm-project#44144)

0da766b

Signed-off-by: Ma Jian <jian1.ma@intel.com> Signed-off-by: divineearthly <divineearthly@gmail.com>

tunglinwood pushed a commit to tunglinwood/vllm that referenced this pull request Jun 22, 2026

[DSV4][XPU] Add MHC fused_post_pre support (vllm-project#44144)

649c4bc

Signed-off-by: Ma Jian <jian1.ma@intel.com>

nkzhenhua pushed a commit to nkzhenhua/vllm that referenced this pull request Jun 24, 2026

[DSV4][XPU] Add MHC fused_post_pre support (vllm-project#44144)

d87d647

Signed-off-by: Ma Jian <jian1.ma@intel.com>

ohsono pushed a commit to ohsono/vllm that referenced this pull request Jul 3, 2026

[DSV4][XPU] Add MHC fused_post_pre support (vllm-project#44144)

6a83281

Signed-off-by: Ma Jian <jian1.ma@intel.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[DSV4][XPU] Add MHC fused_post_pre support#44144

[DSV4][XPU] Add MHC fused_post_pre support#44144
jikunshang merged 1 commit into
vllm-project:mainfrom
majian4work:dsv4-pr5-mhc-fused-post-pre

majian4work commented Jun 1, 2026

mergify Bot commented Jun 4, 2026

claude Bot left a comment

majian4work commented Jun 8, 2026

wuxun-zhang left a comment

mergify Bot commented Jun 9, 2026

Uh oh!

Labels

3 participants

Uh oh!

Uh oh!

Conversation

majian4work commented Jun 1, 2026

Summary

Changes

Dependencies

mergify Bot commented Jun 4, 2026

claude Bot left a comment

Choose a reason for hiding this comment

Claude Code Review

majian4work commented Jun 8, 2026

wuxun-zhang left a comment

Choose a reason for hiding this comment

mergify Bot commented Jun 9, 2026

Uh oh!

Labels

3 participants