Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Bugfix][CPU][RISC-V] Fix VLEN detection for RVV attention path bug Something isn't working ci/build cpu Related to CPU backends v1
#47532 opened Jul 3, 2026 by I3eg1nner Loading…
[Rust Frontend] Bump llm-multimodal version ready ONLY add when PR is ready to merge/full CI is needed rust
#47530 opened Jul 3, 2026 by Isotr0py Member Loading…
1 of 4 tasks
[Frontend] Limit SO_REUSEPORT to multi-worker serving frontend
#47529 opened Jul 3, 2026 by BugenZhao Member Loading…
4 tasks
fix: return SSE content type from NIXL toy proxy kv-connector v1
#47526 opened Jul 3, 2026 by Spycsh Loading…
3 of 4 tasks
Add assertion for group_size and BLOCK_K consistency
#47525 opened Jul 3, 2026 by hnhyzz Loading…
4 tasks
[MRV2] Draw 64-bit uniforms for fp32 Gumbel sampling ready ONLY add when PR is ready to merge/full CI is needed v1
#47524 opened Jul 3, 2026 by WoosukKwon Collaborator Loading…
[Rust Frontend] Speed up chat roundtrip tests ready ONLY add when PR is ready to merge/full CI is needed rust
#47523 opened Jul 3, 2026 by BugenZhao Member Loading…
4 tasks
[Quantization][INC] Support INT2 XPU Linear intel-gpu Related to Intel GPU
#47521 opened Jul 3, 2026 by Zhenzhong1 Contributor Loading…
[ROCm][CI] Fix Kernels and Kernels attention test failures rocm Related to AMD ROCm
#47519 opened Jul 3, 2026 by cpersson-amd Loading…
4 tasks done
[ROCm][DSV4] Enable fused AITER mHC post+pre kernel for decode rocm Related to AMD ROCm
#47518 opened Jul 3, 2026 by Fangzhou-Ai Contributor Draft
[XPU][UT]fix _POSSIBLE_KERNELS error on XPU intel-gpu Related to Intel GPU
#47516 opened Jul 3, 2026 by Yejing-Lai Contributor Loading…
[Quantization][INC]Add MXFP8 Linear Support
#47514 opened Jul 3, 2026 by Zhenzhong1 Contributor Loading…
DFlash SWA — resolved for personal build ci/build cpu Related to CPU backends deepseek Related to DeepSeek models documentation Improvements or additions to documentation frontend needs-rebase nvidia qwen Related to Qwen models rocm Related to AMD ROCm speculative-decoding v1
#47511 opened Jul 3, 2026 by sdh2749 Draft
[Bugfix] Gemma-4 k_eq_v x compressed-tensors: propagate shard aliases bug Something isn't working
#47507 opened Jul 3, 2026 by soaringk Contributor Loading…
2 tasks done
[Bugfix][Tool Parser] deepseek_v3: accept optional newline before JSON arguments bug Something isn't working deepseek Related to DeepSeek models tool-calling
#47503 opened Jul 3, 2026 by weizhoublue Contributor Loading…
4 tasks
[Rust Frontend] add gigachat3 tool parser rust
#47501 opened Jul 3, 2026 by yangyang-cs95 Contributor Loading…
1 of 2 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.