Pull requests: sgl-project/sglang

Pull requests list

Warn when Dumper may capture CUDA graph outputs
Labels: documentation (Improvements or additions to documentation)
#30028 opened Jul 3, 2026 by feichai0017
Add deterministic inference for eagle parity test
#30026 opened Jul 3, 2026 by ANSHUMAN87 Contributor
perf: reorder DSA indexer dual-stream ops to avoid CUDA graph stream explosion
#30025 opened Jul 3, 2026 by kpham-sgl Collaborator
3 tasks
[tracing] sglang tracing v2: support exporting tracing data asynchronously
Labels: documentation (Improvements or additions to documentation)
#30023 opened Jul 3, 2026 by sufeng-buaa Collaborator
4 of 5 tasks
fix: serialize FanOutCommunicator queueing calls with a lock
#30022 opened Jul 3, 2026 by lyang24
5 tasks
[CI] Add GLM52 NVFP4 MTP B200 tests
Labels: blackwell (SM100/SM120)
#30021 opened Jul 3, 2026 by Fridge003 Collaborator Draft
[codex] Support CUDA 12.2 source builds
Labels: blackwell (SM100/SM120), jit-kernel, npu, quant (LLM Quantization), sgl-kernel
#30020 opened Jul 3, 2026 by BBuf Collaborator Draft
[MPS] Fix diffusion output stability
Labels: diffusion (SGLang Diffusion)
#30017 opened Jul 3, 2026 by mickqian Collaborator Draft
[diffusion] feat: performance_mode=speed enables torch.compile by default
Labels: diffusion (SGLang Diffusion), run-ci
#30016 opened Jul 3, 2026 by mickqian Collaborator
[DSv4] Use BF16 instead of FP32 for indexer score computation
#30012 opened Jul 3, 2026 by TTThanos Contributor
5 tasks
refactor: make time_stats msgpack-native
#30005 opened Jul 3, 2026 by oleksii-tumanov Contributor
5 tasks done
[diffusion] feat: per-layer TP shard planner for DiT linears (--dit-tp-plan)
Labels: diffusion (SGLang Diffusion)
#30004 opened Jul 3, 2026 by mickqian Collaborator
fix(mimo-vl): pass padded_context_dim to Qwen2_5_VisionPatchMerger
#29994 opened Jul 3, 2026 by alisonshao Collaborator
2 of 3 tasks
FlashInfer Backend for MXFP8 Grouped Quantization
Labels: documentation (Improvements or additions to documentation), quant (LLM Quantization), sgl-kernel
#29992 opened Jul 3, 2026 by philipphack
5 tasks done
[docs] Multi-node deployment: add PD disaggregation and Apptainer examples for SLURM
Labels: documentation (Improvements or additions to documentation)
#29991 opened Jul 3, 2026 by davislx
3 of 5 tasks