-
Notifications
You must be signed in to change notification settings - Fork 2.4k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[router] improve router logs and request id header
enhancement
New feature or request
feature
router
#8415
opened Jul 27, 2025 by
slin1237
Loading…
3 of 6 tasks
fix: resolve DeepSeek-V3 accuracy drop when EP is enabled
#8412
opened Jul 27, 2025 by
juyterman1000
Loading…
6 tasks
[wip] try to fix sgl-kernel moe_align kernel hip warp scan bug
#8411
opened Jul 27, 2025 by
BBuf
Loading…
6 tasks
[EAGLE] Improve eagle drafting process to use correct KV cache in draft forward
#8409
opened Jul 27, 2025 by
yubofredwang
Loading…
2 of 6 tasks
[bugfix] remove launch_lb.py (rust_LB has been merged in sgl-router)
#8408
opened Jul 27, 2025 by
1195343015
Loading…
6 tasks done
[bugfix] Fix 2 minor bugs in the hicache storage layer
#8404
opened Jul 27, 2025 by
yapple
Loading…
1 of 6 tasks
[Model] [Draft PR] Add support for SmallThinker model series
#8399
opened Jul 27, 2025 by
SorryMaker2022
•
Draft
6 tasks
fix fp8 update_weights for block_quant
#8390
opened Jul 26, 2025 by
GuoweiWangU
Loading…
1 of 6 tasks
Support DeepEP communication for nvfp4 moe (+12% e2e)
#8376
opened Jul 26, 2025 by
fzyzcjy
Loading…
6 tasks
Update qwen3_coder_detector.py for streaming
#8371
opened Jul 26, 2025 by
maocheng23
Loading…
6 tasks
Enables force reasoning based on chat template for Qwen3-Thinking
#8369
opened Jul 25, 2025 by
JustinTong0323
Loading…
6 tasks
[Feature] Accelerate chunked prefill with persistent kernel
#8368
opened Jul 25, 2025 by
Edenzzzz
Loading…
6 tasks
Bug: Fix google gemma3n-mm audio input not working bug
#8365
opened Jul 25, 2025 by
byjiang1996
Loading…
6 tasks done
Draft: Fuse routed scaling factor into select_experts for FP4 MoE
#8364
opened Jul 25, 2025 by
trevor-m
Loading…
6 tasks
[Bugfix] Fix Llama4 Divide by Zero when interleave_moe_layer_step is zero
#8358
opened Jul 25, 2025 by
TJ5
Loading…
6 tasks
[Feature][1/N] Optimize DeepSeek's DeepEP on Ascend NPU
high priority
#8355
opened Jul 25, 2025 by
iforgetmyname
Loading…
1 of 6 tasks
perf: add ZMQ poller to reduce CPU usage in recv_requests
#8354
opened Jul 25, 2025 by
philippebourcier
Loading…
[PD] Fix abort_request for PD disaggregation
#8352
opened Jul 25, 2025 by
ShangmingCai
Loading…
6 tasks
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.