-
Notifications
You must be signed in to change notification settings - Fork 3.1k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat(example/fastapi): support --startup-timeout using Qwen3-Next-80B-A3B-Instruct as example
#11710
opened Oct 16, 2025 by
Kindyaa
Loading…
4 tasks
[Fix] fix type issue of env flag value MODELOPT_MAX_TOKENS_PER_EXPERT
#11709
opened Oct 16, 2025 by
zejunchen-zejun
Loading…
Support running FP4 Deepseek on SM120.
#11708
opened Oct 16, 2025 by
weireweire
Loading…
2 of 4 tasks
[sgl-kernel] enhance sgl-kernel import logic for sm8x
run-ci
#11707
opened Oct 16, 2025 by
FlamingoPg
Loading…
1 of 4 tasks
[quantization][MoE] fix the check for
tp_size
/ moe_ep_size
/ moe_intermediate_size
/ weight_block_size_n
run-ci
#11702
opened Oct 16, 2025 by
kevin85421
Loading…
1 of 4 tasks
[Test] support llm-compressor: w8a8_fp8_block, wNa16
#11701
opened Oct 16, 2025 by
Wangzheee
Loading…
4 tasks
[router] Add Configurable L0 and L1 Tokenizer Caching
enhancement
New feature or request
router
router-benchmark
run-ci
#11688
opened Oct 16, 2025 by
slin1237
Loading…
2 of 4 tasks
[router] fix get_models endpoint for openai router
run-ci
#11687
opened Oct 16, 2025 by
key4ng
Loading…
4 tasks
[Lint] Add
python/sglang
to ruff F401 checks and remove unused imports in files
run-ci
#11685
opened Oct 15, 2025 by
CatherineSue
Loading…
1 of 4 tasks
wip: Remove redundant fill_(0) in dp_scatter
run-ci
#11683
opened Oct 15, 2025 by
ch-wan
Loading…
4 tasks
[Router] Refactor protocol definitions: split spec.rs into modular files
run-ci
#11677
opened Oct 15, 2025 by
key4ng
Loading…
4 tasks
[Bug fix] fix Qwen3-VL dense model launch failure caused by rotary-embedding
#11675
opened Oct 15, 2025 by
coco-alen
Loading…
4 tasks
feat: return partial generation results when aborting requests in waiting queue
run-ci
#11673
opened Oct 15, 2025 by
guoyuhong
Loading…
1 of 4 tasks
[2/2] [feature] support openai like classification api in router
run-ci
#11670
opened Oct 15, 2025 by
whybeyoung
Loading…
[RL] support weight updation with dp attention
run-ci
#11669
opened Oct 15, 2025 by
zhuzilin
Loading…
1 of 4 tasks
Manually flip deepep_mode for cuda_graph
run-ci
#11666
opened Oct 15, 2025 by
zhuzilin
Loading…
1 of 4 tasks
WIP: Use trtllm_mla decode kernel for draft extend in speculative decoding
run-ci
#11664
opened Oct 15, 2025 by
Qiaolin-Yu
Loading…
4 tasks
fix: bench_serving error with PD disaggregation
run-ci
#11662
opened Oct 15, 2025 by
Yi-sir
Loading…
4 tasks
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.