Skip to content

Pull requests: flashinfer-ai/flashinfer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[wip] re-enable torch.compile
#1576 opened Aug 26, 2025 by guilhermeleobas Draft
4 of 5 tasks
misc: remove some unused files
#1574 opened Aug 26, 2025 by yzh119 Loading…
5 tasks
update trtllm-gen fp4 autotuner
#1573 opened Aug 25, 2025 by IwakuraRein Loading…
3 of 5 tasks
Support input and output signals for cutedsl gemm
#1569 opened Aug 25, 2025 by fzyzcjy Draft
5 tasks
feat: add lse return on trtllm-gen attention
#1566 opened Aug 24, 2025 by yyihuang Draft
5 tasks
[DRAFT] feat: support radix-based top-k sampling algorithm
#1561 opened Aug 24, 2025 by JasonJ2021 Loading…
4 of 5 tasks
[DRAFT] feat: add support of fp4_batched_quantize
#1552 opened Aug 23, 2025 by yicwang Loading…
5 tasks
Add mnnvl_moe_alltoallv_prepare_without_allgather
#1550 opened Aug 22, 2025 by trevor-m Loading…
3 tasks done
Fix autotuner for trtllm fp4 fused moe
#1548 opened Aug 22, 2025 by stslxg-nv Draft
4 of 5 tasks
Remove version limit of cuda-python
#1534 opened Aug 21, 2025 by VALLIS-NERIA Loading…
5 tasks
Auto tuning cutedsl gemm
#1527 opened Aug 21, 2025 by fzyzcjy Loading…
5 tasks
feat: integrate xqa attention backend
#1503 opened Aug 18, 2025 by qsang-nv Loading…
3 of 5 tasks
Trtllm-gen Fp8 MoE Autotunner
#1494 opened Aug 15, 2025 by aleozlx Draft
5 tasks done
feat: Centralize env vars in flashinfer.json with caching
#1487 opened Aug 14, 2025 by yongwww Loading…
5 tasks
feat(attention): add RoPE offset support for batch prefill
#1457 opened Aug 11, 2025 by MengAiDev Loading…
3 tasks done
benchmark: add allreduce_fusion benchmark
#1450 opened Aug 10, 2025 by yyihuang Draft
5 tasks
refactor: unify autotuner for fp4 gemm backends
#1439 opened Aug 8, 2025 by ttyio Loading…
3 of 5 tasks
Restore llama4 fc2 required kernels
#1417 opened Aug 8, 2025 by aleozlx Loading…
5 tasks done
Removes MPI dependency from MNNVL AllReduce
#1379 opened Aug 4, 2025 by pranavm-nvidia Loading…
5 tasks
feat: Support sliding window for persistent kernel
#1368 opened Aug 3, 2025 by Edenzzzz Loading…
5 tasks
ProTip! Filter pull requests by the default branch with base:main.