-
Notifications
You must be signed in to change notification settings - Fork 457
Pull requests: flashinfer-ai/flashinfer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[DRAFT] feat: support radix-based top-k sampling algorithm
#1561
opened Aug 24, 2025 by
JasonJ2021
Loading…
4 of 5 tasks
[DRAFT] feat: add support of fp4_batched_quantize
#1552
opened Aug 23, 2025 by
yicwang
Loading…
5 tasks
Add mnnvl_moe_alltoallv_prepare_without_allgather
#1550
opened Aug 22, 2025 by
trevor-m
Loading…
3 tasks done
tests(attn): add short-seq CUDA edge-case test (qo_len=1) for prefill
#1515
opened Aug 19, 2025 by
PrithviElancherran
Loading…
3 tasks done
feat: Centralize env vars in flashinfer.json with caching
#1487
opened Aug 14, 2025 by
yongwww
Loading…
5 tasks
feat(attention): add RoPE offset support for batch prefill
#1457
opened Aug 11, 2025 by
MengAiDev
Loading…
3 tasks done
feat: enable trtllm-gen attn speculative decoding verify by decode
priority: high
#1453
opened Aug 11, 2025 by
yyihuang
Loading…
5 tasks
refactor: unify autotuner for fp4 gemm backends
#1439
opened Aug 8, 2025 by
ttyio
Loading…
3 of 5 tasks
misc: Customize kv lens buffer size for sparse attention
#1383
opened Aug 5, 2025 by
Edenzzzz
Loading…
5 tasks
Removes MPI dependency from MNNVL AllReduce
#1379
opened Aug 4, 2025 by
pranavm-nvidia
Loading…
5 tasks
feat: Support sliding window for persistent kernel
#1368
opened Aug 3, 2025 by
Edenzzzz
Loading…
5 tasks
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.