Pull requests: meta-pytorch/tritonbench
Update Blackwell attention Triton Bench
cla signed
fb-exported
meta-exported
#416
opened Sep 12, 2025 by
henrylhtsang
adding swa to tritonbench
cla signed
fb-exported
meta-exported
#415
opened Sep 11, 2025 by
henrylhtsang
[do_bench][easy] warmup cudagraph mode in do_bench_profiler
cla signed
#411
opened Sep 10, 2025 by
BoyuanFeng
Prototype FP8 Blackwell persistent + TMA kernel with warp specialization
cla signed
fb-exported
#385
opened Sep 3, 2025 by
jananisriram
[WIP][FA][Blackwell] Implementation with explicit data partitioning
cla signed
#384
opened Sep 2, 2025 by
manman-ren
adding arguments to add_benchmark to match registry
cla signed
fb-exported
#381
opened Sep 2, 2025 by
adamomainz
Add cutlass decode kernel to TritonBench
cla signed
fb-exported
#376
opened Aug 28, 2025 by
Aya-ZIbra
Improve TritonParse Log Organization and Force Overwrite
cla signed
#357
opened Aug 26, 2025 by
FindHao
Validate exhaustive autotuning for FP8 Inductor templates
cla signed
fb-exported
#355
opened Aug 25, 2025 by
jananisriram
[DO NOT LAND] Try always enabling cuda graph
cla signed
#348
opened Aug 21, 2025 by
xuzhao9
Fixing naming convention of gemms
cla signed
fb-exported
#342
opened Aug 20, 2025 by
adamomainz
Add amax as default per-row scaling factor for fp8_gemm benchmark
cla signed
fb-exported
#341
opened Aug 20, 2025 by
jananisriram
Move scaling logic to input generation
cla signed
fb-exported
#338
opened Aug 20, 2025 by
jananisriram
Add benchmarking on shapes from CSV files to fp8_gemm
cla signed
fb-exported
#332
opened Aug 18, 2025 by
jananisriram
Fix input to TritonSplitK performance benc
cla signed
fb-exported
#323
opened Aug 1, 2025 by
Aya-ZIbra
Add TLX attention (WS pipelined pingpong hopper)
cla signed
#320
opened Jul 31, 2025 by
yf225
Allow TMA benchmarks for flex-attention kernel
cla signed
fb-exported
#225
opened May 15, 2025 by
mandroid6