Add subgroup matrix multiplication #80

junjihashimoto · 2025-09-05T04:21:55Z

Add a shader for subgroup matrix multiplication.

$ sysctl -n machdep.cpu.brand_string
Apple M4 Max

$ for i in 11 12 13 ; do MATMUL_VERSION=$i ./build/matmul  | grep 'Dispatching\|FLOPS' ; done
# Without subgroupMatrix (f16) 
[info] Dispatching Kernel version 11: f16: 2D blocktiling with loop unrolling, vectorization and transpose, 30 iterations ...
25.8 milliseconds / dispatch ~ 10640.35 GFLOPS
# With subgroupMatrix (f16) 
[info] Dispatching Kernel version 12: f16: Subgroup matrix multiply with transpose, 30 iterations ...
20.7 milliseconds / dispatch ~ 13299.45 GFLOPS
# With subgroupMatrix (f32) 
[info] Dispatching Kernel version 13: f32: Subgroup matrix multiply with transpose (default), 30 iterations ...
24.6 milliseconds / dispatch ~ 11185.63 GFLOPS
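To make the comparison above easier to read, here is a minimal sketch that derives relative speedups from the reported GFLOPS figures and cross-checks them against the usual `2 * M * K * N` FLOP count for a matrix multiply. The problem size `M, K, N = 4096, 4096, 8192` is an assumption (the PR does not state it), chosen only because it is roughly consistent with the reported numbers. The helper names `gflops` and `speedup` are hypothetical and not part of the repository.

```python
# Reported (milliseconds per dispatch, GFLOPS) per kernel version,
# copied from the benchmark output above.
REPORTED = {
    11: (25.8, 10640.35),  # f16, without subgroupMatrix
    12: (20.7, 13299.45),  # f16, with subgroupMatrix
    13: (24.6, 11185.63),  # f32, with subgroupMatrix
}

# ASSUMPTION: the matrix dimensions are not given in the PR.
# These values are illustrative only.
M, K, N = 4096, 4096, 8192


def gflops(m: int, k: int, n: int, ms_per_dispatch: float) -> float:
    """Throughput of an m x k by k x n matmul, counting 2*m*k*n FLOPs."""
    flops = 2.0 * m * k * n
    seconds = ms_per_dispatch * 1e-3
    return flops / seconds / 1e9


def speedup(baseline: int, candidate: int) -> float:
    """Ratio of reported GFLOPS: candidate version over baseline version."""
    return REPORTED[candidate][1] / REPORTED[baseline][1]


if __name__ == "__main__":
    for version, (ms, reported) in REPORTED.items():
        estimate = gflops(M, K, N, ms)
        print(f"v{version}: reported {reported:.2f} GFLOPS, "
              f"estimated {estimate:.2f} GFLOPS from {ms} ms")
    print(f"v12 vs v11 speedup: {speedup(11, 12):.3f}x")
    print(f"v13 vs v11 speedup: {speedup(11, 13):.3f}x")
```

Because the printed dispatch times are rounded to 0.1 ms, the estimate only approximates the reported figures. By the reported GFLOPS, the f16 subgroup-matrix kernel is about 25% faster than the f16 blocktiling kernel on this machine.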

junjihashimoto · 2025-09-05T18:55:17Z

~~The main branch does not seem to output any shader compilation errors.~~

junjihashimoto · 2025-10-13T06:14:28Z

cmake/dawn.cmake

  # Ensure source present on required commit (idempotent remote setup)
  if(NOT DEFINED DAWN_COMMIT OR DAWN_COMMIT STREQUAL "")
-    set(DAWN_COMMIT "e1d6e12337080cf9f6d8726209e86df449bc6e9a" CACHE STRING "Dawn commit to checkout" FORCE)
+    set(DAWN_COMMIT "3f79f3aefe0b0a498002564fcfb13eb21ab6c047" CACHE STRING "Dawn commit to checkout" FORCE)


google/dawn@d7d27a6
This Dawn change is required to set the subgroup size to 32 on macOS.

junjihashimoto force-pushed the feature/matmul branch from 4d5d20b to 21b2f6f Compare September 22, 2025 09:32

junjihashimoto changed the base branch from main to dev September 22, 2025 09:33

junjihashimoto force-pushed the feature/matmul branch from 21b2f6f to 3165df5 Compare September 22, 2025 09:34

Bump dawn to use subgroupSize == 32 on macos

3d6e51c

junjihashimoto force-pushed the feature/matmul branch from ffbb983 to 149a961 Compare October 13, 2025 06:05

junjihashimoto marked this pull request as ready for review October 13, 2025 06:14

Add the SubgroupMatrixMultiply shader

17614f7

junjihashimoto force-pushed the feature/matmul branch from 149a961 to 17614f7 Compare October 13, 2025 06:19
