layernorm: enlarge the range for 2-pass reduction #2282

weishi-deng · 2025-11-04T06:30:36Z

From the OOB models, there are some shapes still below the performance expectation with large M but small N.
Simple shapes:
[128, 197, 384]
[64,784, 256]
[(64, 28, 28, 256]
[256, 197, 256]
[128, 196, 384]
After enlarging the range for 2-pass reduction, these models can benefit an average of 10-20ms model execution time and optimize the geomean performance of eager training in timm models from 0.835 to 0.842.

src/ATen/native/xpu/sycl/LayerNormKernels.cpp

EikanWang · 2025-11-05T08:06:17Z

src/ATen/native/xpu/sycl/LayerNormKernels.cpp

-      N / 32 < syclGpuEuCount() / syclGpuEUCountPerSubslice() / 2) {
+  int subslice_count = syclGpuEuCount() / syclGpuEUCountPerSubslice();
+  if (use_two_stage_col_reduction && M > subslice_count * 1024 &&
+      N / 32 < subslice_count) {


How could we correlate N / 32 < subslice_count to the performance insights? Does the 32 mean simd32?

Co-authored-by: Eikan Wang <[email protected]>

enlarge the range for 2-pass reduction

9403eed

weishi-deng requested a review from jianyizh November 4, 2025 06:30

update config logic

d072252

EikanWang reviewed Nov 5, 2025

View reviewed changes

weishi-deng and others added 2 commits November 5, 2025 16:30

Update src/ATen/native/xpu/sycl/LayerNormKernels.cpp

98ccf83

Co-authored-by: Eikan Wang <[email protected]>

update condition for 2 pass

7f6f9a8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

layernorm: enlarge the range for 2-pass reduction #2282

layernorm: enlarge the range for 2-pass reduction #2282

Uh oh!

weishi-deng commented Nov 4, 2025 •

edited

Loading

Uh oh!

Uh oh!

EikanWang Nov 5, 2025 •

edited

Loading

Uh oh!

weishi-deng Nov 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

layernorm: enlarge the range for 2-pass reduction #2282

Are you sure you want to change the base?

layernorm: enlarge the range for 2-pass reduction #2282

Uh oh!

Conversation

weishi-deng commented Nov 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

EikanWang Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

weishi-deng Nov 5, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

weishi-deng commented Nov 4, 2025 •

edited

Loading

EikanWang Nov 5, 2025 •

edited

Loading