Prevent complex<Half> accumulation overflow in sum_functor specialization #2308

yucai-intel · 2025-11-07T06:21:47Z

Problem Description:
The XPU backend did not correctly enable the high-precision accumulator rule when processing the sum reduction of complexHalf, causing premature overflow of low-precision accumulators in the kernel's internal accumulation operation.

Specific Modifications and Effects:

Modify the reduce_dispatch function: The original scheduler logic omitted explicit high-precision boosting for ComplexHalf. The fix forces the acc_t to be boosted to complex at the scheduling entry point, ensuring the correct precision is used for reduction.
Modify the sum_functor specialization: The kernel's underlying implementation may incorrectly downgrade the accumulator type to the low-precision out_t. The fix forces the out_t parameter of gpu_reduce_kernel to be boosted to complex to avoid the type downgrading bug within the kernel.

yucai-intel · 2025-11-07T06:34:14Z

For #2008

yucai-intel added 3 commits November 7, 2025 14:17

Update ReduceSumProdKernels.cpp

351e470

Update Reduce.h

e01e0b3

format

c8a1ddd

yucai-intel mentioned this pull request Nov 7, 2025

reduction got accuracy issue on large tensor cases #2008

Open

yucai-intel changed the title ~~Fix complex32 reduce presicion error~~ Prevent complex<Half> accumulation overflow in sum_functor specialization Nov 7, 2025

Update Reduce.h

b2ccf33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Prevent complex<Half> accumulation overflow in sum_functor specialization #2308

Prevent complex<Half> accumulation overflow in sum_functor specialization #2308

Uh oh!

yucai-intel commented Nov 7, 2025 •

edited

Loading

Uh oh!

yucai-intel commented Nov 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Prevent complex<Half> accumulation overflow in sum_functor specialization #2308

Are you sure you want to change the base?

Prevent complex<Half> accumulation overflow in sum_functor specialization #2308

Uh oh!

Conversation

yucai-intel commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yucai-intel commented Nov 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

yucai-intel commented Nov 7, 2025 •

edited

Loading