Implement FP32 kleidiai Gemv #26302

JonathanC-ARM · 2025-10-14T15:45:44Z

Description

Implementation of special sgemm path which uses GEMV kernels in cases where M or N are 1

Additionally this pr introduces the usage of a microkernel interface which utilizes typedef's provided by KleidiAI such that we can simplify the code and remove things such as ternary operations for SME1 vs SME2 kernels

Indicative Performance

In Lieu of any production models where gemv was a large contributor of the network. I opted to create a mini model to test which contains thousands of randomized matmul variants. With a distribution of GEMV cases throughout

Using onnxruntime perf test I was able to half the total inference time vs mlas with this model

More Benchmarks to come shortly

Signed-off-by: Jonathan Clohessy <[email protected]>

hariharans29 · 2025-10-14T19:50:14Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2025-10-14T19:50:34Z

Azure Pipelines successfully started running 4 pipeline(s).

hariharans29 · 2025-10-16T17:10:08Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2025-10-16T17:10:27Z

Azure Pipelines successfully started running 4 pipeline(s).

Implement FP32 kleidiai Gemv

f201162

Signed-off-by: Jonathan Clohessy <[email protected]>

Merge branch 'microsoft:main' into jclohess_kleidiai_gemv_implementation

46c9c14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement FP32 kleidiai Gemv #26302

Implement FP32 kleidiai Gemv #26302

Uh oh!

JonathanC-ARM commented Oct 14, 2025

Uh oh!

hariharans29 commented Oct 14, 2025

Uh oh!

azure-pipelines bot commented Oct 14, 2025

Uh oh!

hariharans29 commented Oct 16, 2025

Uh oh!

azure-pipelines bot commented Oct 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Implement FP32 kleidiai Gemv #26302

Are you sure you want to change the base?

Implement FP32 kleidiai Gemv #26302

Uh oh!

Conversation

JonathanC-ARM commented Oct 14, 2025

Description

Indicative Performance

Uh oh!

hariharans29 commented Oct 14, 2025

Uh oh!

azure-pipelines bot commented Oct 14, 2025

Uh oh!

hariharans29 commented Oct 16, 2025

Uh oh!

azure-pipelines bot commented Oct 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants