ovep stateful: Enable explicit slice of prefill logits when NPUW_SLICE_OUT is disabled #850

RyanMetcalfeInt8 · 2025-11-14T17:02:44Z

This PR adds some necessary changes to support whisper decoder via NPUW stateful flow.

The whisper decoder has slightly different behavior than LLM decoders with NPU, in that the prefill logits are not already sliced (which is the assumption made by ORT GenAI, for which this pipeline is supported through).

Ref ticket: CVS-176474

…E_OUT is disabled

Copilot

Pull Request Overview

This PR enables explicit slicing of prefill logits in the OVEP stateful flow when NPUW_SLICE_OUT is disabled, specifically to support whisper decoder behavior where prefill logits are not pre-sliced by NPU.

Key changes:

Added logic to detect when NPU logit slicing is required based on NPUW_SLICE_OUT property
Implemented GetTensor override in StatefulOVInferRequest to slice logits tensor when needed
Made GetTensor virtual in base OVInferRequest class to enable override

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

File	Description
ov_interface.h	Added virtual GetTensor method and NPU logits slice detection members to support stateful request overrides
ov_interface.cc	Implemented NPUW_SLICE_OUT property checking and logits slicing logic in GetTensor override

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

onnxruntime/core/providers/openvino/ov_interface.cc

Co-authored-by: Copilot <[email protected]>

ovep stateful: Enable explicit slice of prefill logits when NPUW_SLIC…

6c46cf3

…E_OUT is disabled

RyanMetcalfeInt8 requested a review from Copilot November 14, 2025 17:02

Copilot AI reviewed Nov 14, 2025

View reviewed changes

onnxruntime/core/providers/openvino/ov_interface.cc Outdated Show resolved Hide resolved

Update onnxruntime/core/providers/openvino/ov_interface.cc

768e443

Co-authored-by: Copilot <[email protected]>

RyanMetcalfeInt8 requested a review from MayureshV1 November 14, 2025 17:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ovep stateful: Enable explicit slice of prefill logits when NPUW_SLICE_OUT is disabled #850

ovep stateful: Enable explicit slice of prefill logits when NPUW_SLICE_OUT is disabled #850

Uh oh!

RyanMetcalfeInt8 commented Nov 14, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ovep stateful: Enable explicit slice of prefill logits when NPUW_SLICE_OUT is disabled #850

Are you sure you want to change the base?

ovep stateful: Enable explicit slice of prefill logits when NPUW_SLICE_OUT is disabled #850

Uh oh!

Conversation

RyanMetcalfeInt8 commented Nov 14, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant