Skip to content

Conversation

qti-mattsinc
Copy link
Contributor

Description

  • Transposes are inserted for Softmax with axis != output_rank-1 for the HTP backend.
  • The GPU backend also has this requirement on the axis param, so this change enables the layout transformation for the GPU as well.

Motivation and Context

  • Enables more models with GPU backend.

- Transposes are inserted for Softmax with axis != output_rank-1
  for the HTP backend.
- The GPU backend also has this requirement on the axis param, so
  this change enables the layout transformation for the GPU as well.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant