Fixed RoPE scaling extension factors buffer size for partial_rotary_factor #337
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Phi-4 was causing runtime errors in TVM due to its partial_rotary_factor of 0.75, which resulted in a mismatch between the expected ext_factor dimensions and the actual. Changed ext_factors buffer size from head_dim//2 to rotary_dim//2 to align with RoPE implementation. This resolves TVMError: Assert fail for fused_rope_longrope_scaling_ext_factors_handle. Tested other models to ensure this didn't break support.