Skip to content

[MLAS][KleidiAI] Apply fp16 split independent fixes#28720

Open
Laan33 wants to merge 1 commit into
microsoft:mainfrom
Laan33:fp16-split/01-independent-fixes-squashed
Open

[MLAS][KleidiAI] Apply fp16 split independent fixes#28720
Laan33 wants to merge 1 commit into
microsoft:mainfrom
Laan33:fp16-split/01-independent-fixes-squashed

Conversation

@Laan33
Copy link
Copy Markdown
Contributor

@Laan33 Laan33 commented May 29, 2026

Description

This PR splits out a small set of independent fixes and test-scope updates from the larger CPU fp16 enablement work.

Changes:

  • Adds KleidiAI Conv2D regression coverage for LHS indirection cache behaviour when the per-thread padding buffer grows.
  • Runs that regression in both short and long Conv2D test paths when KleidiAI convolution is available.
  • Disables WebGPU for the fp16 Attention test case that uses mask_index, which WebGPU Attention does not support.
  • Renames the dynamic QGEMM batch-count parameter from BatchSize to BatchN for consistency with MLAS batch naming.

Motivation and Context

These changes are independent from the wider fp16 MLAS/API and CPU EP enablement work in #28487.

The Conv2D regression exercises the small-CI -> large-CI -> small-CI sequence that can expose stale cached LHS pointer tables after the KleidiAI padding buffer grows. The Attention test exclusion keeps the test targeted to providers that support the tested mask_index input combination.

Note: This PR is the first offshoot from #28487.

Squash the fp16 split independent fixes into a single IP review patch while preserving the original branch history separately.

Signed-off-by: Cathal Lawlor <cathal.lawlor@arm.com>
@Laan33 Laan33 changed the title Apply fp16 split independent fixes [MLAS][KleidiAI] Apply fp16 split independent fixes May 29, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant