xe: jit: gemm: add DG2+Xe2 2nd token dynamic quant strategies #2788

Simonsays095 · 2025-02-28T23:59:28Z

Addresses MFDNN-12420 by adding a few dynamic quantization kernels that cover 2nd token cases (1xK:KxN) for both symmetric (s8) and asymmetric (u8) src quantization.
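For readers unfamiliar with the terms, here is a minimal reference sketch of what symmetric (s8) and asymmetric (u8) dynamic quantization of a 1xK src row means numerically, followed by the 1xK:KxN product. It is not the oneDNN JIT kernel or its API. The function names are hypothetical, the weights are assumed to be already-quantized integers with a scale of 1, and the per-row scale and zero-point formulas are one common choice rather than something stated in this PR.

```python
# Illustrative reference only; names and formulas are assumptions, not oneDNN code.

def quantize_symmetric_s8(src):
    """Quantize one src row to s8 with a zero point fixed at 0."""
    amax = max(abs(v) for v in src)
    scale = amax / 127.0 if amax > 0 else 1.0
    q = [max(-127, min(127, round(v / scale))) for v in src]
    return q, scale, 0


def quantize_asymmetric_u8(src):
    """Quantize one src row to u8 with a computed zero point."""
    lo = min(min(src), 0.0)  # keep 0.0 representable
    hi = max(max(src), 0.0)
    scale = (hi - lo) / 255.0 if hi > lo else 1.0
    zp = round(-lo / scale)
    q = [max(0, min(255, round(v / scale) + zp)) for v in src]
    return q, scale, zp


def gemv_dequant(q_src, scale, zp, weights):
    """(1xK) x (KxN): integer accumulation, then apply the src scale."""
    k, n = len(weights), len(weights[0])
    out = []
    for j in range(n):
        acc = sum((q_src[i] - zp) * weights[i][j] for i in range(k))
        out.append(acc * scale)
    return out


if __name__ == "__main__":
    src = [0.5, -1.0, 0.25, 1.0]                    # 1xK, K = 4
    weights = [[1, -2], [3, 4], [-5, 6], [7, 8]]    # KxN, N = 2
    for quantize in (quantize_symmetric_s8, quantize_asymmetric_u8):
        q, scale, zp = quantize(src)
        print(quantize.__name__, q, zp, gemv_dequant(q, scale, zp, weights))
```

The only difference between the two paths is the zero point: the s8 path keeps it at 0, while the u8 path subtracts a nonzero zero point from each src element before accumulation.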

Simonsays095 · 2025-03-01T00:09:58Z

make test
disable test_device_cpu
disable build_cpu_runtime_omp
disable build_cpu_runtime_sycl
disable build_cpu_runtime_tbb
disable arch_gpu_xe-hpc
enable arch_gpu_xe-hpg-dg2
disable arch_gpu_xe-lp
enable arch_gpu_xe-lpg+
enable arch_gpu_xe2-lpg

Simonsays095 · 2025-03-01T01:11:57Z

make test perf-gpu
set primitive=matmul ip
disable arch_gpu_xe-hpc
disable arch_gpu_xe3-lpg

Simonsays095 requested a review from a team as a code owner February 28, 2025 23:59

github-actions bot added the platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel label Feb 28, 2025

Simonsays095 mentioned this pull request Mar 1, 2025

[Backport rls-v3.8-pc] xe: jit: gemm: add DG2+Xe2 2nd token dynamic quant strategies #2789

Merged

Simonsays095 force-pushed the optimize_dq_DG2_BMG branch from 96d684d to beea80f Compare March 1, 2025 00:06

kealan-barbieri approved these changes Mar 1, 2025

xe: jit: gemm: add DG2+Xe2 2nd token dynamic quant strategies

f78be4e

Simonsays095 force-pushed the optimize_dq_DG2_BMG branch from beea80f to f78be4e Compare March 3, 2025 18:07

Simonsays095 mentioned this pull request Mar 3, 2025

[Backport v3.7] xe: jit: gemm: add DG2+Xe2 2nd token dynamic quant strategies #2800

Merged

hidefromkgb approved these changes Mar 3, 2025

skazakov1 approved these changes Mar 3, 2025

Simonsays095 merged commit 3ebdf16 into uxlfoundation:main Mar 3, 2025
8 checks passed
