xe: jit: gemm: add DG2+Xe2 2nd token dynamic quant strategies #2788

Merged: 1 commit, Mar 3, 2025

Simonsays095
Contributor

Addresses MFDNN-12420, adding a few dynamic quantization kernels to cover 2nd token cases (1xK:KxN) for both symmetric (s8) and asymmetric (u8) src quantization.
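
For readers unfamiliar with the terms, here is a minimal pure-Python sketch of what symmetric (s8) and asymmetric (u8) dynamic quantization of a single 1xK source row might look like, followed by a 1xK:KxN product that undoes the quantization. It is only an illustration of the general technique, not the JIT kernels or strategies added in this PR. All function names are hypothetical, and the weights are assumed to be plain integers with an implicit scale of 1 to keep the example short.

```python
def quantize_symmetric_s8(row):
    """Symmetric: zero point is fixed at 0, values map to [-127, 127]."""
    max_abs = max(abs(v) for v in row)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = [max(-127, min(127, round(v / scale))) for v in row]
    return q, scale


def quantize_asymmetric_u8(row):
    """Asymmetric: a zero point shifts the range so values map to [0, 255]."""
    lo = min(min(row), 0.0)  # include 0 so that it is exactly representable
    hi = max(max(row), 0.0)
    scale = (hi - lo) / 255.0 if hi > lo else 1.0
    zero_point = round(-lo / scale)
    q = [max(0, min(255, round(v / scale) + zero_point)) for v in row]
    return q, scale, zero_point


def gemv_dequant(q_src, scale, zero_point, weights):
    """(1xK) x (KxN): integer accumulation, then one scale per output."""
    k, n = len(q_src), len(weights[0])
    out = []
    for j in range(n):
        acc = sum((q_src[i] - zero_point) * weights[i][j] for i in range(k))
        out.append(acc * scale)
    return out


# Hypothetical data: K = 4, N = 2.
src = [0.5, -1.0, 0.25, 2.0]
weights = [[1, 2], [3, -4], [5, 6], [-7, 8]]

q_s8, scale_s8 = quantize_symmetric_s8(src)
out_s8 = gemv_dequant(q_s8, scale_s8, 0, weights)

q_u8, scale_u8, zp_u8 = quantize_asymmetric_u8(src)
out_u8 = gemv_dequant(q_u8, scale_u8, zp_u8, weights)
```

"Dynamic" here means the scale (and zero point) are computed from the source data at run time rather than fixed ahead of time. Both results should land close to the float reference `[-15.25, 22.5]`, with a small rounding error from the 8-bit representation.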

@Simonsays095
Contributor Author

make test
disable test_device_cpu
disable build_cpu_runtime_omp
disable build_cpu_runtime_sycl
disable build_cpu_runtime_tbb
disable arch_gpu_xe-hpc
enable arch_gpu_xe-hpg-dg2
disable arch_gpu_xe-lp
enable arch_gpu_xe-lpg+
enable arch_gpu_xe2-lpg

@Simonsays095
Contributor Author

make test perf-gpu
set primitive=matmul ip
disable arch_gpu_xe-hpc
disable arch_gpu_xe3-lpg

@Simonsays095 Simonsays095 merged commit 3ebdf16 into uxlfoundation:main Mar 3, 2025
8 checks passed
Labels
platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel
4 participants