Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cpu: x64: jit_binary_kernel: pass src2_dt unconditionally #2783

Merged
merged 1 commit into from
Mar 1, 2025

Conversation

dzarukin
Copy link
Contributor

@dzarukin dzarukin commented Feb 28, 2025

cl --version
Microsoft (R) C/C++ Optimizing Compiler Version 19.29.30158 for x64

This version of compiler for C++17 (C++11 is fine) will generate wrong code for non-ternary algorithms due to the branch in the constructor. It inverts objects that must be created and must not:

io_=0000028F599311D8
io_.at(conf_.src0_type)=0000000000000000
io_.at(conf_.src1_type)=0000000000000000
io_.at(conf_.src2_type)=0000028F59929C00
io_.at(conf_.dst_type)=0000000000000000

The solution is to pass src2_type unconditionally, access to such io_injector must be regulated by checking for is_ternary().

@dzarukin dzarukin requested a review from a team as a code owner February 28, 2025 20:22
@github-actions github-actions bot added platform:cpu-x64 Intel64/AMD64 processors. Codeowner: @oneapi-src/onednn-cpu-x64 backport labels Feb 28, 2025
@dzarukin
Copy link
Contributor Author

make test
disable test_device_gpu
disable build_gpu_runtime_ocl
disable build_gpu_runtime_sycl
disable benchdnn_all
enable benchdnn_binary
enable arch_cpu_adl
enable arch_cpu_clx
enable arch_cpu_hsw
enable arch_cpu_nhm
enable arch_cpu_skx
enable arch_cpu_snb
enable arch_cpu_spr
enable arch_cpu_srf

>cl --version
Microsoft (R) C/C++ Optimizing Compiler Version 19.29.30158 for x64

This version of compiler for C++17 (C++11 is fine) will generate wrong
code for non-ternary algorithms due to the branch in the constructor.
It inverts objects that must be created and must not:
io_=0000028F599311D8
io_.at(conf_.src0_type)=0000000000000000
io_.at(conf_.src1_type)=0000000000000000
io_.at(conf_.src2_type)=0000028F59929C00
io_.at(conf_.dst_type)=0000000000000000

The solution is to pass src2_type unconditionally, access to such
io_injector must be regulated by checking for is_ternary().
@dzarukin dzarukin force-pushed the dzarukin/rls-v3.7/wa_jit_binary branch from c7646cc to 9496130 Compare February 28, 2025 22:39
@dzarukin
Copy link
Contributor Author

make test
disable test_device_gpu
disable build_gpu_runtime_ocl
disable build_gpu_runtime_sycl
disable benchdnn_all
enable benchdnn_binary
enable arch_cpu_adl
enable arch_cpu_clx
enable arch_cpu_hsw
enable arch_cpu_nhm
enable arch_cpu_skx
enable arch_cpu_snb
enable arch_cpu_spr
enable arch_cpu_srf

@vpirogov vpirogov merged commit 50dd6cc into rls-v3.7 Mar 1, 2025
3 of 4 checks passed
@vpirogov vpirogov deleted the dzarukin/rls-v3.7/wa_jit_binary branch March 1, 2025 16:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backport platform:cpu-x64 Intel64/AMD64 processors. Codeowner: @oneapi-src/onednn-cpu-x64
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants