Aarch64 jit depthwise convolution kernels #2014

nishith-fujitsu · 2024-07-29T09:31:43Z

Description

This commit expands ARM SVE support for forward and backward JIT SVE Depth-wise convolution in FP32 for, introducing compatibility with various vector lengths. The changes made are for implementing different ARM ISA.
Major code changes:

Added common files jit_uni_dw_convolution.cpp, jit_uni_dw_convolution.hpp, jit_uni_dw_conv_kernel_utils.hpp, jit_uni_dw_conv_kernel_f32.cpp, jit_uni_dw_conv_kernel_f32.hpp to accommodate the extended ARM SVE ISA for depth-wise convolution operators.
Set data format tags according to the ARM ISA being used for forward and backward operator in JIT depth-wise convolution.
Replaced ldr, and str instructions for vector registers with ld1w and st1w to utilize predication.
Made changes to reducer.cpp to support Vector agnostic approach.

Checklist

General

[✓] Do all unit and benchdnn tests (make test and make test_benchdnn_*) pass locally for each commit? Yes
Test output is same with and without this commit.

99% tests passed, 2 tests failed out of 196
 
Total Test time (real) = 525.25 sec
 
The following tests FAILED:
        155 - test_graph_unit_dnnl_large_partition_usm_cpu (Failed)
        177 - test_benchdnn_modeC_graph_ci_cpu (Failed)
Errors while running CTest
Output from these tests are in: /home/nishith/oss/oneDNN/build/Testing/Temporary/LastTest.log
Use "--rerun-failed --output-on-failure" to re-run the failed cases verbosely.
make: *** [Makefile:71: test] Error 8

[ ✓] Have you formatted the code using clang-format? Yes

abhijain1204fujitsu · 2024-08-05T15:44:23Z

@vpirogov , Can you please share your feedback for the PR & Support us for merger

vpirogov · 2024-08-06T16:58:44Z

@oneapi-src/onednn-cpu-aarch64, @snadampal, @kawakami-k, could you please help reviewing and validating this one?

jondea

This is really great work, thank you!

My only important comment is that I think it is better than the current ACL depthwise implementation and it should go ahead of it in the dispatch list.

I have also done some testing with this on a C7g with benchdnn and with PyTorch, and I didn't hit any issues.

src/cpu/cpu_convolution_list.cpp

src/cpu/aarch64/jit_uni_dw_convolution.hpp

src/cpu/aarch64/cpu_reducer.cpp

jondea

Code generally looks good, but can you remove the merge commit from the history by rebasing onto main please? https://github.com/oneapi-src/oneDNN/blob/main/CONTRIBUTING.md#code-contribution-guidelines

abhijain1204fujitsu · 2024-08-21T02:43:15Z

@vpirogov , @jondea the requested changes has been completed, kindly check and support to merge the PR

abhijain1204fujitsu · 2024-08-26T04:24:36Z

@vpirogov kindly support to merge the changes
In case there is any more feedback kindly let us know.
Thank you !

src/cpu/aarch64/jit_uni_dw_convolution.hpp

mgouicem

I believe a missing closing brace should make that code break the build on ARM platforms.

mgouicem

please remove the merge commit. Instead rebase your branch on top of the main branch.

nishith-fujitsu · 2024-08-26T09:59:48Z

please remove the merge commit. Instead rebase your branch on top of the main branch.

Hi @mgouicem, removed merge commits and rebased to latest main branch.

abhijain1204fujitsu · 2024-08-26T16:00:52Z

@vpirogov , @mgouicem please support to merge the PR

vpirogov added this to the v3.6 milestone Jul 29, 2024

vpirogov added the platform:cpu-aarch64 Codeowner: @oneapi-src/onednn-cpu-aarch64 label Jul 29, 2024

vpirogov changed the base branch from main to vpirogov/codeowners-update July 31, 2024 16:27

vpirogov requested review from a team as code owners July 31, 2024 16:27

vpirogov changed the base branch from vpirogov/codeowners-update to main July 31, 2024 16:27

vpirogov removed the request for review from a team July 31, 2024 16:27

jondea mentioned this pull request Aug 8, 2024

cpu: aarch64: Enable BRGEMM Depthwise Forward Convolution. #2009

Merged

2 tasks

jondea requested changes Aug 15, 2024

View reviewed changes

src/cpu/cpu_convolution_list.cpp Outdated Show resolved Hide resolved

src/cpu/aarch64/jit_uni_dw_convolution.hpp Outdated Show resolved Hide resolved

src/cpu/aarch64/cpu_reducer.cpp Show resolved Hide resolved

jondea requested changes Aug 19, 2024

View reviewed changes

nishith-fujitsu force-pushed the AARCH64_JIT_Depthwise_Convolution_kernels branch 2 times, most recently from 7740689 to ad33f1c Compare August 19, 2024 10:23

jondea approved these changes Aug 21, 2024

View reviewed changes

mgouicem reviewed Aug 26, 2024

View reviewed changes

src/cpu/aarch64/jit_uni_dw_convolution.hpp Outdated Show resolved Hide resolved

mgouicem requested changes Aug 26, 2024

View reviewed changes

nishith-fujitsu force-pushed the AARCH64_JIT_Depthwise_Convolution_kernels branch from 8dd253d to 9a4fcc5 Compare August 26, 2024 09:42

mgouicem approved these changes Aug 26, 2024

View reviewed changes

mgouicem requested changes Aug 26, 2024

View reviewed changes

nishith-fujitsu force-pushed the AARCH64_JIT_Depthwise_Convolution_kernels branch from e70a447 to 9a4fcc5 Compare August 26, 2024 09:48

mgouicem approved these changes Aug 26, 2024

View reviewed changes

cpu:aarch64: Extend Arm SVE support for Depthwise Convolution Kernels

6f6a7ae

nishith-fujitsu force-pushed the AARCH64_JIT_Depthwise_Convolution_kernels branch from 9a4fcc5 to 6f6a7ae Compare August 26, 2024 09:55

nishith-fujitsu closed this Aug 26, 2024

nishith-fujitsu reopened this Aug 26, 2024

vpirogov merged commit 5639fae into uxlfoundation:main Aug 26, 2024
10 of 12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Aarch64 jit depthwise convolution kernels #2014

Aarch64 jit depthwise convolution kernels #2014

nishith-fujitsu commented Jul 29, 2024

abhijain1204fujitsu commented Aug 5, 2024

vpirogov commented Aug 6, 2024 •

edited

Loading

jondea left a comment

jondea left a comment •

edited

Loading

abhijain1204fujitsu commented Aug 21, 2024

abhijain1204fujitsu commented Aug 26, 2024

mgouicem left a comment

mgouicem left a comment

nishith-fujitsu commented Aug 26, 2024

abhijain1204fujitsu commented Aug 26, 2024

Aarch64 jit depthwise convolution kernels #2014

Aarch64 jit depthwise convolution kernels #2014

Conversation

nishith-fujitsu commented Jul 29, 2024

Description

Checklist

General

abhijain1204fujitsu commented Aug 5, 2024

vpirogov commented Aug 6, 2024 • edited Loading

jondea left a comment

Choose a reason for hiding this comment

jondea left a comment • edited Loading

Choose a reason for hiding this comment

abhijain1204fujitsu commented Aug 21, 2024

abhijain1204fujitsu commented Aug 26, 2024

mgouicem left a comment

Choose a reason for hiding this comment

mgouicem left a comment

Choose a reason for hiding this comment

nishith-fujitsu commented Aug 26, 2024

abhijain1204fujitsu commented Aug 26, 2024

vpirogov commented Aug 6, 2024 •

edited

Loading

jondea left a comment •

edited

Loading