Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Backport: src: cpu: aarch64: conv: Use acl_indirect_gemm for bf16 convolutions #1953

Merged
merged 1 commit into from
Jun 7, 2024
Merged

Backport: src: cpu: aarch64: conv: Use acl_indirect_gemm for bf16 convolutions #1953

merged 1 commit into from
Jun 7, 2024

Conversation

Ryo-not-rio
Copy link
Contributor

Backport of #1933 to unreleased rls-v3.5

…volutions

performance improvements:

Total benchdnn tests: 57
Min: 15x
Average: 131x
Max: 320x

Change-Id: I6266fa72491a03dd96f0c6d51334d7ab376a0e26
@Ryo-not-rio
Copy link
Contributor Author

@fadara01

@vpirogov vpirogov added this to the v3.5 milestone Jun 7, 2024
@vpirogov vpirogov merged commit 19272d1 into uxlfoundation:rls-v3.5 Jun 7, 2024
@vpirogov vpirogov added the platform:cpu-aarch64 Codeowner: @oneapi-src/onednn-cpu-aarch64 label Jun 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
platform:cpu-aarch64 Codeowner: @oneapi-src/onednn-cpu-aarch64
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants