Skip to content

Commit 7202dfc

Browse files
authored
Support for ARM SVE and better performance for NEON (#281)
* Support for ARM SVE and better performance for NEON * Code refactoring to support code generation for both amd64 as well as arm64
1 parent d4574a5 commit 7202dfc

20 files changed

+28506
-175
lines changed

README.md

+4
Original file line numberDiff line numberDiff line change
@@ -25,6 +25,10 @@ Using Go modules is recommended.
2525

2626
# Changes
2727

28+
## 2024
29+
30+
* Auto-generation of SVE and NEON routines for ARM based on AVX2 code. This results in a speedup of 2x for SVE (as measured using Graviton 3 on AWS) and a speedup of 1.5x as compared to the existing NEON-accelerated code.
31+
2832
## 2022
2933

3034
* [GFNI](https://github.com/klauspost/reedsolomon/pull/224) support for amd64, for up to 3x faster processing.

0 commit comments

Comments
 (0)