You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
U8,i8,u4,i4,fp16 and dynamic quantization support on U8,i8,u4,i4.
ONEDNN3.5 migration has squashed these changes into one:
[FORK][FEATURE] InnerProduct primitive: 8bit weights decompression support
[FORK][FEATURE] InnerProduct primitive: 8bit weights decompression support on AMX
[FORK][FEATURE] InnerProduct primitive: 4bit weights decompression support
[FORK][FEATURE] Enable prepack algorithm for 4bit weights decompression
[FORK][FEATURE] InnerProduct primitive: 4bit weights decompression support on SPR
[FORK][FEATURE] InnerProduct primitive: src dynamic quantization
[FORK][FIX] Fixed behavior for unaligned src and weights ic groups
[FORK][FEATURE] InnerProduct primitive: src dynamic quantization
[FORK][FEATURE] Support (f32,fp16,f32) inner-product
[FORK][FEATURE] Support s8 for weight-compresseion ip
[FORK][FIX] Squash FC weight compression fix when migrating 3.5.
[FORK][FIX]fix ld_step
[FORK][FEATURE]Remove sub_byte_data_type_multiplier
[Fork][Fix]Fix offset for dynamic quantization
[FORK][FIX]Fix zp check and u4/s4 reorder
[FORK][FIX]Fix zp set API
[Fork][Fix]Skip runtime scale & zp check with weight compression
[Fork][Fix] fix bug in brgemm introduced by (f32,fp16,f32) inner-product
Co-Authored-By: Tingqian Li <Tingqian.Li@intel.com>, Yi Zhang<Yi3.Zhang@intel.com>
Copy file name to clipboardexpand all lines: include/oneapi/dnnl/dnnl_common_types.h
+4-2
Original file line number
Diff line number
Diff line change
@@ -104,9 +104,11 @@ typedef enum {
104
104
dnnl_u4=12,
105
105
/// [MX-compliant 8-bit compliant scale data type](https://www.opencompute.org/documents/ocp-microscaling-formats-mx-v1-0-spec-final-pdf) with 8-bit exponent.
106
106
dnnl_e8m0=13,
107
-
107
+
/// 4-bit normalized float.
108
+
dnnl_nf4=14,
108
109
/// 1-bit integer.
109
-
dnnl_bin=14,
110
+
dnnl_bin=15,
111
+
110
112
/// Parameter to allow internal only data_types without undefined behavior.
111
113
/// This parameter is chosen to be valid for so long as sizeof(int) >= 2.
0 commit comments