Skip to content

Commit 6a7f6db

Browse files
densamoilovazhai219
authored andcommitted
cpu: x64: rnn: multiplier is not required for gru
The multiplier (2) is only required when rnn.brgemm_fwd_iter_layer_fuse_possible is true and is only used in kernel_fused_iter_layer.
1 parent 03632e4 commit 6a7f6db

File tree

1 file changed

+3
-4
lines changed

1 file changed

+3
-4
lines changed

src/cpu/x64/rnn/brgemm_cell_common_fwd.cpp

+3-4
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
/*******************************************************************************
2-
* Copyright 2021-2024 Intel Corporation
2+
* Copyright 2021-2025 Intel Corporation
33
*
44
* Licensed under the Apache License, Version 2.0 (the "License");
55
* you may not use this file except in compliance with the License.
@@ -688,9 +688,8 @@ void brgemm_gru_t<src_t, weights_t, scratch_t, gemm_acc_t>::kernel(
688688
gemm_acc_t *const amx_buffer = is_amx
689689
? amx_scratchpad_ + rnn_.m_block * rnn_.n_block * ithr
690690
: nullptr;
691-
const int max_K_Block = 2
692-
* nstl::max(rnn_.KB1_blocks + 1,
693-
nstl::max(rnn_.KBproj_blocks + 1, rnn_.KB2_blocks + 1));
691+
const int max_K_Block = nstl::max(rnn_.KB1_blocks + 1,
692+
nstl::max(rnn_.KBproj_blocks + 1, rnn_.KB2_blocks + 1));
694693
brgemm_batch_element_t *const addr_batch
695694
= addr_batch_global_ + ithr * max_K_Block;
696695

0 commit comments

Comments
 (0)