[LPT] Quantized LSTMSequence & GRUSequence extended support #25654

eshoguli · 2024-07-19T20:08:37Z

Details:

Low Precision Transformations: Quantized LSTMSequence & GRUSequence extended support

Tickets:

Current implementation for: CVS-146067
Will be changed in feature request: CVS-147588

v-Golubev

Please remove debug info (serialization, couts) and fix the CI

src/common/low_precision_transformations/src/markup_precisions.cpp

src/common/low_precision_transformations/src/broadcast.cpp

v-Golubev

I am still concerned by the current implementation. We can merge it, but I think we need to prioritize CVS-147588 to remove this WA as soon as possible

src/common/low_precision_transformations/src/broadcast.cpp

v-Golubev · 2024-07-25T07:43:11Z

src/common/low_precision_transformations/src/broadcast.cpp

+
+BroadcastTransformation::BroadcastTransformation(const Params& params) : TransparentBaseTransformation(params) {
+    MATCHER_SCOPE(BroadcastTransformation);
+    auto matcher = pattern::wrap_type<ov::opset1::Broadcast>({


We still don't match on v3::Broadcast here. Maybe we can match on BroadcastBase?

We need specify supported operation here, not base types, which can be inheritted by not supported operation in future.

src/common/low_precision_transformations/src/markup_precisions.cpp

...functional/plugin/shared/src/low_precision_transformations/recurrent_cell_transformation.cpp

src/common/low_precision_transformations/src/recurrent_cell.cpp

v-Golubev · 2024-07-25T08:16:11Z

src/common/low_precision_transformations/src/recurrent_cell.cpp

+
+std::vector<std::pair<size_t, element::Type>> get_quantized_inputs(std::shared_ptr<ov::Node> lstm) {
+    return is_type<ov::opset5::LSTMSequence>(lstm) ?
+        std::vector<std::pair<size_t, element::Type>>{ {0, element::u8}, { 1, element::u8 }, { 4, element::undefined }, { 5, element::undefined } } :


We always regulate the desired precisions on inputs with PrecisionsRestriction which are set by plugin. In CPU, we actually have u8u8 restrictions:

PrecisionsRestriction::create<ov::opset5::LSTMSequence>({ {{0, 1}, {ov::element::u8}} }), PrecisionsRestriction::create<ov::opset6::GRUSequence>({ {{0, 1}, {ov::element::u8}} }),

Why do we need this additional logic with precisions limitations here? If there is no chance to avoid it, could you please add an explanatory comment in the code on what undefined precision means?

We need it here.
LPT uses operation input precision restrictions to identify output precision for FakeQuantize dequantization transformation. But it doesn't mean, that we can ignore filter or input validation in each transformation pattern matching, because in Constant branch (for example, a branch with single Constant operation) LPT can not provide require precision.

This is a valid point, but it seems like we have such checks only in this transformation. Does it mean that we must perform the checks in each transformation (or at least in the transformations which work with layers with weights)?

src/common/low_precision_transformations/include/low_precision/broadcast.hpp

src/common/low_precision_transformations/src/broadcast.cpp

src/common/low_precision_transformations/src/recurrent_cell.cpp

.../functional/plugin/shared/include/low_precision_transformations/broadcast_transformation.hpp

src/tests/ov_helpers/ov_lpt_models/src/broadcast.cpp

src/common/low_precision_transformations/tests/broadcast_transformation.cpp

...functional/shared_tests_instances/low_precision_transformations/broadcast_transformation.cpp

eshoguli requested a review from a team as a code owner July 19, 2024 20:08

github-actions bot added the category: LP transformations label Jul 19, 2024

eshoguli added the category: CPU label Jul 20, 2024

eshoguli closed this Jul 22, 2024

eshoguli deleted the es/lpt/lstm_support_extension branch July 22, 2024 10:39

eshoguli restored the es/lpt/lstm_support_extension branch July 22, 2024 10:48

eshoguli reopened this Jul 22, 2024

github-actions bot removed the category: CPU label Jul 22, 2024

eshoguli force-pushed the es/lpt/lstm_support_extension branch from 717d7bd to e7840ff Compare July 22, 2024 18:44

eshoguli requested review from a team as code owners July 22, 2024 18:44

github-actions bot added category: IE Tests category: CPU labels Jul 22, 2024

eshoguli requested review from v-Golubev and EgorDuplensky July 22, 2024 18:46

eshoguli requested review from a team as code owners July 22, 2024 23:03

github-actions bot added the category: GPU label Jul 22, 2024

eshoguli force-pushed the es/lpt/lstm_support_extension branch 5 times, most recently from 13709dc to 9708278 Compare July 24, 2024 09:54

v-Golubev reviewed Jul 24, 2024

View reviewed changes

src/common/low_precision_transformations/src/markup_precisions.cpp Show resolved Hide resolved

src/common/low_precision_transformations/src/broadcast.cpp Outdated Show resolved Hide resolved

src/common/low_precision_transformations/src/broadcast.cpp Outdated Show resolved Hide resolved

eshoguli requested a review from v-Golubev July 24, 2024 23:22

eshoguli force-pushed the es/lpt/lstm_support_extension branch from 4487204 to 715e207 Compare July 24, 2024 23:39

eshoguli added 2 commits July 25, 2024 00:48

[LPT] Quantized LSTM & GRU extended support

0622ae6

cleanup + opset3::Broadcast

715e207

v-Golubev reviewed Jul 25, 2024

View reviewed changes

eshoguli force-pushed the es/lpt/lstm_support_extension branch from a29d204 to dd6f642 Compare July 25, 2024 18:29

eshoguli requested a review from v-Golubev July 25, 2024 18:30

tests + comments

dd6f642

v-Golubev reviewed Jul 26, 2024

View reviewed changes

eshoguli requested a review from v-Golubev July 26, 2024 12:25

eshoguli added 2 commits July 26, 2024 13:34

comments

cdaf971

tests cleanup + compilation fix

7247f3b

v-Golubev approved these changes Jul 26, 2024

View reviewed changes

eshoguli added this pull request to the merge queue Jul 27, 2024

Merged via the queue into openvinotoolkit:master with commit 3056b53 Jul 27, 2024
124 checks passed

eshoguli deleted the es/lpt/lstm_support_extension branch July 27, 2024 01:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[LPT] Quantized LSTMSequence & GRUSequence extended support #25654

[LPT] Quantized LSTMSequence & GRUSequence extended support #25654

eshoguli commented Jul 19, 2024 •

edited

Loading

v-Golubev left a comment

v-Golubev left a comment

v-Golubev Jul 25, 2024

eshoguli Jul 25, 2024

v-Golubev Jul 25, 2024

eshoguli Jul 25, 2024 •

edited

Loading

v-Golubev Jul 26, 2024

[LPT] Quantized LSTMSequence & GRUSequence extended support #25654

[LPT] Quantized LSTMSequence & GRUSequence extended support #25654

Conversation

eshoguli commented Jul 19, 2024 • edited Loading

Details:

Tickets:

v-Golubev left a comment

Choose a reason for hiding this comment

v-Golubev left a comment

Choose a reason for hiding this comment

v-Golubev Jul 25, 2024

Choose a reason for hiding this comment

eshoguli Jul 25, 2024

Choose a reason for hiding this comment

v-Golubev Jul 25, 2024

Choose a reason for hiding this comment

eshoguli Jul 25, 2024 • edited Loading

Choose a reason for hiding this comment

v-Golubev Jul 26, 2024

Choose a reason for hiding this comment

eshoguli commented Jul 19, 2024 •

edited

Loading

eshoguli Jul 25, 2024 •

edited

Loading