
[RFCs] Add Flow Graph try_put_and_wait RFC #1513


Open · wants to merge 47 commits into master

Conversation

kboyarinov
Contributor

Description

Add a comprehensive description of proposed changes

Fixes # - issue number(s) if exists

Type of change

Choose one or multiple, leave empty if none of the other choices apply

Add the respective label(s) to the PR if you have permissions

  • bug fix - change that fixes an issue
  • new feature - change that adds functionality
  • tests - change in tests
  • infrastructure - change in infrastructure and CI
  • documentation - documentation update

Tests

  • added - required for new features and some bug fixes
  • not needed

Documentation

  • updated in # - add PR number
  • needs to be updated
  • not needed

Breaks backward compatibility

  • Yes
  • No
  • Unknown

Notify the following users

List users with @ to send notifications

Other information

vossmjp and others added 30 commits August 1, 2024 13:59
Co-authored-by: Aleksei Fedotov <aleksei.fedotov@intel.com>
Co-authored-by: Alexandra <alexandra.epanchinzeva@intel.com>
Made suggested wording changes.

@kboyarinov kboyarinov marked this pull request as draft September 13, 2024 09:35
Base automatically changed from dev/vossmjp/rfcs to master September 26, 2024 14:02
@kboyarinov kboyarinov marked this pull request as ready for review January 6, 2025 15:42
@vossmjp vossmjp added the RFC label Jan 9, 2025
Otherwise, if the concurrency limit of the node is reached, both the message and the associated metainformation are rejected, and the predecessor that called ``try_put_task``
is responsible for buffering both of them.

If the predecessor is not a buffering node, both the message and the metainfo are lost.
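The rejection path described above can be sketched as follows. All type and member names here are hypothetical simplifications for illustration, not the actual oneTBB internals:

```cpp
#include <cstddef>
#include <queue>
#include <utility>
#include <vector>

// Hypothetical simplification of a metainfo: a list of wait contexts
// (represented here as plain ids) that travels with a message.
struct message_metainfo {
    std::vector<int> waiters;
};

template <typename T>
struct limited_node {
    std::size_t concurrency_limit;
    std::size_t active = 0;

    // Returns false when the concurrency limit is reached: both the
    // message and its metainfo are rejected together.
    bool try_put_task(const T& t, const message_metainfo& info) {
        if (active >= concurrency_limit) return false;
        ++active;
        (void)t; (void)info;  // a real node would spawn a task here
        return true;
    }
};

template <typename T>
struct buffering_predecessor {
    std::queue<std::pair<T, message_metainfo>> buffer;

    template <typename Node>
    void forward(Node& n, const T& t, const message_metainfo& info) {
        if (!n.try_put_task(t, info))
            buffer.push({t, info});  // keep message and metainfo together
        // A non-buffering predecessor would simply drop both here,
        // which is what "lost" means in the text above.
    }
};
```

The point of the sketch is that acceptance and rejection always treat the message and its metainfo as a unit.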
Contributor

@vossmjp Mar 26, 2025

You should say what "lost" means in this context. How does it affect waiting? Does a lost message/metainfo properly decrement the count?

Contributor Author

Updated the description

* Support for multi-output nodes should be described and implemented
* Feedback from customers should be collected
* More multithreaded tests should be implemented for the existing functionality
* The corresponding oneAPI specification update should be made
Contributor

Should we add a clear function to all buffering nodes?

Contributor Author

Added to the list of open questions, together with the safety guarantees for this method.

kboyarinov and others added 5 commits April 2, 2025 16:18
Co-authored-by: Alexey Kukanov <alexey.kukanov@intel.com>
Co-authored-by: Mike Voss <michaelj.voss@intel.com>
Comment on lines +332 to +334
Each input port of the join_node should maintain a queue of both values and the associated metainformation. Once every input port contains a value, the values
should be combined into a single tuple output, the metainformation objects should be merged into a single metainfo using `metainfo1.merge(metainfo2)` associated with the tuple,
and the result submitted to the successors.
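A minimal sketch of the per-port queues and the merge step, using hypothetical simplified types (the real join_node internals differ):

```cpp
#include <optional>
#include <queue>
#include <tuple>
#include <utility>
#include <vector>

// Hypothetical simplified metainfo with the merge operation from the text.
struct message_metainfo {
    std::vector<int> waiters;
    void merge(const message_metainfo& other) {
        waiters.insert(waiters.end(), other.waiters.begin(), other.waiters.end());
    }
};

template <typename T0, typename T1>
struct join_sketch {
    // Each input port queues (value, metainfo) pairs.
    std::queue<std::pair<T0, message_metainfo>> port0;
    std::queue<std::pair<T1, message_metainfo>> port1;

    // When every port has a value, combine the values into a tuple and
    // merge the metainfos into a single one associated with that tuple.
    std::optional<std::pair<std::tuple<T0, T1>, message_metainfo>> try_combine() {
        if (port0.empty() || port1.empty()) return std::nullopt;
        auto [v0, m0] = port0.front(); port0.pop();
        auto [v1, m1] = port1.front(); port1.pop();
        m0.merge(m1);  // one metainfo for the whole tuple
        return std::make_pair(std::make_tuple(v0, v1), m0);
    }
};
```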
Contributor

Since the merged metainformation is associated with a tuple of messages, couldn't we additionally store the indexes of the input ports? That would allow split nodes to be smarter and split the metainfo as well. Or would it be too risky to assume that nodes between join and split do not reshuffle the tuple? Tagging @vossmjp as well.

Contributor Author

Yes, as far as I remember, the risk of changing the tuple between join and split was the main reason why we did not do any splitting of the metainfo.

* Should a ``clear()`` member function be added to the non-rejecting ``join_node`` to handle the case when we don't have an input present on each input port.
* Concurrent safety guarantees for ``clear()`` should be defined (e.g. is it safe to clear the buffer when another thread tries to insert an item).
* Feedback from customers should be collected
* More multithreaded tests should be implemented for the existing functionality
Contributor

An idea for an example/test: a version of dining philosophers where an external thread would use try_put_and_wait() for one of them (perhaps the fastest thinker :))

Contributor Author

I think it is also a good example that shows the necessity of some trait for buffering nodes that would allow ignoring the metainformation. If we consider the implementation from our examples but without using the multifunction_node, the chopstick is always present in the graph. If we want to use try_put_and_wait for one of the philosophers, then at the point when both chopsticks are acquired, we have a tuple of the signal and two chopsticks, associated with the metainfo from the signal.
When a chopstick is returned to its buffer, it would carry the metainfo from the signal, and the chopstick would be considered part of the computations needed by try_put_and_wait.

In this case, the buffer for chopsticks can be marked with a special trait to ignore the metainformation.
I think the same problem arises in any graph that uses token passing.

Alternatively, it can be done by the multifunction_node by explicitly passing the metainfo only to the "keep thinking" port but not to the "return chopstick" ports.
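Such a trait could look roughly like this; `token_buffer` and the boolean trait parameter are hypothetical illustrations, not proposed API:

```cpp
#include <queue>
#include <utility>
#include <vector>

// Hypothetical simplified metainfo.
struct message_metainfo {
    std::vector<int> waiters;
    bool empty() const { return waiters.empty(); }
};

// A buffer marked with IgnoreMetainfo stores the token but drops the
// incoming metainfo, so returned tokens (e.g. chopsticks) do not keep
// the try_put_and_wait reference counter alive.
template <typename T, bool IgnoreMetainfo>
struct token_buffer {
    std::queue<std::pair<T, message_metainfo>> items;

    void put(const T& t, const message_metainfo& info) {
        if constexpr (IgnoreMetainfo)
            items.push({t, message_metainfo{}});  // token only, no waiters
        else
            items.push({t, info});
    }
};
```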

Contributor

The idea to keep a tuple index in the metainfo (#1513 (comment)) could help as well.

Comment on lines 175 to 177
* The item associated with the computations is taken from the buffering node or from the internal buffer.
* When the desired number of signals from the predecessors has been received by the ``continue_node``. This case is equivalent to retrieving the set of buffered ``message_metainfo``s received previously
from each predecessor.
Contributor

For these cases, how is it ensured that the reference counters are not decreased prematurely?

Contributor Author

continue_node creates a copy of the stored metainfo, merged from each of the predecessors, and creates a task for executing the body that holds an additional reference on the counter (similarly to any task with stored metainfo).
Only after spawning the task are the reference counters on the copy decreased.

I have added a note that the reference counter is decreased after spawning a task.
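A simplified model of that ordering, with hypothetical names: the body task takes its reference on the wait counter before the node releases the references held by the stored metainfo copy, so the counter cannot hit zero between "all predecessors signaled" and "body executed":

```cpp
#include <atomic>

// Hypothetical wait context: a reference counter that the external
// try_put_and_wait call blocks on until it drops to zero.
struct wait_context {
    std::atomic<int> refs{0};
    void reserve() { refs.fetch_add(1); }
    void release() { refs.fetch_sub(1); }
};

struct body_task {
    wait_context* ctx;
    explicit body_task(wait_context* c) : ctx(c) { ctx->reserve(); }  // task holds a reference
    void run_and_finish() { /* execute body, broadcast */ ctx->release(); }
};

// Ordering matters: reserve (inside the task constructor) happens before
// the node releases the stored metainfo copy's reference.
inline body_task signal_last_predecessor(wait_context& ctx) {
    body_task task(&ctx);  // +1 for the spawned task
    ctx.release();         // -1 for the stored metainfo copy, after spawn
    return task;
}
```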


It may buffer both ``t`` and ``metainfo`` or broadcast the result and the ``info`` to the successors of the node.

The existing API ``try_put_task(const T& t)`` can reuse the new one with an empty metainfo object if an empty metainfo is preserved as a lightweight structure.
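The delegation can be sketched as follows (hypothetical, simplified signatures, not the actual oneTBB code):

```cpp
#include <vector>

// Hypothetical simplified metainfo: default-constructed means "empty",
// i.e. no wait contexts to track.
struct message_metainfo {
    std::vector<int> waiters;
};

struct node_sketch {
    int calls_with_metainfo = 0;

    bool try_put_task(int t, const message_metainfo& info) {
        (void)t; (void)info;  // would buffer or broadcast here
        ++calls_with_metainfo;
        return true;
    }

    // The legacy overload forwards to the metainfo-aware one with an
    // empty metainfo; this stays cheap as long as constructing an empty
    // metainfo is lightweight.
    bool try_put_task(int t) {
        return try_put_task(t, message_metainfo{});
    }
};
```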
Contributor

I would instead say "if that does not incur a noticeable overhead for the normal use of flow graphs."

Contributor Author

Applied

The difference is that for lightweight nodes no tasks are created and spawned in most cases, and the node body is executed by the calling thread.
Since there are no tasks, the calling thread broadcasts the output and the metainformation to the successors after completing the function.

### ``continue_node``
Contributor

I am afraid that the overhead of metainfo for dependency graphs can be big with a straightforward implementation.

Imagine a 2D wavefront graph started with try_put_and_wait. Each node that receives the metainfo will send two copies of it, one for each successor. Each node in the middle of the graph will receive two piles of metainfo from its predecessors, merge those (doubling the size of the pile), and send the result to its two successors. This is an exponential growth pattern.

And the benefit of single-message wait for continue nodes is very doubtful, as dependency graphs are not supposed to handle several independent message flows.

I would reconsider implementation and/or applicability of this feature for continue nodes.
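A back-of-the-envelope model of that growth, assuming each merge simply concatenates waiter lists: the pile size at node (i, j) of a grid satisfies size(i,j) = size(i-1,j) + size(i,j-1) with size(0,0) = 1, which is the binomial coefficient C(i+j, i) and hence exponential along the diagonal:

```cpp
#include <cstdint>
#include <vector>

// Computes the metainfo pile size at the sink node (n-1, n-1) of an
// n x n wavefront grid, under the concatenating-merge assumption above.
inline std::uint64_t metainfo_pile_size(int n) {
    std::vector<std::vector<std::uint64_t>> size(n, std::vector<std::uint64_t>(n, 0));
    size[0][0] = 1;  // the single metainfo injected by try_put_and_wait
    for (int i = 0; i < n; ++i)
        for (int j = 0; j < n; ++j) {
            if (i > 0) size[i][j] += size[i - 1][j];  // pile from the top predecessor
            if (j > 0) size[i][j] += size[i][j - 1];  // pile from the left predecessor
        }
    return size[n - 1][n - 1];
}
```

Even a modest 11x11 grid ends with a pile of C(20, 10) = 184756 metainfo entries at the sink, which illustrates the concern.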

Contributor Author

Added a separate section for this issue.
