Node: Channel writes without blocking #4276
Draft
+137
−115
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Overview
The purpose of this PR is to address instances of writes to go channels that may block unexpectedly. I went through the guardian code, identified unbuffered channel writes, and made a determination if blocking is appropriate or not.
Conclusion
This review of the guardian code did not find any instances where a channel write would block the critical path. It did detect lesser cases that are addressed.
Implementation
In the cases where blocking is appropriate, I tagged the line with
//nolint:channelcheck // <reason_why_blocking_is_appropriate>
. That way when the linter is updated with this plugin, these lines will be correctly ignored..In some of the cases where blocking is not appropriate, I changed it to use the
common.WriteToChannelWithoutBlocking
function instead. This does the write with a select and pegs a metric if the write would block.In some cases (the processor, for example), a write that would block is handled in a way more consistent with the existing code.
Change in Behavior
One significant change in behavior introduced by this PR is that previously, the channel used by all the watchers to post observations to the processor (
msgC
) was unbuffered, meaning that if more than one watcher posted an event at roughly the same time, it could block. This doesn't seem like desirable behavior now that we have so many watchers, so I changedmsgC
to be buffered. This change is open for discussion!