-
Notifications
You must be signed in to change notification settings - Fork 83
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Nighthawk doesn't correctly drain the connection pool #873
Comments
We used to call That was a while ago, and I have no recollection of any reason why that call had to be removed. Therefore, I think that it may have been dropped by accident :-( |
Thank you for adding the details @oschaaf. This gives us reassurance that adding it back it should do no harm and should be an easy fix. |
Did a quick check into this and this might require an upstream envoy change. https://github.com/envoyproxy/envoy/pull/16544/files obfuscated the connection pool behind a new object called PoolData. https://github.com/envoyproxy/envoy/pull/16865/files is an example change made previously by @mum4k that re-exposed some methods. A similar change is probably required to allow us to drain connections. |
#22554) Exposing the drainConnections method on the connection pool, so that Nighthawk can correctly drain the connection pools used by its individual workers. Fixes #22527. Works on envoyproxy/nighthawk#873. /cc @alyssawilk Risk Level: low, existing functionality isn't affected. Testing: unit tests. Docs Changes: n/a. Release Notes: n/a. Platform Specific Features: n/a. Signed-off-by: Jakub Sobon <mumak@google.com>
…. (#22554) Exposing the drainConnections method on the connection pool, so that Nighthawk can correctly drain the connection pools used by its individual workers. Fixes #22527. Works on envoyproxy/nighthawk#873. /cc @alyssawilk Risk Level: low, existing functionality isn't affected. Testing: unit tests. Docs Changes: n/a. Release Notes: n/a. Platform Specific Features: n/a. Signed-off-by: Jakub Sobon <mumak@google.com>
The pool termination code is here.
While the code does register an idle callback, that callback only gets called if the pool is draining. Nighthawk doesn't seem to trigger pool drain anywhere. We should call
drainConnections(DrainAndDelete)
while waiting for the pool to drain.The current state can cause Nighthawk to crash, because the shutdown thread owns the client dispatcher and may end up reading packets while shutting down resulting in a call to pure virtual method.
This can be reproduced by running high QPS test using HTTP3/QUIC, inducing a situation where Nighthawk has to deal with some late responses after it started shutting down.
The text was updated successfully, but these errors were encountered: