Skip to content

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

statsd metric overflow #2098

Closed
tkrafael opened this issue Sep 5, 2023 · 1 comment
Closed

statsd metric overflow #2098

tkrafael opened this issue Sep 5, 2023 · 1 comment
Labels

Comments

@tkrafael
Copy link

tkrafael commented Sep 5, 2023

Hi,
We're troubleshooting an app in production.
Our app does:

  1. Read from S3 (using sqs for event)
  2. Do some processing on input and generate a new root
  3. Send output to SQS (batch mode, 64 concurrent requests)
    For metrics, we're using telegraf's statsd with all default configuration. Telegraf is running as side car in same ecs task

From time to time we get an error level=warning msg="[STATSD] 814 packets lost (overflow)" @service=benthos
After digging the problem further, we found that nearly 50% of metrics were lost. We're handing 1M messages, the target system receives 1M requests but benthos only shows 424k requests (this number varies).

Can someone provide some advice on it?

@Jeffail
Copy link
Collaborator

Jeffail commented Sep 6, 2023

Hey @tkrafael, you might need to tune the flush period so that it clears its memory more frequently as I'd imagine the client simply cannot keep up with the volume of traffic, and rather than block the overal flow of traffic is leans on the side of dropping observability data. It's happening within the statsd client library we're using so I can't say for absolute sure what's happening with investigating it deeper. Converting to discussion as per #2026

@redpanda-data redpanda-data locked and limited conversation to collaborators Sep 6, 2023
@Jeffail Jeffail converted this issue into discussion #2100 Sep 6, 2023

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

Labels
Projects
None yet
Development

No branches or pull requests

2 participants