-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
IPMI Monitoring Data Interruption Issue #3197
Comments
2025-03-30 17:16:19.654 [metrics-task-timeout-monitor-0] ERROR org.apache.hertzbeat.collector.dispatch.CommonDispatcher Line:168 - [Collect Timeout]: 2025-03-30 17:20:39.656 [metrics-task-timeout-monitor-0] ERROR org.apache.hertzbeat.collector.dispatch.CommonDispatcher Line:168 - [Collect Timeout]: 2025-03-30 17:24:59.658 [metrics-task-timeout-monitor-0] ERROR org.apache.hertzbeat.collector.dispatch.CommonDispatcher Line:168 - [Collect Timeout]: 2025-03-30 17:29:19.660 [metrics-task-timeout-monitor-0] ERROR org.apache.hertzbeat.collector.dispatch.CommonDispatcher Line:168 - [Collect Timeout]: 2025-03-30 17:33:39.663 [metrics-task-timeout-monitor-0] ERROR org.apache.hertzbeat.collector.dispatch.CommonDispatcher Line:168 - [Collect Timeout]: |
hi @sdlwdong is there more log information? The current doesn't seem to tell what the problem is. hi @gjjjj0101 please help take a look if have time, thanks. |
@gjjjj0101 Hello, which service's log do you need to see? Please guide me. Thank you ! 您好,需要看哪个服务的日志?请指导一下谢谢。 |
I have located the problem now. When there is a problem with the communication network between the collector and the machine, the datagramChannel.receive() of nio used in the collector will not throw a network timeout exception, causing the manager's collection to time out. Therefore, the status is still up and the collection time is the earliest correct collection time. |
So this is a bug, I am still designing how to solve it, if you have good suggestions please share with me. |
The solutions can be:
2.IPMI Tool Validation:
3.Hertzbeat Configuration Adjustments: Increase timeout settings in hertzbeat.yml:
|
Since no exception like a timeout is thrown (as it’s using UDP), how about manually setting a specific timeout? If there’s no response within a certain period, we could treat it as a failed request. |
Question
Title:IPMI Monitoring Data Interruption Issue
Description:
After successfully configuring and establishing a normal monitoring connection for the physical machine via IPMI, we encountered an issue where data collection is interrupted after a period of time.
This suggests there may be an underlying issue with the IPMI integration or Hertzbeat's ability to maintain the connection over time.
please help me! thanks.
标题:IPMI监控数据中断问题
描述:
通过IPMI成功配置并建立物理机的正常监控连接后,我们发现数据采集在一段时间后中断。
请帮忙解决,谢谢。

The text was updated successfully, but these errors were encountered: