The portcheck_connection_timeouts alert calculates the average ratio of connection timeouts when trying to connect to a TCP endpoint over the last 5 minutes. If you receive this alert, connection attempts to the monitored TCP endpoint are timing out, potentially due to networking issues or an overloaded host or service.
This alert enters a warning state when the ratio of timeouts is between 10% and 40%, and a critical state when the ratio is greater than 40%.
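The thresholds above can be sketched as a small shell function. This is only an illustration of the arithmetic, not Netdata's own alert definition: the function name classify_timeout_ratio, the state label "clear", and the handling of the exact boundary values are assumptions.

```shell
# Map a timeout ratio (integer percent, 0-100) to an alert state.
# Thresholds follow the text: warning at 10-40%, critical above 40%.
# The name and the "clear" label are hypothetical, chosen for illustration.
classify_timeout_ratio() {
  local pct="$1"
  if [ "$pct" -gt 40 ]; then
    echo "critical"
  elif [ "$pct" -ge 10 ]; then
    echo "warning"
  else
    echo "clear"
  fi
}

# Example: 3 timeouts out of 20 connection attempts is 15%.
timeouts=3
attempts=20
pct=$(( timeouts * 100 / attempts ))
echo "ratio=${pct}% state=$(classify_timeout_ratio "$pct")"
# prints: ratio=15% state=warning
```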
Check the network connectivity
Use the ping command to check network connectivity between your system and the monitored TCP endpoint.
ping <tcp_endpoint_ip>
If the connectivity is intermittent or not established, it indicates network issues. Reach out to your network administrator for assistance.
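Because ping tests ICMP rather than TCP, it can help to probe the port itself as well. The following is a minimal sketch, assuming bash (for its /dev/tcp pseudo-device) and the coreutils timeout command are available. The function names probe_tcp and timeout_ratio, the 2-second limit, and the default of 10 attempts are hypothetical choices, not part of Netdata.

```shell
# probe_tcp HOST PORT [SECONDS]
# Tries one TCP connection using bash's /dev/tcp pseudo-device.
# Exit status: 0 on success, 124 if `timeout` had to kill the attempt,
# any other non-zero value if the connection was refused or unreachable.
probe_tcp() {
  local host="$1" port="$2" secs="${3:-2}"
  timeout "$secs" bash -c 'exec 3<>"/dev/tcp/$0/$1"' "$host" "$port" 2>/dev/null
}

# timeout_ratio HOST PORT [ATTEMPTS]
# Prints the percentage of attempts that timed out (integer, 0-100).
# Refused connections are not counted as timeouts.
timeout_ratio() {
  local host="$1" port="$2" attempts="${3:-10}"
  local timed_out=0 i=0 rc
  while [ "$i" -lt "$attempts" ]; do
    rc=0
    probe_tcp "$host" "$port" 2 || rc=$?
    if [ "$rc" -eq 124 ]; then
      timed_out=$(( timed_out + 1 ))
    fi
    i=$(( i + 1 ))
  done
  echo $(( timed_out * 100 / attempts ))
}

# Usage (substitute the real endpoint address and port):
#   timeout_ratio <tcp_endpoint_ip> <port_number> 20
```

A consistently high percentage here points at dropped packets or an overloaded listener, while immediate failures (counted as 0% timeouts) suggest the port is closed or actively rejected.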
Check the status of the monitored TCP service
Use the netstat command on the endpoint host to check whether the service is listening on the expected port:
netstat -tnlp | grep <port_number>
If the service is not running or is unresponsive, restart it or check the application logs for errors.
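On hosts where netstat is not installed, ss from iproute2 accepts the same -tnlp flags. The sketch below wraps the port check in a helper so it can be used in scripts. The function name has_listener is hypothetical, and it relies on the local address being the fourth column, which holds for the usual output of both ss -tnl and netstat -tnl.

```shell
# has_listener PORT
# Reads `ss -tnl` or `netstat -tnl` style output on stdin and succeeds
# (exit 0) if any local address in column 4 ends with ":PORT".
has_listener() {
  awk -v suffix=":$1" '
    # Compare the tail of column 4 against ":PORT" so that a search
    # for port 22 does not match port 8022.
    substr($4, length($4) - length(suffix) + 1) == suffix { found = 1 }
    END { exit !found }
  '
}

# Usage (run on the endpoint host, substituting the real port):
#   ss -tnl | has_listener <port_number> && echo "listening" || echo "not listening"
```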
Verify the load on the TCP endpoint host
Use tools such as top, vmstat, iostat, and iftop to check the CPU, memory, disk I/O, and network load on the host.
Examine the firewall rules and security groups
Check the Netdata configuration
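Review the Netdata configuration file /etc/netdata/netdata.conf to ensure the portcheck plugin settings are correctly configured for monitoring the TCP endpoint. Depending on the installation, the portcheck jobs themselves may be defined in go.d/portcheck.conf under the same directory.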