This alert presents the number of seconds that the replica is behind the master. Receiving this means that the replication SQL thread is far behind processing the source binary log. A constantly high value (or an increasing one) indicates that the replica is unable to handle events from the source in a timely fashion.
This alert is raised into warning when the metric exceeds 10 seconds. If the number of seconds that the replica is behind the master exceeds 30 seconds then the alert is raised into critical.
To minimize slave SQL_THREAD
lag, focus on query optimization. The following logs will help you identify the problem:
full
.You can also read the Percona blog for a nice write-up aboutMySQL replication slave lag.