Alert and SDT Examples
Queries relating to active alerts, scheduled downtime (SDT), and alert history.
See also:
- devices.md for device-specific queries
- collectors.md for collector-related SDTs
- Find long SDTs
- Find the oldest active critical alert
- Find time-related alerts (NTP, clock skew)
- Find the oldest WMI alerts for Windows devices
- meta
Find long SDTs
This will find SDTs that don’t end for at least one year from the current time:
elm SDTList -F endDateTime\>$(( ( $(date +'%s') + 31536000 ) * 1000 )) -f id,deviceGroupFullPath,deviceDisplayName,endDateTimeOnLocal,duration,admin,comment -S endDateTime -s0
Find the oldest active critical alert
Alert severity: 2=Warning, 3=Error, 4=Critical.
elm AlertList -s1 -S startEpoch -F severity:4,cleared:false \
-f id,severity,startEpoch,resourceTemplateName,instanceName,resourceId,resourceName
Find time-related alerts (NTP, clock skew)
Search for active alerts whose datasource or datapoint name mentions NTP or time:
elm AlertList -s0 -F cleared:false \
-f id,severity,startEpoch,resourceTemplateName,dataPointName,resourceName,alertMessage | \
jq '.AlertList[] | select(
(.resourceTemplateName | ascii_downcase | test("ntp|time")) or
(.dataPointName | ascii_downcase | test("ntp|time|offset|skew"))
)'
Find the oldest WMI alerts for Windows devices
elm AlertList -s0 -S startEpoch -F cleared:false \
-f id,severity,startEpoch,resourceTemplateName,instanceName,resourceName | \
jq '.AlertList[] | select(.resourceTemplateName | ascii_downcase | test("wmi"))' | \
jq -s 'sort_by(.startEpoch) | .[0:5]'
meta
Update the ToC on this page by running the following:
gh-md-toc --insert --no-backup --hide-footer --skip-header examples/alerts.md