Configure DevHelm incident trigger rules, confirmation windows, and recovery behavior
Every monitor has an incident policy that controls when incidents open, how they’re confirmed, and when they auto-resolve. A policy has three components: trigger rules, a confirmation policy, and a recovery policy.
Confirmation prevents false positives by requiring failures from multiple probe regions before promoting an incident to CONFIRMED status.
Field
Type
Description
type
string
Confirmation strategy — currently multi_region
minRegionsFailing
integer
Minimum regions that must be failing to confirm
maxWaitSeconds
integer
Maximum seconds to wait for enough regions to report failures
When a trigger rule fires in one region, the confirmation policy waits up to maxWaitSeconds for at least minRegionsFailing regions to also report failures. If enough regions confirm within the window, the incident moves to CONFIRMED and alerts fire. If the window expires without enough regions failing, the incident is discarded.
Set minRegionsFailing to 1 to confirm on the first region that reports a failure. This is useful for monitors running from a single region.
Recovery controls when a confirmed incident auto-resolves.
Field
Type
Description
consecutiveSuccesses
integer
Number of consecutive passing checks required before resolving
minRegionsPassing
integer
Minimum regions that must be healthy before recovery completes
cooldownMinutes
integer
Minutes after resolution before a new incident can open (0–60)
The recovery policy ensures stability before closing an incident. After the required consecutive successes are observed across enough regions, the incident moves to RESOLVED. The cooldown period then suppresses new incidents for the same monitor, preventing flapping.