Prerequisites
Prerequisites
- At least one monitor running — see First HTTP monitor
- Familiarity with the Dashboard or CLI
When an incident appears
An incident appears in your Dashboard when a monitor’s check failures match its incident policy. Here’s the typical flow:Checks start failing
A monitor’s check returns an error or fails an assertion. DevHelm starts watching the situation.
Trigger rule matched
After enough consecutive failures (default: 2), the trigger rule fires. The incident enters
TRIGGERED status.Multi-region confirmation
If the monitor runs from multiple regions, DevHelm waits for confirmation from additional regions. This reduces false positives from single-region network issues.
Incident confirmed
The incident moves to
CONFIRMED. Alerts fire through your notification policies.Investigate the incident
View incident details
- Severity —
DOWN,DEGRADED, orMAINTENANCE - Affected regions — which probe regions are seeing failures
- Timeline — every status change and update since detection
- Trigger rule — which rule fired and why
- Duration — how long the incident has been active
Check the timeline
The incident timeline shows exactly what happened and when:updates array — each entry records a status change, user update, or system event with a timestamp.
View failing check results
Drill into the monitor’s recent check results to understand what’s failing:Resolve the incident
Automatic resolution
Most incidents resolve automatically. When the monitor starts passing again:- Consecutive passing checks accumulate (default: 2 required)
- Enough regions must be healthy (default: 2)
- The incident moves to
RESOLVED - A cooldown period prevents immediate reopening (default: 5 minutes)
Manual resolution
If you’ve fixed the issue and don’t want to wait for automatic recovery:Add context
Post updates during investigation to keep your team informed:Key concepts to know
| Concept | What to know |
|---|---|
| Incident policy | Controls when incidents open (trigger rules) and close (recovery policy) |
| Confirmation | Multi-region validation that reduces false positives |
| Cooldown | Quiet period after resolution that prevents flapping |
| Reopening | If the monitor fails again after cooldown, the same incident reopens |
Next steps
Incidents overview
Full lifecycle, statuses, severities, and sources.
Incident policies
Customize trigger rules and recovery behavior.
First alert
Get notified when incidents happen.
Incidents guide
Day-to-day incident management workflows.