Severity Matrix

Example Severity Matrix in FireHydrant

Example Severity Matrix in FireHydrant

Have you ever been uncertain about the criticality of an incident? FireHydrant's severity matrix allows you to specify what severity levels should be automatically assigned to different combinations of fault states and impacted infrastructure. Let's take a look at how this is implemented.

πŸ“˜

Note:

Auto-assigning severity only occurs when an incident is started with components marked impacted. It does not automatically change the severity if a catalog item is added mid-incident.

Prerequisites

Configure Severity Matrix

Adding a new row to the Severities matrix table

Adding a new row to the Severities matrix table

  1. From the FireHydrant web UI, go to Settings > Severities matrix.
  2. Click on the tab for the type of component you'd like to configure automatic severities for: Services, Functionalities, or Environments.
  3. Click "+ Add impact" at the bottom left of the table to add rows to the matrix. A modal will pop up where you can select the appropriate Catalog component to conditionally assign severities and impacts for. Click "Save".
  4. Once the row is inserted into the table, you can configure different severities according to the condition(s) applied to the component. Click the dropdown and select the default severity for each condition applied to the given catalog component. For example, in the following example matrix:

  • If the api-server is marked as Unavailable when starting an incident, FireHydrant will automatically assign a SEV1 to the incident.
  • Conversely, if the app-admin-web is marked as Unavailable when starting an incident, FireHydrant will assign a SEV2 to the incident instead.

πŸ“˜

Note:

If multiple components from the matrix are marked impacted at incident start, the higher severity takes precedence.

  1. To save your severity matrix changes, click "Submit" on the bottom right of the table.

Now when anyone in your organization kicks off an incident you can be assured that it is being set to the appropriate severity based on business needs.

Next Steps

There are other powerful ways to automate things in FireHydrant via Runbooks. For example:

  • Take a look at the Add Incident Impacts Runbook step to automatically mark certain components as impacted.
    • This allows you to use the numerous other conditions available to Runbooks. For example, "if component A is down, then automatically mark component B down too," etc.
  • You can also look at the Update Incident Details Runbook step.
    • This step allows you to change the incident's details like Severity, Priority, etc. automatically. With the availability of numerous conditions, you can configure flexible automation like, "If incident has X service impacted and a tag cache-issue is added to the incident, then change the severity of the incident to SEV2.
  • You can potentially skip the Severity Matrix entirely if you want to predefine the types of incidents your organization has.
    • This simplifies the declaration process so users simply need to select a type, and everything else - from Runbooks, to severity assignment, team assignment, task lists, and more - are already pre-configured.