devz-docz

Aggregation of onboarding and general devz standards that I have gatherd over my career.

View on GitHub

InfraSec / Alert Providers

In general, I will be using a third party provider to take alerts from the systems which are doing the monitoring – whether that’s AWS Cloudwatch, something like Sensu or Nagios, or a third party monitoring service like Pingdom – to the responders who can actual take care of the problem.

Context

Choosing which provider to use for a project is an important decision; not only do you want to know how easy it is to tie in parts of the infrastructure and services you’re building into the alerting system, but you are depending on the reliability of this service to ensure your own reliability. If the alerting system doesn’t notify you of problems, you won’t be able to address them in a timely manner.

Requirements

When making these decisions, here are some things you’ll want to take into account:

Other Considerations

Some other questions that I don’t have good answers for yet:

Comparisons

This is a comparison of three major alerting providers: OpsGenie (owned by Atlassian), PagerDuty, and VictorOps (owned by Splunk). These three provide the basic services listed above, but vary in maturity and my experience with them.

These comparisons were made with the OpGenie “Standard” ($19/user/month), PagerDuty “Platform Business” ($39/user/month), and VictorOps “Growth” ($29/user/month) tiers, as they are the ones that capture the bulk of the features I’re looking for. Where capabilities vary more based on a lower or higher tier, notes have been made in the table with the price of the option per user per month.

Please feel free to add or update this table with other information you find helpful in making a decision (including adding other vendors if it’s merited).

Feature/Objective OpsGenie Standard ($19) PagerDuty Platform Business ($39) VictorOps Growth ($29)
SLA 99.9% of alerts within 5m 99.9% of alerts within 5m (Couldn’t find this on their website?)
Notification Amounts Unlimited (users x 100 domestically at $9) Unlimited Globally (Domestically at $29) Unlimited
Notification Types Email/Push/SMS/Voice Email/Push/SMS/Voice Email/Push/SMS/Voice
SSO Yes Yes Available at $49
Team Functionality Yes Yes Yes
Scheduling Yes Yes Yes
Escalation Policies Yes Yes Yes
API Yes (Documentation) Yes (Documentation) Yes (Documentation)
Cloudwatch Integration Yes Yes Yes
Slack Integration Yes Yes Yes
Other Notable Integrations CircleCI, Jira, New Relic, StatusPage Jira, New Relic, Statuspage Jira, New Relic, Statuspage
Data Retention 1 year (unlimited at $29) Unlimited Not listed (but unlimited noted at $49)
Terraform Provider No (abandoned) Yes No official provider, some limited attempts