Overall Satisfaction with PagerDuty
We use PagerDuty extensively at Magnetic. It is our primary incident management solution for all services and environments (Production, QA, Staging). The Engineering and Operations teams have been using PagerDuty widely for the last couple of years. PagerDuty helps us ensure uptime while letting us maintain a good work-life balance.
- It has been particularly instrumental in setting escalation policies and on-call rotation schedules, which shields the rest of the team from alert noise. By setting severity levels on incidents, we make sure team members don't get paged in the middle of the night for a non-critical issue. Recent feature additions like the ability to "Add Responders" to an incident help us collaborate better as a team and deliver business value in a more streamlined fashion. The PagerDuty API and integrations have enabled us to use it in conjunction with monitoring tools like Zabbix and CloudWatch.
- One feature I would like to see is the ability to snooze incidents in bulk. For example, if there are 15 open alerts, we should be able to select all of them and snooze them at once through the UI. Currently, we have to do it incident by incident.
- PagerDuty has helped us reduce infrastructure costs by integrating with CloudWatch and alerting when we're under-utilizing our resources.
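
Until bulk snooze is available in the UI, the incident-by-incident step could in principle be scripted against the API. The following is only a rough sketch, not something taken from our setup: it assumes the PagerDuty REST API v2 snooze endpoint (`POST /incidents/{id}/snooze` with a `duration` in seconds), and the function names, API token, requester email, and incident IDs are all hypothetical placeholders.

```python
import json
import urllib.request

API_BASE = "https://api.pagerduty.com"


def build_snooze_request(incident_id, duration_seconds, api_token, from_email):
    """Build (but do not send) a snooze request for a single incident."""
    body = json.dumps({"duration": duration_seconds}).encode("utf-8")
    return urllib.request.Request(
        url=f"{API_BASE}/incidents/{incident_id}/snooze",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Token token={api_token}",
            "Accept": "application/vnd.pagerduty+json;version=2",
            "Content-Type": "application/json",
            # Assumption: the API attributes the action to this user's email.
            "From": from_email,
        },
    )


def send_with_urllib(request):
    """Send the request over the network and return the HTTP status code."""
    with urllib.request.urlopen(request) as response:
        return response.status


def snooze_incidents(incident_ids, duration_seconds, api_token, from_email,
                     send=send_with_urllib):
    """Snooze each incident in turn; return a status or error per incident ID."""
    results = {}
    for incident_id in incident_ids:
        request = build_snooze_request(
            incident_id, duration_seconds, api_token, from_email
        )
        try:
            results[incident_id] = send(request)
        except OSError as error:  # urllib's URLError and HTTPError subclass OSError
            results[incident_id] = str(error)
    return results
```

Calling `snooze_incidents(["PABC123", "PDEF456"], 3600, token, "oncall@example.com")` with real values would attempt to snooze both incidents for an hour. The `send` parameter exists only so the loop can be exercised without network access; this is still one request per incident, so it automates the manual work rather than replacing a true bulk action.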