OpsGenie is an IT monitoring and incident response platform for development and operations teams, providing alerts and schedule management escalations. OpsGenie is now part of Atlassian since the late 2018 acquisition.
$0
up to 5 users
Splunk On-Call
Score 6.5 out of 10
N/A
Formerly known as VictorOps, Splunk On-Call is an incident response system for developers, devops and operations teams that helps reduce outage time.
Incident response is well suited to OpsGenie, and this is where it really shines—whether it's an outage, a security incident, or similar. My experience is mostly with security, and it offers a great audit trail. It minimises the need to cut and paste from different platforms when creating reports and ensures that what was said and what was done (along with any evidence) is persisted and reflected in the incident detail.
I recommend Splunk on-call is more suited where there are high incident queues; multiple teams need to be involved in handling a P1 severity issue. Multiple levels of escalation are needed environment where automated action is required. I recommend the solution for large-scale & medium-scale business units. For small-scale business units, I see the functional value is less.
OpsGenie New Jira design has made it difficult for those not familiar with that style.
OpsGenie could benefit from nested escalation flows for team schedules. Creating a product alert that uses and Tech Schedule as well as an Incident Manager Schedule that already exists would create less overhead and ease management.
In general terms OpsGenie is a well done tool for solving the alert incident management, the usability is super ok during the configuration and during the alert. The main opportunity I found is the reporting and analytics section which is a little difficult to understand at a first sight and the refresh is not automatic, some little frictions but frictions at all
VictorOps support has proven excellent for us. Because it is such a widely used tool, there is a lot of documentation on usage, and a large community of users to lean on. Also, many engineers have had experience working with VictorOps already, and the tool is so easy to setup / manage that much support isn't really necessary.
We also looked at PagerDuty but decided to go with OpsGenie as it had more features on the plan we needed compared to PagerDuty which would have required us to spend a lot more for what we felt were non-premium features. Everything felt like an add-on - automation for an additional $20 a user per month seemed like a lot on top of the base plan
Splunk On-Call integrates better with our Splunk Cybersecurity and Reporting products due to the same family tree of the same eco system. We were previously using built-in on-call from individual applications and while adequate, they were difficult to manage and support SLA varied greatly across different applications. In addition we also used xMatters which did not integrate well with SAP products nor Citrix products so we were still using more than a single on-call product which was solved by implementing Splunk On-Call
Helped us track bugs and issues that came up during product launch periods which reduced overhead that normally came with needing to manually contact the right team members
Prevented last minute breaking issues from falling through the cracks, decreased time to fix by automatically alerting the team members and allowing the product and project teams to easily see what active alerts are in progress