Datadog is a monitoring service for IT, Dev and Ops teams who write and run applications at scale, and want to turn the massive amounts of data produced by their apps, tools and services into actionable insight.
$18
per month per host
Grafana OnCall
Score 9.0 out of 10
N/A
Grafana OnCall is an open source, easy-to-use on-call management tool built to help teams improve their collaboration and resolve incidents faster. Grafana OnCall (formerly known as Amixr) was started in 2018 at Amixr Inc., which Grafana Labs acquired in 2021.
$0
Pricing
Datadog
Grafana OnCall
Editions & Modules
Log Management
$1.27
per month (billed annually) per host
Infrastructure
$15.00
per month (billed annually) per host
Standard
$18
per month per host
Enterprise
$27
per month per host
DevSecOps Pro
$27
per month per host
APM
$31.00
per month (billed annually) per host
DevSecOps Enterprise
$41
per month per host
No answers on this topic
Offerings
Pricing Offerings
Datadog
Grafana OnCall
Free Trial
Yes
No
Free/Freemium Version
Yes
Yes
Premium Consulting/Integration Services
No
No
Entry-level Setup Fee
Optional
No setup fee
Additional Details
Discount available for annual pricing. Multi-Year/Volume discounts available (500+ hosts/mo).
Datadog can be pricey for larger scale businesses, so it really depends on your use case. For us, we have a small single deployment application and a small developer team, so our costs are mostly reasonable. There are more features than we can explore which can be somewhat overwhelming. It is mostly easy and intuitive to use but for larger scale you may consider rolling your own solutions.
Setting up the Grafana OnCall configuration is from the UI is a little bit complex, you need to create an integration, from there an escalation chain ann after that you need to change the template for the notificaion. Also the documentation is not entirely clear, particularly the section on provisioning with Terraform, because production stack is provisioned with infra structure as code.
The thing which Datadog does really well, one of them are its broad range of services integrations and features which makes it one step observability solution for all. We can monitor all types of our application, infrastructure, hosts, databases etc with Datadog.
Its custom dashboard feature which helps us to visualize the data in a better way . It supports different types of charts through those charts we can create our dashboard more attractive.
Its AI powered alerting capability though that we can easily identify the root cause and also it has a low noise alerting capability which means it correlated the similar type of issues.
Alert windows cause lag in notifications (e.g. if the alert window is X errors in 1 hour, we won't get alerted until the end of the 1 hour range)
I would appreciate more supportive examples for how to filter and view metrics in the explorer
I would like a more clear interface for metrics that are missing in a time frame, rather than only showing tags/etc. for metrics that were collected within the currently viewed time frame
There is some room for improvement, but the Datadog team sends out updates frequently, and the UI is user-friendly for engineers, with no significant loading issues or region-specific problems. That was one of the key reasons we preferred Datadog; our company has employees worldwide, and it wasn't difficult to transition to the tool.
setting up the Grafana OnCall functionality is complex but if you setup this and you know how to setup then the rest is easy. IAC is provided with terraform The UI for the alert groups shows the alerts in one view and from there you can go to the relevant alert trigger to solve the problems.
The support team usually gets it right. We did have a rather complicate issue setting up monitoring on a domain controller. However, they are usually responsive and helpful over chat. The downside would be I don’t think they have any phone support. If that is important to you this might not be a good fit.
It's a one-stop solution for all our needs whereas in other open-source tools, we have an operational overhead to keep and manage the uptime of these tools as well and also manage their versioning, upgrade, and patching cycle. Also if there are any bugs then we have to raise an open source issue and many problems as we have to keep 2 to 3 people aligned to manage the stack.
Unlike Grafana OnCall tool Splunk On-Call solution looks more complicated and noticeably inferior in the visualization of the information presented (UI is not user friendly). Alerts re-route process looks totally illogical and you need some time to get with it. However Splunk On-Call looks more featurable out-of-box and has iOS and Android apps while Grafana OnCall provides a general dashboard apps only.