Datadog is a monitoring service for IT, Dev and Ops teams who write and run applications at scale, and want to turn the massive amounts of data produced by their apps, tools and services into actionable insight.
$18
per month per host
OpsGenie
Score 7.7 out of 10
N/A
OpsGenie is an IT monitoring and incident response platform for development and operations teams, providing alerts and schedule management escalations. OpsGenie is now part of Atlassian since the late 2018 acquisition.
$0
up to 5 users
Splunk Enterprise
Score 8.6 out of 10
N/A
Splunk is software for searching, monitoring, and analyzing machine-generated big data, via a web-style interface. It captures, indexes and correlates real-time data in a searchable repository from which it can generate graphs, reports, alerts, dashboards and visualizations.
N/A
Pricing
Datadog
OpsGenie
Splunk Enterprise
Editions & Modules
Log Management
$1.27
per month (billed annually) per host
Infrastructure
$15.00
per month (billed annually) per host
Standard
$18
per month per host
Enterprise
$27
per month per host
DevSecOps Pro
$27
per month per host
APM
$31.00
per month (billed annually) per host
DevSecOps Enterprise
$41
per month per host
Free
$0.00
up to 5 users
Essentials
$9.00
per user/per month
Standard
$19.00
per user/per month
Enterprise
$29.00
per user/per month
No answers on this topic
Offerings
Pricing Offerings
Datadog
OpsGenie
Splunk Enterprise
Free Trial
Yes
Yes
Yes
Free/Freemium Version
Yes
Yes
Yes
Premium Consulting/Integration Services
No
No
No
Entry-level Setup Fee
Optional
No setup fee
No setup fee
Additional Details
Discount available for annual pricing. Multi-Year/Volume discounts available (500+ hosts/mo).
Datadog has been harder to setup out-of-the-box compared to its alternatives, although it's graphs and dashboards have been more useful. Other tools handle individual tasks better. For example, Splunk has been the best logging tool I've used, and New Relic is great for CPU and …
It has been easier to work with Datadog for all our business needs and get things on their roadmap if we found it lacking. Currently we use a mix of various tools as they were existing prior to Datadog came. We are evaluating new offering like Datadog's latest log management to …
I am listing how Datadog is better than below chosen NotSensu - Datadog has more integrations and easy to use UI. Prometheus - Datadog Integration are more in number than, simple installation process
We are still trying other products, but people still like Datadog. After setting up a dashboard, it's great for monitoring instances on Datadog. Also, the DevOps team had a good time setting up Datadog. It means Datadog was way easier to set up compared to those others.
We also looked at PagerDuty but decided to go with OpsGenie as it had more features on the plan we needed compared to PagerDuty which would have required us to spend a lot more for what we felt were non-premium features. Everything felt like an add-on - automation for an …
I didn't participate in the tool selection, and I don't know other similar tools. Although I think OpsGenie suits well in all the scenarios we have, so I would choose this tools again.
Splunk was better in terms of analyzing unstructured data. Also Splunk has had a very good and strong community and is also has a more tried and tested performance. I personally found the dash boarding capability of Splunk better than Datadog.
Splunk is the undisputed leader in Security Information and Event Management. Splunk Enterprise Security allows for massive, complex correlation of data across disparate sources. It can be deployed entirely on-premise or in a private cloud, giving total "data sovereignty." Datad…
Splunk Enterprise has been a well-established solution for many clients for a long time and is a major player in the market. Its track record and performance make it the best choice. Furthermore, its ease of use for developing, using, and managing data makes it the best among …
Splunk Enterprise is a very seasoned software , while other comparable software keep on adding new features and keep evolving, Splunk Enterprise has reached a state where new user onboarded doesnt have to request any basic feature or develop modules to simple tasks. Log parsing …
We are using this because it has lots of advantage over others. And it seems to be a good fit for us. Splunk provides lot more features than others and its UI is user-friendly, so for a new developer, it would not be too difficult to use it and do something around it.
Datadog may be better suited for teams that have a more out-of-the-box infrastructure, on the primary platforms Datadog supports. You may also have better results if you have a bigger team dedicated to devops and/or a bigger budget. We found that trying to adapt it to our use case (small team, .NET on AWS Fargate) wasn't feasible. We continually ran into roadblocks that required us to dig through documentation (and at times, having to figure out some documentation was wrong), go back and forth with support, and in my opinion, waste money on excessive and unintended usages due to opaque pricing models and inaccurate usage reports, as well as broken/non-functional rate sampling controls.
Incident response is well suited to OpsGenie, and this is where it really shines—whether it's an outage, a security incident, or similar. My experience is mostly with security, and it offers a great audit trail. It minimises the need to cut and paste from different platforms when creating reports and ensures that what was said and what was done (along with any evidence) is persisted and reflected in the incident detail.
It's well suited for what I do, which is network security operations. And that's for anything from troubleshooting incidents, troubleshooting performance, troubleshooting for the purpose of a compliance and auditing. It's not best suited for users who are new in terms of they're new to the product and they have expectations that probably Splunk cannot meet.
The thing which Datadog does really well, one of them are its broad range of services integrations and features which makes it one step observability solution for all. We can monitor all types of our application, infrastructure, hosts, databases etc with Datadog.
Its custom dashboard feature which helps us to visualize the data in a better way . It supports different types of charts through those charts we can create our dashboard more attractive.
Its AI powered alerting capability though that we can easily identify the root cause and also it has a low noise alerting capability which means it correlated the similar type of issues.
Alert windows cause lag in notifications (e.g. if the alert window is X errors in 1 hour, we won't get alerted until the end of the 1 hour range)
I would appreciate more supportive examples for how to filter and view metrics in the explorer
I would like a more clear interface for metrics that are missing in a time frame, rather than only showing tags/etc. for metrics that were collected within the currently viewed time frame
OpsGenie New Jira design has made it difficult for those not familiar with that style.
OpsGenie could benefit from nested escalation flows for team schedules. Creating a product alert that uses and Tech Schedule as well as an Incident Manager Schedule that already exists would create less overhead and ease management.
We are using Splunk extensively in our projects and we have recently upgraded to Splunk version 6.0 which is quite efficient and giving expected results. We keep track of updates and new features Splunk introduces periodically and try to introduce those features in our day to day activities for improvement in our reporting system and other tasks.
There are so many features that it can be hard to figure out where you need to go for your own use case. For example, RUM monitoring us buried in a "Digital Experience" sidebar setting when this is one of our key use cases that I sometimes struggle to find in the application. It appears that ECS + Fargate monitoring was recently released which is great because we had to build a lambda reporting solution for ephemeral task monitoring. But this new feature was never on my radar until I starting clicking around the application.
In general terms OpsGenie is a well done tool for solving the alert incident management, the usability is super ok during the configuration and during the alert. The main opportunity I found is the reporting and analytics section which is a little difficult to understand at a first sight and the refresh is not automatic, some little frictions but frictions at all
You can literally throw in a single word into Splunk and it will pull back all instances of that word across all of your logs for the time span you select (provided you have permission to see that data). We have several users who have taken a few of the free courses from Splunk that are able to pull data out of it everyday with little help at all.
The support team usually gets it right. We did have a rather complicate issue setting up monitoring on a domain controller. However, they are usually responsive and helpful over chat. The downside would be I don’t think they have any phone support. If that is important to you this might not be a good fit.
Splunk maintains a well resourced support system that has been consistent since we purchased the product. They help out in a timely manner and provide expert level information as needed. We typically open cases online and communicate when possible via e-mail and are able to resolve most issues with that method.
The online course was simple clear and described the main capabilities of the solution. There is also an initial module that can be done for free so anyone can familiarize themselves with the functionality of this solution. On the other hand, however, there could be more free online courses. Maybe even with a certificate, this would broaden the group of people who are familiar with the platform while increasing familiarity with the solution itself.
Our logs are very important, and Datadog manages them exceptionally well. We frequently use Datadog services for our investigations. Use case: Monitor your apps, infrastructure, APIs, and user experience.
Key features:
Logs, metrics, and APM (Application Performance Monitoring)
Real-time alerting and dashboards
Supports Kubernetes, AWS, GCP, and other integrations
RUM (Real User Monitoring) and Synthetics
✅ Best for backend, server, and distributed systems monitoring.
We also looked at PagerDuty but decided to go with OpsGenie as it had more features on the plan we needed compared to PagerDuty which would have required us to spend a lot more for what we felt were non-premium features. Everything felt like an add-on - automation for an additional $20 a user per month seemed like a lot on top of the base plan
I didn't get to fully evaluate Logstash as our corporation was already using Logstash, but both seemed like viable solutions to the problem that we were having. I wanted to evaluate Logstash some more, both did seem like they would work for the business needs that we had, we went with splunk as many teams were already using it.
Helped us track bugs and issues that came up during product launch periods which reduced overhead that normally came with needing to manually contact the right team members
Prevented last minute breaking issues from falling through the cracks, decreased time to fix by automatically alerting the team members and allowing the product and project teams to easily see what active alerts are in progress
I don't have any numbers to share but Splunk has positively served as a 24/7 monitoring tool that has saved hours of work by self-detecting, saving statistics and alerting problems in the system or from external interfaces as soon as they happen.
Splunk dashboards does a solid job in collecting, analyzing data and creating reports that contain an entire day's activity and then automatically sent out to the business.
Splunk is very easy to learn and very useful to any program or business application.