Datadog is a monitoring service for IT, Dev and Ops teams who write and run applications at scale, and want to turn the massive amounts of data produced by their apps, tools and services into actionable insight.
$18
per month per host
Splunk Observability Cloud
Score 7.8 out of 10
N/A
Splunk Observability Cloud aims to enable operational agility and better customer experience through real-time AI-driven streaming analytics allowing accurate alerts in seconds. It is designed to shorten MTTD and MTTR by providing real-time visibility into cloud infrastructure and services.
$180
per year per host
Pricing
Editions & Modules

Datadog
Log Management: $1.27 per month (billed annually) per host
Infrastructure: $15.00 per month (billed annually) per host
Standard: $18 per month per host
Enterprise: $27 per month per host
DevSecOps Pro: $27 per month per host
APM: $31.00 per month (billed annually) per host
DevSecOps Enterprise: $41 per month per host

Splunk Observability Cloud
Infrastructure: $15 per month (billed annually) per host
App & Infra: $60 per month (billed annually) per host
End-to-End: $75 per month (billed annually) per host
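The two headline prices use different billing periods ($18 per month per host for Datadog, $180 per year per host for Splunk Observability Cloud), so it helps to put them on the same annual basis. The sketch below does only that arithmetic. The host count is a hypothetical value chosen for illustration, and the calculation ignores discounts, usage-based charges, and any other line items.

```python
from decimal import Decimal

# Headline list prices quoted in this comparison (USD per host).
DATADOG_PER_HOST_PER_MONTH = Decimal("18")
SPLUNK_PER_HOST_PER_YEAR = Decimal("180")

MONTHS_PER_YEAR = 12


def annualize(monthly_price):
    """Convert a per-month price to a per-year price."""
    return monthly_price * MONTHS_PER_YEAR


def fleet_cost(hosts, per_host_per_year):
    """Annual cost for a fleet of `hosts` at a flat per-host yearly price."""
    return hosts * per_host_per_year


# Hypothetical fleet size, not taken from the source.
HOSTS = 50

datadog_total = fleet_cost(HOSTS, annualize(DATADOG_PER_HOST_PER_MONTH))
splunk_total = fleet_cost(HOSTS, SPLUNK_PER_HOST_PER_YEAR)

print(f"Datadog: ${datadog_total} per year for {HOSTS} hosts")
print(f"Splunk Observability Cloud: ${splunk_total} per year for {HOSTS} hosts")
```

On these list prices alone, Datadog works out to $216 per host per year against $180 for Splunk Observability Cloud. Actual quotes may differ.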
Offerings
Pricing Offerings (Datadog / Splunk Observability Cloud)
Free Trial: Yes / Yes
Free/Freemium Version: Yes / No
Premium Consulting/Integration Services: No / No
Entry-level Setup Fee: Optional / No setup fee
Additional Details (Datadog): Discount available for annual pricing. Multi-Year/Volume discounts available (500+ hosts/mo).
Splunk is better for multicloud environments, and its UI is very good compared to other solutions. It also saves time and improves developer productivity, because detectors can be saved as code.
SignalFX is a strong competitor in the monitoring SaaS space and provides the basic necessities for production-grade monitoring and alerting. Other solutions may offer easier adoption and other helpful features, but will have trouble competing on cost for organizations that …
Where Datadog is good:
- Real-time Visibility During Incidents: During high-severity incidents, Datadog dashboards, coupled with real-time logging and APM traces, provide immediate insight into system health and enable fast triage. For example, we’ve used trace ID correlation between logs and APM to quickly identify downstream service failures due to network degradation during a major outage.
- Service Ownership at Scale: With over 50 engineering teams, providing self-service monitoring is essential. We use Datadog monitors, SLO dashboards, and templates so teams can track their own service health without reinventing the wheel. Tagging and RBAC features help us scope data access appropriately.
Where Datadog can improve: While Datadog’s logging capabilities are powerful, storing all application logs in Datadog can become cost-prohibitive at high volumes.
The query language is relatively easy and flexible when looking into an application's problems. These queries can then be used for alerts, reports, and dashboards. I believe Splunk is a platform that can help a system grow into proactive application management, using incidents to add insights as needed rather than trying to work out every scenario in advance.
Alert windows cause lag in notifications (e.g. if the alert condition is X errors in 1 hour, we won't get alerted until the end of the 1-hour range).
I would appreciate more examples of how to filter and view metrics in the explorer.
I would like a clearer interface for metrics that are missing in a time frame, rather than only showing tags etc. for metrics that were collected within the currently viewed time frame.
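The alert-window lag described above is typical of a condition that is evaluated only once a fixed window closes. One common alternative is a sliding (trailing) window that is checked on every event, so the alert can fire as soon as the threshold is reached. The following is a minimal sketch of that idea in plain Python. It is not Datadog's or Splunk's API; the class and method names are hypothetical, and it assumes timestamps arrive in non-decreasing order.

```python
from collections import deque


class SlidingWindowAlert:
    """Fire when `threshold` errors fall inside the trailing window.

    Hypothetical helper for illustration only, not a vendor API.
    """

    def __init__(self, threshold, window_seconds):
        self.threshold = threshold
        self.window_seconds = window_seconds
        self._events = deque()  # error timestamps, oldest first

    def record_error(self, timestamp):
        """Record one error at `timestamp` (in seconds).

        Returns True if the alert condition holds right after this error.
        """
        self._events.append(timestamp)
        # Drop errors that are no longer inside the trailing window.
        cutoff = timestamp - self.window_seconds
        while self._events and self._events[0] <= cutoff:
            self._events.popleft()
        return len(self._events) >= self.threshold


# Three errors within one hour should trigger the alert.
alert = SlidingWindowAlert(threshold=3, window_seconds=3600)
fired_at = None
for ts in [0, 600, 1200, 1800]:  # errors every 10 minutes
    if alert.record_error(ts) and fired_at is None:
        fired_at = ts

print(fired_at)  # 1200: fires on the third error, not at the end of the hour
```

The trade-off is that the condition is re-evaluated on every event rather than once per window, which costs more computation but removes the wait for the window to close.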
Good:
- Stable system with low error rate
- Easy to use for simple use cases
Bad:
- UI is not very clear for complex usage
- Mobile view (when logged in from phone) is bad
- No library for .NET
Datadog's user interface is quite friendly and easy to navigate. With menus clearly categorized and the ability to bookmark important dashboards, one can easily find what they're looking for. For dashboards, the ability to move, resize, and group visualizations is really helpful for keeping them organized. Automatic suggestions from Datadog for important visualizations based on the metrics and logs would provide another level of ease of use.
When there is an issue, it’s a win if one can easily identify the root cause. To do that, a tool should let the user dig deep across multiple data points, compare the data, and identify the anomaly. For this use case, it’s good to drive the investigation from Splunk o11y.
The support team usually gets it right. We did have a rather complicated issue setting up monitoring on a domain controller. However, they are usually responsive and helpful over chat. The downside is that I don’t think they have any phone support. If that is important to you, this might not be a good fit.
We are still trying other products, but people still like Datadog. After setting up a dashboard, it's great for monitoring instances on Datadog. Also, the DevOps team had a good time setting up Datadog. It means Datadog was way easier to set up compared to those others.
Splunk Infrastructure Monitoring provides far superior options for anybody using a complex hybrid multi-cloud environment and allows both your SOC and NOC to work together on the same data while driving their own insights. We found other products are still in the old world view of servers and agents residing together within a single data centre, but modern apps are no longer like this.