Likelihood to Recommend

I would recommend IBM Instana based on its strengths in automating application performance monitoring, dynamic tracing, and its ability to handle modern, containerized environments effectively. The automatic discovery and mapping feature, along with AI-powered anomaly detection, provides valuable insights for proactive issue resolution. However, I would consider factors such as the organization's specific technology stack, scalability requirements, and budget constraints. Conducting a thorough evaluation, including a trial or proof of concept, would be essential to ensure that IBM Instana aligns with our unique needs and contributes positively to our technology and business objectives.

Splunk Infrastructure Monitoring is well suited for any complicated environment where you have apps and servers across multiple clouds and platforms and products. If you have a data centre where all your apps and servers are in one single network, you could probably get away with older solutions. But for any modern, complex, hybrid-cloud microservices environment, Splunk Infrastructure Monitoring is a must-have.

Pros

Collecting Kubernetes and infra-level logs and presenting them in easy-to-understand visuals.
Tracing the end-to-end journey of an event and categorizing by technology and endpoints.
Built-in and custom alerts help to monitor almost all scenarios without much additional configuration.
Ability to create Applications to logically group important flows.

SignalFX handles historical metric aggregation exceptionally well, providing a multifaceted approach to event detection based on anomalies.
SignalFX's cost is incredibly flexible, with a pricing model based on DPM (data points per minute) rather than the traditional "per host" model that most monitoring SaaS products use.
SignalFX support is responsive and knowledgeable, very eager to help solve your immediate problems.
SignalFX integrations are vast and constantly growing, making adoption easy even when multiple different open-source technologies are used in your stack.

Cons

I believe that the "live" option in monitoring does not truly update the status in real time, so I must refresh manually to feel comfortable.
The call analysis tool might be improved, third-party resources are restricted, and the pricing is slightly higher than competitors in comparable categories.

Better integration with our clients (native mobile client SDKs would be great).
A way to easily tag filters and move them across metrics/formulas.
The alert system too easily raises false [alarms] and is hard to configure.
Ability to group by more dimensions (and have more values on each).

Likelihood to Renew

Good:
Stable system with low error rate
Easy to use for simple use cases
Bad:
UI is not very clear for complex usage
Mobile view (when logged in from phone) is bad
No library for .NET

Usability

I find that learning the interface can take some time. We need a better show-and-tell on how the Teams pages, Dashboard Groups, Dashboards and charts delay. Advanced SignalFlow is sometimes hard to build. Some better samples of advanced SignalFlow would be helpful. For example, Splunk SPL has a vast resource of examples.

Alternatives Considered

We chose IBM Instana for several reasons, and the most important one is that we didn't have to make changes in our code for nearly all of our applications for it to work. We are happy with the decision we made to use IBM Instana.

They're not for the same purpose, but we're using NewRelic and Honeycomb for monitoring purposes. NewRelic is used for HTTP client monitoring: system-related throughput, errors, database and external client monitoring. Honeycomb is used to monitor actual HTTP request/response values. Splunk [Infrastructure Monitoring] is used for real-time application-related throughput and error monitoring.

Return on Investment

System downtime is shortened as our ability to troubleshoot issues goes up.
Potential downtime is stopped as we are able to slow down calls/availability of calls in response to monitoring the dashboard.
We are able to react to issues quickly through the smart alert system rather than manual checks.

Reduced downtime.
Caused us to get a lot of spam when we redeployed apps and old instances stopped sending metrics. Muting alerts solves this, but people often forget to do it or do it incorrectly.
Helped us find historical info about instances/apps.