Overall Satisfaction with Site24x7
Site24x7 is used by our IT department as a monitoring tool for all of our critical infrastructure. It ensures that the proper individuals get notified when a critical server/service goes offline or if CPU usage or another parameter goes outside of the defined range. It solves the business problem of ensuring that we are able to keep to our SLA's in terms of service uptime.
- Ability to write complex rules that can define when alerts are triggered.
- Ability to organize different servers under groups.
- SSO Integration.
- We tend to get a lot of false positives as it is difficult to perfectly configure rules especially for very specific events like High CPU or disk usage.
- The interface is very extensive and at first, it's very difficult to navigate and figure out what is where.
- Some of the UI and usability could be improved, like setting maintenance periods and defining them.
- We are better able to be notified of critical downtime, allowing us to act faster to restore critical services/applications.
- Sometimes we get flooded with emails due to false alerts and that can often be distracting to many of us.
- Thanks to their add-on pricing scheme, it eliminates the need to buy expensive modules straightaway, thus saving money.
I haven't had a lot of experience evaluating Datadog but from the research and basic demo that we got I understand that it is significantly more expensive, but it does seem to have more AI features and can assist in predicting when/if applications or essential services may go down. This kind of insight is something that Site24x7 does not have.
Site24x7 is well suited for alerting IT of critical events such as server/application downtime. It can send emails, phone calls or even text alerts. Where it gets less reliable is when notifying us when a specific application is "In Trouble". This could be either due to high CPU load or if the disk space has reached a certain threshold. These alerts tend to be less reliable and I myself often don't find them extremely trustworthy. Continuous tweaking of the rule for the alert is required to make sure it is accurately notifying people that something is in fact amiss.