analytics implemented with pandas are great performance debugging tools
Use Cases and Deployment Scope
We use pandas in our analytics framework to calculate and analyze performance metrics of the operational data. It is mostly about response time for various APIs and resource consumption.
Pros
- It is easy to do statistical analysis
- It is easy to clean the data
- It is easy to produce graphs and charts to visualize
Cons
- There are a lot of libraries and ways to do visualization. Sometimes it is very confusing.
- Error handling can be a challenge. Sometimes the error messages do not provide valuable clues for the debugging.
- In our case, there are a bunch of different frameworks and libraries working together. I would rather work with one framework, well tuned for my use case
Return on Investment
- Performance debugging was time consuming and mostly poorly automated exploratory process. Once we started use pandas for these tasks, it really moved the needle. Pandas are instrumental to provide actionable insights. As a result we were able to improve notably cloud software resource utilization and performance
- Analytics implemented with pandas allow us to detect and. address problems in our APIs before they are notable to our customers
Usability
Alternatives Considered
Splunk AppDynamics, Splunk IT Service Intelligence (ITSI), Dynatrace and New Relic
Other Software Used
Splunk AppDynamics, Dynatrace, Splunk Enterprise