Hive is solid data analytical tool
June 07, 2018

Hive is solid data analytical tool

Anonymous | TrustRadius Reviewer
Score 9 out of 10
Vetted Review
Verified User

Overall Satisfaction with Apache Hive

Hive is currently used in our Data Warehouse in our company. It helps us give more structure to our data and as Hive sits on top of Hadoop, the MR engine. It is a big plus when you want to run a complex query and get faster results. This helps us facilitate the Business Intelligence team to use Hive as a self-querying tool.
  • It's Fast!
  • You can store a different kind of data structures here other than the standard ones
  • Good scalability
  • Good redundancy too
  • It's not as ACID compliant as an RDBMS. It's a recently added feature and still needs work.
  • This is not the tool to go for online data processing.
  • It does not support sub-queries.
  • It can't process data in real time.
  • Helps to get good data insights from a vast and complex data stored
  • It's easy to learn HiveSQL
  • You don't have to worry about scalability as much with Hive
I have used Storm for real-time processing, but that only addresses a few data points. But for a larger access to data, Hive is well suited.
This is best suited for data analysts and scientists, it's not a programmers tool. You may still need an RDBMS to read data from as updates and deletes can get a bit more complicated, you can run batch jobs, this will have to be facilitated by additional tools.
Its good for fast query processing, for storing large amounts of data.