Data warehouse made simple, yet powerful!
October 07, 2020
Data warehouse made simple, yet powerful!

Overall Satisfaction with Apache Hive
We are using Apache Hive in our whole company as the main data warehouse software solution covering all needed data warehousing tasks. It is being used to interact with huge datasets located in a distributed storage. Since we are using a variety of data formats Apache Hive enables us to query anything with unified SQL syntax.
- Gives access to files stored in a variety of data storage systems
- Facilitates ETL operations, reporting and data analysis
- Supports queries expressed in a declarative language very similar to SQL
- Not suitable for for online transaction processing workloads
- Much more complicated than any typical RDBMS
- Licensing model based on Apache License 2.0
- We were able to achieve huge speed-up in the area of data insertion.
- Data replication for disaster recovery is finally working for our customers.
- Apache Pig and Hadoop
From my experience Apache Hive is the easiest for database admins and experts to learn due to similarities between SQL and HiveQL, which makes it the best choice for majority of customers. The remaining platforms are more hard to use and are targeted mainly towards programmers, which are able to fully utilize features available only via programmable APIs.
Do you think Apache Hive delivers good value for the price?
Yes
Are you happy with Apache Hive's feature set?
Yes
Did Apache Hive live up to sales and marketing promises?
Yes
Did implementation of Apache Hive go as expected?
Yes
Would you buy Apache Hive again?
Yes