Apache HBase vs. Apache Hive

Overview
Product: Apache HBase
Rating: 7.3 out of 10
Most Used By: N/A
Product Summary: The Apache HBase project's goal is the hosting of very large tables (billions of rows by millions of columns) atop clusters of commodity hardware. Apache HBase is an open-source, distributed, versioned, non-relational database modeled after Google's Bigtable.
Starting Price: N/A

Product: Apache Hive
Rating: 8.1 out of 10
Most Used By: N/A
Product Summary: Apache Hive is database/data warehouse software that supports data querying and analysis of large datasets stored in the Hadoop Distributed File System (HDFS) and other compatible systems, and is distributed under an open source license.
Starting Price: N/A
Pricing
Editions & Modules
Apache HBase: No answers on this topic
Apache Hive: No answers on this topic
Pricing Offerings
Free Trial: Apache HBase: No; Apache Hive: No
Free/Freemium Version: Apache HBase: No; Apache Hive: No
Premium Consulting/Integration Services: Apache HBase: No; Apache Hive: No
Entry-level Setup Fee: Apache HBase: No setup fee; Apache Hive: No setup fee
Community Pulse
Considered Both Products
HBase
Chose Apache HBase
HBase is more secure and easily scalable. HBase is a wide-column store, while MongoDB is a document store. Triggers are available in HBase but not in MongoDB.
Chose Apache HBase
Compared NoSQL databases with traditional databases for faster retrieval and consistency. MongoDB, as a NoSQL database, supports dynamic fields; however, its query performance is poor for aggregations and it adds maintenance overhead. When compared with MySQL and Teradata, it could not scale up as …
Chose Apache HBase
HBase is what you should use if you want a production-ready, scalable, JSON-friendly, key-value, NoSQL enterprise storage option. It excels over MongoDB due to its integration with the extensive Hadoop stack and all of the tools, frameworks, and benefits there.

HBase has superior …
Chose Apache HBase
Typically, Cassandra is faster on reads and HBase is faster on writes. You use Cassandra when you want to back a website; HBase is just a good overall general-purpose database engine. Cassandra has its own storage engine, while HBase uses HDFS and gains all its benefits. MongoDB is …
Apache Hive
Chose Apache Hive
Apache Hive is a query language developed by Facebook to query large distributed datasets. Apache Hive is a query engine that runs on top of HDFS, so it uses the resources of the Hadoop HDFS setup, while Apache Spark is an in-memory compute engine, and that's why [it is] much …
Features
NoSQL Databases
Comparison of NoSQL Databases features of Apache HBase and Apache Hive
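Apache HBase: 7.7 (5 ratings), 13% below category average
Apache Hive: - (no ratings)
Performance: Apache HBase 7.1 (5 ratings); Apache Hive 0 (0 ratings)
Availability: Apache HBase 7.8 (5 ratings); Apache Hive 0 (0 ratings)
Concurrency: Apache HBase 7.0 (5 ratings); Apache Hive 0 (0 ratings)
Security: Apache HBase 7.8 (5 ratings); Apache Hive 0 (0 ratings)
Scalability: Apache HBase 8.6 (5 ratings); Apache Hive 0 (0 ratings)
Data model flexibility: Apache HBase 7.1 (5 ratings); Apache Hive 0 (0 ratings)
Deployment model flexibility: Apache HBase 8.2 (5 ratings); Apache Hive 0 (0 ratings)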
Best Alternatives
Small Businesses
Apache HBase: IBM Cloudant (score 7.9 out of 10)
Apache Hive: Google BigQuery (score 8.7 out of 10)
Medium-sized Companies
Apache HBase: IBM Cloudant (score 7.9 out of 10)
Apache Hive: Cloudera Enterprise Data Hub (score 9.0 out of 10)
Enterprises
Apache HBase: IBM Cloudant (score 7.9 out of 10)
Apache Hive: Oracle Exadata (score 8.5 out of 10)
User Ratings
Likelihood to Recommend: Apache HBase 7.7 (10 ratings); Apache Hive 8.0 (35 ratings)
Likelihood to Renew: Apache HBase 7.9 (10 ratings); Apache Hive 10.0 (1 rating)
Usability: Apache HBase - (0 ratings); Apache Hive 8.5 (7 ratings)
Support Rating: Apache HBase - (0 ratings); Apache Hive 7.0 (6 ratings)
User Testimonials
Apache HBase | Apache Hive
Likelihood to Recommend
Apache HBase
HBase is well suited to large organizations that perform millions of operations on tables: real-time lookups of records, range queries, random reads and writes, and online analytics operations. HBase cannot replace traditional databases, as it does not support all of their features and is CPU and memory intensive. We observed increased latency when using it with MapReduce job joins.
Apache Hive
It executes work at large scale and is good to use for new projects or organizational changes. Data lineage mapping has always been dubious, but this product has produced good results. You can store and synchronize data from different departments; the storage process can be manual, but it is best automated.
Pros
Apache HBase
  • Scalability. HBase can scale to trillions of records.
  • Fast. HBase is extremely fast to scan values or retrieve individual records by key.
  • HBase can be accessed by standard SQL via Apache Phoenix.
  • Integrated. I can easily store and retrieve data from HBase using Apache Spark.
  • It is easy to set up DR and backups.
  • Ingest. It is easy to ingest data into HBase via shell, Java, Apache NiFi, Storm, Spark, Flink, Python and other means.
Apache Hive
  • Apache Hive allows us to write expressive solutions to complex problems thanks to its SQL-like syntax.
  • Relatively easy to set up and start using.
  • Very little ramp-up to start using the actual product, documentation is very thorough, there is an active community, and the code base is constantly being improved.
Cons
Apache HBase
  • There are very few commands in HBase.
  • Stored procedure functionality is not available, so it should be implemented.
  • HBase is CPU and memory intensive under large sequential I/O access, whereas MapReduce jobs are primarily I/O bound with fixed memory. Integrating HBase with MapReduce jobs results in unpredictable latency.
Apache Hive
  • Some queries, particularly complex joins, are still quite slow and can take hours.
  • Previous jobs and queries are sometimes not stored.
  • Switching to Impala can sometimes be time-consuming (i.e. the system hangs or is slow to respond).
  • Sometimes directories and tables don't load properly, which causes confusion.
Likelihood to Renew
Apache HBase
There's really not anything else out there that I've seen comparable for my use cases. HBase has never proven me wrong. Some companies align their whole business on HBase and are moving all of their infrastructure from other database engines to HBase. It's also open source and has a very collaborative community.
Apache Hive
I do not know of another data warehouse solution that integrates with HDFS as well as Hive does.
Usability
Apache HBase
No answers on this topic
Apache Hive
Hive is a very good platform for big data analysis and ad hoc queries, and it also supports scaling. BI processes can be easily integrated with Hadoop via Hive. It can handle much larger data sets than a traditional RDBMS can. It is a "must-have" component of the big data domain.
Support Rating
Apache HBase
No answers on this topic
Apache Hive
Apache Hive is a FOSS project. Little needs to be said about support for open source software and its developer community, but Hive has tremendous developer support and excellent documentation. Much of the support you need can be gathered from the community.
Alternatives Considered
Apache HBase
Cassandra is great for writes, but with large datasets, depending on the case, it is not as good as HBase. Cassandra does support Parquet now. HBase still has performance issues. Cassandra has use cases as a time-series store; HBase fails miserably there. For geospatial data, HBase does work to an extent. HA between the two is almost the same.
Apache Hive
Besides Hive, I have used Google BigQuery, which is costly but has very high computation speed. Amazon Redshift is another product, which I used at my most recent organisation. Both Redshift and BigQuery are managed solutions, whereas Hive needs to be managed.
Return on Investment
Apache HBase
  • Because HBase is a NoSQL database, there is no transaction support, and we cannot perform many operations on the data.
  • The lack of a primary or composite primary key is an issue, as the architecture cannot follow the same legacy design. The transaction concept is also not applicable here.
  • The way data is printed on the console is not very user-friendly, so we had to use an abstraction over HBase (e.g. Apache Phoenix), which means there is one more component to handle.
Apache Hive
  • Apache Hive is a secure and scalable solution that helps increase overall organizational productivity.
  • Apache Hive can handle and process large amounts of data in a reasonable amount of time.
  • It simplifies writing SQL queries, which helps the organization, as most companies use SQL for all query jobs.