The Apache HBase project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware. Apache HBase is an open-source, distributed, versioned, non-relational database modeled after Google's Bigtable.
N/A
Hive
Score 9.0 out of 10
N/A
Hive Technology offers their eponymous project management and process management application, providing integrations with many popularly used applications for productivity, cloud storage, and collaboration.
$24
per month per user
Pricing
Apache HBase
Hive
Editions & Modules
No answers on this topic
Free
$0
Lite
$24
per month per user
Growth
$34
per month per user
Pro
$59
per month per user
Elite
Contact Sales
Offerings
Pricing Offerings
HBase
Hive
Free Trial
No
Yes
Free/Freemium Version
No
Yes
Premium Consulting/Integration Services
No
No
Entry-level Setup Fee
No setup fee
No setup fee
Additional Details
—
A discount is offered for annual pricing.
More Pricing Information
Community Pulse
Apache HBase
Hive
Features
Apache HBase
Hive
NoSQL Databases
Comparison of NoSQL Databases features of Product A and Product B
Apache HBase
7.7
5 Ratings
14% below category average
Hive
-
Ratings
Performance
7.15 Ratings
00 Ratings
Availability
7.85 Ratings
00 Ratings
Concurrency
7.05 Ratings
00 Ratings
Security
7.85 Ratings
00 Ratings
Scalability
8.65 Ratings
00 Ratings
Data model flexibility
7.15 Ratings
00 Ratings
Deployment model flexibility
8.25 Ratings
00 Ratings
Project Management
Comparison of Project Management features of Product A and Product B
Apache HBase
-
Ratings
Hive
9.1
15 Ratings
16% above category average
Task Management
00 Ratings
9.015 Ratings
Resource Management
00 Ratings
9.015 Ratings
Gantt Charts
00 Ratings
10.014 Ratings
Scheduling
00 Ratings
7.014 Ratings
Workflow Automation
00 Ratings
9.014 Ratings
Team Collaboration
00 Ratings
10.015 Ratings
Support for Agile Methodology
00 Ratings
10.012 Ratings
Support for Waterfall Methodology
00 Ratings
8.011 Ratings
Document Management
00 Ratings
10.013 Ratings
Email integration
00 Ratings
10.013 Ratings
Mobile Access
00 Ratings
8.011 Ratings
Timesheet Tracking
00 Ratings
10.09 Ratings
Change request and Case Management
00 Ratings
10.011 Ratings
Budget and Expense Management
00 Ratings
7.09 Ratings
Professional Services Automation
Comparison of Professional Services Automation features of Product A and Product B
Hbase is well suited for large organizations with millions of operations performing on tables, real-time lookup of records in a table, range queries, random reads and writes and online analytics operations. Hbase cannot be replaced for traditional databases as it cannot support all the features, CPU and memory intensive. Observed increased latency when using with MapReduce job joins.
Hive is a powerful tool for data analysis and management that is well-suited for a wide range of scenarios. Here are some specific examples of scenarios where Hive might be particularly well-suited: Data warehousing: Hive is often used as a data warehousing platform, allowing users to store and analyze large amounts of structured and semi-structured data. It is especially good at handling data that is too large to be stored and analyzed on a single machine, and supports a wide variety of data formats. Batch processing: Hive is designed for batch processing of large datasets, making it well-suited for tasks such as data ETL (extract, transform, load), data cleansing, and data aggregation.Simple queries on large datasets: Hive is optimized for simple queries on large datasets, making it a good choice for tasks such as data exploration and summary statistics. Data transformation: Hive allows users to perform data transformations and manipulations using custom scripts written in Java, Python, or other programming languages. This can be useful for tasks such as data cleansing, data aggregation, and data transformation. On the other hand, here are some specific examples of scenarios where Hive might be less appropriate: Real-time queries: Hive is a batch-oriented system, which means that it is designed to process large amounts of data in a batch mode rather than in real-time. While it is possible to use Hive for real-time queries, it may not be the most efficient choice for this type of workload. Complex queries: Hive is optimized for simple queries on large datasets, but may struggle with more complex queries or queries that require multiple joins or subqueries.Very large datasets: While Hive is designed to scale horizontally and can handle large amounts of data, it may not scale as well as some other tools for very large datasets or complex workloads.
Simplicity, it offers a clean environment without risking the outcome. An example of this are the timesheets that allow a fast way to keep track of progress
Interaction, the different options make it faster and easier to interact and collaborate in the development of a product. An example of this would be Hive Notes for meetings
The different visualisations it offers allow to explore the best ways to affront your projects. I really like the Gantt mappings view to understand who can be contacted at each point
Stored procedures functionality is not available so it should be implemented.
HBase is CPU and Memory intensive with large sequential input or output access while as Map Reduce jobs are primarily input or output bound with fixed memory. HBase integrated with Map-reduce jobs will result in random latencies.
There's really not anything else out there that I've seen comparable for my use cases. HBase has never proven me wrong. Some companies align their whole business on HBase and are moving all of their infrastructure from other database engines to HBase. It's also open source and has a very collaborative community.
Our CSR is easily accessible and they have support built into the app itself. They also have a pretty robust support site. We also took advantage of the free trial and learned so much by putting Hive through the paces and figuring out the best way to mold it to our needs.
Cassandra os great for writes. But with large datasets, depending, not as great as HBASE. Cassandra does support parquet now. HBase still performance issues. Cassandra has use cases of being used as time series. HBase, it fails miserably. GeoSpatial data, Hbase does work to an extent. HA between the two are almost the same.
Hive is a bit different than Jira and Monday, which I used mostly. Overall does a great job managing project and helps with team communication. Removes dependency of asking team members for updates by going to conference rooms. With Hive, the team updates the status, and we can easily track it.
As Hbase is a noSql database, here we don't have transaction support and we cannot do many operations on the data.
Not having the feature of primary or a composite primary key is an issue as the architecture to be defined cannot be the same legacy type. Also the transaction concept is not applicable here.
The way data is printed on console is not so user-friendly. So we had to use some abstraction over HBase (eg apache phoenix) which means there is one new component to handle.