No SQL Database to Support Near Real Time Analytics
April 19, 2018

No SQL Database to Support Near Real Time Analytics

Vinaybabu Raghunandha Naidu | TrustRadius Reviewer
Score 8 out of 10
Vetted Review
Verified User

Overall Satisfaction with HBase

HBase was used in my previous organization(PayPal) where we needed a database for storing and retrieving records in near real time. It was used within consumer analytics and other sub-teams. It supported our near real-time use analytical cases by proving a faster lookup of records with consistency reads/writes. Apart from that, helped in querying the records much faster than other NoSQL databases.
  • Faster lookup of records using the row keys. It helped to fetch thousands of records in a much faster way using the row keys
  • As it is a columnar data store, helped us to improve the query performance and aggregations
  • Sharding helps us to optimize the data storage and retrieval. HBase provides automatic or manually sharding of tables.
  • Dynamic addition of columns and column family helped us to modify the schema with ease.
  • Identified issues with Hmaster when handling a huge number of nodes
  • Cannot have multiple indexes as row key is the only column which could be indexed.
  • HBase does not support partial row keys which limit its query performance.
  • It supports the near real-time use cases when integrated with Spark Streaming.
  • It helps to store huge volume of records with consistent reads/writes.
  • Maintenance is the pain point as it requires some maintenance and monitoring of regional servers and nodes
Compared NoSQL databases with traditional databases for faster retrieval and consistency. As MongoDB is a NoSQL supports dynamic fields, however, query performance is bad for aggregations and added maintenance. When compared with MySQL and Teradata, it could not scale up as fast as Hbase and added cost involved to it. HBase can be easily scalable to a huge volume of records, have a faster lookup and provides consistency
Compared to other MySQL databases Hbase provides better query results and dynamic in nature. It can be integrated to different computational engines like spark to support to real time use cases.
Hbase is well suited for large organizations with millions of operations performing on tables, real-time lookup of records in a table, range queries, random reads and writes and online analytics operations.

Hbase cannot be replaced for traditional databases as it cannot support all the features, CPU and memory intensive. Observed increased latency when using with MapReduce job joins.

HBase Feature Ratings

Performance
8
Availability
7
Concurrency
8
Security
6
Scalability
6
Data model flexibility
9
Deployment model flexibility
9