TrustRadius
Apache Hive

Overview

What is Apache Hive?

Apache Hive is database/data warehouse software that supports data querying and analysis of large datasets stored in the Hadoop distributed file system (HDFS) and other compatible systems, and is distributed under an open source license.

Recent Reviews

TrustRadius Insights

Apache Hive is a versatile software that has been widely used across various departments and organizations for different use cases. It has …

Help your dev team!

8 out of 10
April 12, 2022
Incentivized
We build our data lake and perform queries on large amounts of data. We group data from multiple sources into a common structure, making …

very useful for OLTP

10 out of 10
April 06, 2022
Incentivized
We use Apache to process large data and get the output with less process time. The framework is very much useful for data processing and …

Big Data the SQL way

8 out of 10
September 23, 2020
Incentivized
I am working as a Research Assistant where I have to process tons of data to produce appropriate findings. Our NLP lab used it for all its …


Pricing

N/A
Unavailable

Entry-level set up fee?

  • No setup fee

Offerings

  • Free Trial
  • Free/Freemium Version
  • Premium Consulting/Integration Services

Alternatives Pricing

What is ClicData?

ClicData is a 100% cloud-based business intelligence platform that allows users to connect, process, blend, visualize and share data from a single place. As an automated platform, users are able to rely on the latest version of company data, to ensure users make the right decisions. Hundreds of…

What is retailMetrix?

RetailMetrix is a data analytics platform for retailers with the mission of enabling retailers to get value from their data. RetailMetrix processes and stores sales, labor and customer data using data warehouse technologies. Its dashboards and reports allow teams to find the data that matters to…


Product Demos

Apache Hive Hadoop Ecosystem - Big Data Analytics Tutorial by Mahesh Huddar

YouTube

Connecting Microsoft Power BI to Apache Hive using Simba Hive ODBC driver

YouTube

Discover HDP 2.1: Interactive SQL Query in Hadoop with Apache Hive

YouTube

Product Details

Apache Hive Technical Details

Operating Systems: Unspecified
Mobile Application: No

Frequently Asked Questions

Reviewers rate Usability highest, with a score of 8.5.

The most common users of Apache Hive are from Enterprises (1,001+ employees).

Reviews and Ratings

(97)

Community Insights

TrustRadius Insights are summaries of user sentiment data from TrustRadius reviews and, when necessary, 3rd-party data sources.

Apache Hive is versatile software that has been widely used across departments and organizations for a range of use cases. It has proven particularly helpful in handling large datasets, migrating data between different operating systems, synchronizing programs, and fetching and generating product metrics. Users have found value in using Hive for data analytics, engineering, data science, product management, and IT-related tasks such as improving analysis of big datasets stored in Hadoop HDFS.

Furthermore, Apache Hive has simplified the process of filtering and cleaning data using SQL, reducing the learning curve for handling big data. It allows users to run SQL queries against data in Hadoop, enabling efficient analysis of large datasets without the need to learn a new language. Additionally, Hive has been used for building reports, analyzing data stored in the Hadoop file system, and processing events gathered in HDFS and converting them into Parquet files for fast querying.
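
As a rough illustration of the Parquet conversion described here, the sketch below builds a HiveQL CREATE TABLE ... AS SELECT statement in Python. The table and column names (raw_events, events_parquet, event_type) are hypothetical and do not come from any review, and the helper parquet_ctas is an assumption made for this example. The resulting string would still need to be submitted through whichever Hive client is in use.

```python
from typing import Optional


def parquet_ctas(source: str, target: str, where: Optional[str] = None) -> str:
    """Build a HiveQL CTAS statement that copies `source` into a Parquet-backed table."""
    stmt = f"CREATE TABLE {target} STORED AS PARQUET AS SELECT * FROM {source}"
    if where:
        # Optional SQL filter, e.g. to clean or subset the data during conversion.
        stmt += f" WHERE {where}"
    return stmt


# Hypothetical table and column names, for illustration only.
print(parquet_ctas("raw_events", "events_parquet", "event_type = 'click'"))
```

The filter argument is where the SQL-based cleaning the reviewers mention would typically go; the statement itself is plain SQL, which is the point of Hive's appeal.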

Overall, users have praised Apache Hive for its scalability, accessibility, and cost-effectiveness in storing and retrieving analytics data. It has provided an intuitive solution for storing large datasets, querying big sets of data using SQL, aggregating massive datasets into distilled information for data-driven decision making, and creating external and internal tables in Hadoop/BigData projects. With its ability to process both unstructured and structured data efficiently, Hive has become an essential tool for data analysts, engineers, and business analysts across organizations.
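
To make the distinction between external and internal (managed) tables concrete, here is a minimal Python sketch that generates the corresponding HiveQL DDL. The helper create_table_ddl, the table names, the columns, and the HDFS path are all hypothetical. As a simplification for this example, passing a location is treated as the signal to create an external table.

```python
from typing import List, Optional, Tuple


def create_table_ddl(
    name: str,
    columns: List[Tuple[str, str]],
    location: Optional[str] = None,
) -> str:
    """Return HiveQL DDL; in this sketch, giving a location makes the table EXTERNAL."""
    cols = ", ".join(f"{col} {typ}" for col, typ in columns)
    if location is None:
        # Managed (internal) table: Hive owns the data, and DROP TABLE deletes it.
        return f"CREATE TABLE IF NOT EXISTS {name} ({cols}) STORED AS PARQUET"
    # External table: Hive tracks only metadata; DROP TABLE leaves the files in place.
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {name} ({cols}) "
        f"STORED AS PARQUET LOCATION '{location}'"
    )


# Hypothetical schema and path, for illustration only.
schema = [("user_id", "BIGINT"), ("event_type", "STRING")]
print(create_table_ddl("events_managed", schema))
print(create_table_ddl("events_external", schema, "hdfs:///data/events"))
```

External tables suit data that other tools also read from HDFS, since dropping the table does not remove the underlying files.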

Attribute Ratings

Reviews

(1-25 of 35)
Score 9 out of 10
Vetted Review
Verified User
Incentivized
  • Good ROI for being able to access data easily across the network; we have large amounts of data and this is a good system for accessing it
  • Good ROI for being easy to learn how to use for new employees, not much time spent which saves costs
  • Good ROI for being able to integrate with Spark and other applications, hence data can be analyzed through programs
Score 9 out of 10
Vetted Review
Verified User
Incentivized
  • Apache Hive is a secure and scalable solution that helps increase overall organizational productivity.
  • Apache Hive can handle and process large amounts of data in a timely manner.
  • It simplifies writing SQL queries, hence helping the organization as most companies use SQL for all query jobs.
akshay kashyap | TrustRadius Reviewer
Score 9 out of 10
Vetted Review
Verified User
Incentivized
  • Improved performance compared with a traditional DBMS.
  • Scalability is much better thanks to HDFS-backed distributed processing.
  • Made queries much more efficient than on a traditional database such as Oracle.
  • One dependency is having to maintain on-premises hardware.
September 23, 2020

Big Data the SQL way

Score 8 out of 10
Vetted Review
Verified User
Incentivized
  • A good engine for data analysis
  • Easy syntax led to fast learning for the NLP team.
  • Later shifted to Spark, which supports almost all Hive functions and was faster
Score 8 out of 10
Vetted Review
Verified User
Incentivized
  • It exposes the distributed computing world (Hadoop) to users without requiring an in-depth understanding of the boilerplate details. This reduces learning time and lets data analysts focus their efforts on the core business.
Jordan Moore | TrustRadius Reviewer
Score 7 out of 10
Vetted Review
Verified User
Incentivized
  • Allows analysts to use their SQL skills against large datasets.
  • Slow queries allow for opportunities to discover bottlenecks, parameters to tune, and alternative tools or ways to architect a system.
Bharadwaj (Brad) Chivukula | TrustRadius Reviewer
Score 9 out of 10
Vetted Review
Verified User
Incentivized
  • Hive has been instrumental in transforming the technical landscape without putting business partners at risk when converting to the Hadoop ecosystem; it helps you see your unstructured data in a structured format.
  • Primary querying engine for data analytics.
  • Data analytics, making vast amounts of data available for general BI uses.