Overview
What is Apache Hive?
Apache Hive is database/data warehouse software that supports data querying and analysis of large datasets stored in the Hadoop distributed file system (HDFS) and other compatible systems, and is distributed under an open source license.
With Apache Hive, you can enter the world of Big Data
Pricing
Entry-level set up fee?
- No setup fee
Offerings
- Free Trial
- Free/Freemium Version
- Premium Consulting/Integration Services
Alternatives Pricing
What is ClicData?
ClicData is a 100% cloud-based business intelligence platform that allows users to connect, process, blend, visualize and share data from a single place. As an automated platform, users are able to rely on the latest version of company data, to ensure users make the right decisions. Hundreds of…
What is retailMetrix?
RetailMetrix is a data analytics platform for retailers with the mission of enabling retailers to get value from their data. RetailMetrix processes and stores sales, labor and customer data using data warehouse technologies. Its dashboards and reports allow teams to find the data that matters to…
Product Demos
Apache Hive Hadoop Ecosystem - Big Data Analytics Tutorial by Mahesh Huddar
Connecting Microsoft Power BI to Apache Hive using Simba Hive ODBC driver
Discover HDP 2.1: Interactive SQL Query in Hadoop with Apache Hive
Product Details
Apache Hive Technical Details
| Attribute | Value |
|---|---|
| Operating Systems | Unspecified |
| Mobile Application | No |
Reviews and Ratings (97)
Community Insights
Business Problems Solved
Apache Hive is a versatile software that has been widely used across various departments and organizations for different use cases. It has proven to be particularly helpful in handling large datasets, migrating data between different operating systems, synchronizing programs, and fetching and generating product metrics. Users have found value in using Hive for data analytics, engineering, data science, product management, and IT-related tasks such as improving analysis of big datasets stored in Hadoop HDFS.
Furthermore, Apache Hive has simplified the process of filtering and cleaning data using SQL, reducing the learning curve for handling big data. It allows users to run SQL queries against data in Hadoop, enabling efficient analysis of large datasets without the need to learn a new language. Additionally, Hive has been utilized for building reports, analyzing data stored in the Hadoop file system, processing events gathered in HDFS, and converting them into parquet files for fast querying.
Overall, users have praised Apache Hive for its scalability, accessibility, and cost-effectiveness in storing and retrieving analytics data. It has provided an intuitive solution for storing large datasets, querying big sets of data using SQL, aggregating massive datasets into distilled information for data-driven decision making, and creating external and internal tables in Hadoop/BigData projects. With its ability to process both unstructured and structured data efficiently, Hive has become an essential tool for data analysts, engineers, and business analysts across organizations.
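The external-table and Parquet-conversion workflow described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not any reviewer's actual setup: the table names (`raw_events`, `events_parquet`), the columns, and the HDFS path are hypothetical, and the Python helpers only assemble HiveQL text that would then be submitted through whichever Hive client is in use.

```python
def external_table_ddl(table, columns, location):
    """Build HiveQL for an external table over files already in HDFS.

    Dropping an external table removes only its metadata; the files stay.
    Names are interpolated directly, so pass trusted identifiers only.
    """
    cols = ",\n  ".join(f"{name} {hive_type}" for name, hive_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {table} (\n  {cols}\n)\n"
        "ROW FORMAT DELIMITED FIELDS TERMINATED BY '\\t'\n"
        "STORED AS TEXTFILE\n"
        f"LOCATION '{location}'"
    )


def parquet_copy_ddl(source, target):
    """Build a CREATE TABLE AS SELECT that rewrites `source` as a
    managed (internal) table stored in Parquet format."""
    return f"CREATE TABLE {target} STORED AS PARQUET AS SELECT * FROM {source}"


if __name__ == "__main__":
    # Hypothetical schema and path, chosen only for illustration.
    print(external_table_ddl(
        "raw_events",
        [("event_id", "BIGINT"), ("payload", "STRING")],
        "/data/raw/events",
    ))
    print(parquet_copy_ddl("raw_events", "events_parquet"))
```

The first statement leaves the raw files where they are and only registers a schema over them; the second produces a columnar copy, which is typically what makes later analytical queries faster.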
Attribute Ratings
Reviews
- Improved performance compared to a database management system.
- HDFS-based distributed processing greatly improves scalability.
- Improved query performance compared to Oracle databases.
Best Distributed Database in the market
- It offered ease of use and good performance when reading and inserting large sets of data.
- It helped improve site performance so that pages load faster.
Help your dev team!
- Reduce time to market
- Improve client satisfaction
- Saves money :)
Spectacular SQL-like interface for accessing Hadoop
- Good ROI for being able to access data easily across the network; we have large amounts of data, and this is a good system for accessing it
- Good ROI for being easy to learn how to use for new employees, not much time spent which saves costs
- Good ROI for being able to integrate with Spark and other applications, hence data can be analyzed through programs
This system makes active data of value.
- When developing projects you will obtain correct figures and true information about what you need or what you have to develop.
- We are currently trying to integrate with other tools so the software will help us.
Best query platform for ETL.
- fast results
- reduced time complexity
- code debugging is easy
It is an advance to the ease of the processes
- It used to be complicated by the many lines of connectors, but today you just need to understand where to click
- The software ensures the quality of the data, prepares and cleans it
Capabilities of Apache Hive
- The inability to work in an OLTP environment is a challenge.
- Real-time queries are not currently supported.
- Query optimization is inadequate.
Excellent bigdata warehouse solution
- Apache Hive is a secure and scalable solution that helps increase overall organizational productivity.
- Apache Hive can handle and process large amounts of data in a reasonable amount of time.
- It simplifies writing SQL queries, hence helping the organization as most companies use SQL for all query jobs.
very useful for OLTP
- Very easy to write query.
- Not suitable for OLTP environments, but well suited to OLAP.
- HiveQL is a declarative language like SQL.
Apache Hive
- Apache Hive helped to manage data on HDFS.
- Apache Hive helped with data cleansing and data transformation.
- Apache Hive queries were slow, so we had to use Impala (MPP) to expose the data to end users.
Walk into the World of Big Data with Apache Hive
- Improved performance over a traditional DBMS.
- Scalability is much better due to support for HDFS distributed processing.
- Made queries much more efficient than a traditional database such as Oracle.
- Having to maintain on-premises hardware is one dependency.
Reliable and Cheaper one stop Data warehouse solution
- It's one of the top data warehouse solutions
- All metrics computation and ad hoc analysis are done using Apache Hive
Big Data the SQL way
- A good engine for data analysis
- Easy syntax led to fast learning for the NLP team.
- Shifted to Spark later on, which supports almost all Hive functions and was faster
Apache Hive: Big data querying tool w/SQL interface, but slower, more costly computation
- Hive is crucial for our BI and reporting.
- Hive costs can ramp up quickly and can cause negative ROI when implementing inefficient schemas, queries, etc.
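One common way schema design affects Hive query cost is partitioning: when a table is partitioned by a column and queries filter on that column, Hive can skip the partitions that do not match instead of scanning the whole table. The sketch below is only an illustration of that idea and is not drawn from the reviewer's environment; the table name `page_views`, its columns, and the date value are hypothetical, and the helpers merely build HiveQL strings.

```python
def partitioned_table_ddl(table, columns, partition_columns):
    """Build HiveQL for a table partitioned by the given columns.

    Partition columns are declared separately from regular columns.
    Names are interpolated directly, so pass trusted identifiers only.
    """
    cols = ", ".join(f"{name} {hive_type}" for name, hive_type in columns)
    parts = ", ".join(f"{name} {hive_type}" for name, hive_type in partition_columns)
    return (
        f"CREATE TABLE IF NOT EXISTS {table} ({cols}) "
        f"PARTITIONED BY ({parts}) STORED AS ORC"
    )


def daily_count_query(table, partition_column, day):
    """Build a count query restricted to a single partition value."""
    return (
        f"SELECT COUNT(*) FROM {table} "
        f"WHERE {partition_column} = '{day}'"
    )


if __name__ == "__main__":
    # Hypothetical table and date, chosen only for illustration.
    print(partitioned_table_ddl(
        "page_views",
        [("user_id", "BIGINT"), ("url", "STRING")],
        [("view_date", "STRING")],
    ))
    print(daily_count_query("page_views", "view_date", "2024-01-15"))
```

A query that omits the `view_date` filter would have to read every partition, which is the kind of inefficiency that can drive costs up on large tables.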
Hive: When SQL marries with Hadoop
- It exposes the world of distributed computation (Hadoop) to users without requiring an in-depth understanding of the boilerplate details. It reduces learning time and lets data analysts focus their efforts on the core business.
Manage data for your warehouse as strong as a beehive using Apache Hive!
- We did not face an ROI problem with Apache Hive, as it's open source.
Reliable, cheap and trustworthy!
- Finding information is easier
- Fault tolerant
- Reliability of data
Apache Hive: SQL, open-source querying tool
- No licensing costs
- Little training needed for most users
My Apache Hive Review
- Installation and set up of the clusters is easy.
- Effective handling of complex queries and large sets of data.
Hive is a solid data analytics tool
- Helps get good insights from vast and complex stored data
- It's easy to learn HiveQL
- You don't have to worry about scalability as much with Hive
Hive - SQL-like query engine for big data platform
- It saves time on development because of its SQL-like syntax
- It makes it easy to access big data
- It's easy to connect reporting tools to Hive and build reports on top of it
One of the first SQL on Hadoop tools. Perhaps not the best.
- Allows analysts to use their SQL skills against large datasets.
- Slow queries allow for opportunities to discover bottlenecks, parameters to tune, and alternative tools or ways to architect a system.
Apache Hive Faster and Can handle large sets of data
- Positive impact for faster response time compared to other products
- Can handle large sets of data and complex queries
Bringing Structure to your Unstructured Data
- Hive has been instrumental in transforming the technical landscape without putting business partners at risk when converting to the Hadoop ecosystem; it helps you see your unstructured data in a structured format.
- Primary querying engine for data analytics.
- Data analytics, making vast amounts of data available for general BI uses.