Apache Hive is database/data warehouse software that supports data querying and analysis of large datasets stored in the Hadoop distributed file system (HDFS) and other compatible systems, and is distributed under an open source license.
N/A
SAP Business Data Cloud
Score 8.4 out of 10
N/A
SAP Business Data Cloud is a fully managed SaaS solution that unifies and governs all SAP data and seamlessly connects with third-party data—giving line-of-business leaders context to make even more impactful decisions.
It handles work execution at large scale and is a good fit for new projects or organizational changes. Data lineage mapping has always been dubious, but this product has delivered good results. You can store and synchronize data from different departments; the storage process can be manual, but it is best automated.
1. Data extraction from non-SAP environments.
2. Seamless integration with SAP Analytics Cloud for reporting purposes, avoiding the need to create complex reporting dashboards.
3. Data products across different modules are available for use in the SAP environment.
4. Snowflake and Databricks offer more flexibility to address complex use cases, including the application of AI.
Apache Hive allows us to write expressive solutions to complex problems thanks to its SQL-like syntax.
Relatively easy to set up and start using.
Very little ramp-up to start using the actual product, documentation is very thorough, there is an active community, and the code base is constantly being improved.
In the new analytics world, BDC has been a game changer for SAP Analytics. Extending SAP data for use in Databricks, Snowflake, and GCP has opened new doors for analytics. Shifting from traditional data warehousing to a business data fabric, and adapting to the change in the analytics world, is the need of the hour, and SAP has managed to pull it off with BDC.
Hive is a very good big data analysis and ad-hoc query platform that also supports scaling. BI processes can be easily integrated with Hadoop via Hive. It can deal with much larger data sets than a traditional RDBMS can. It is a "must-have" component of the big data domain.
It has business-friendly options, governance features and, as expected, integration with SAP products. However, it feels complex for somebody without an SAP background and for building lighter reports. There is a lot of scope for improvement compared to the general options available in the market. Otherwise, it is best for business cases.
Apache Hive is a FOSS project, so it is open source. The support behind open source and its developer community hardly needs comment, but Hive has tremendous developer support and awesome documentation. In my experience, a great deal of support can be gathered from the community.
The support team is generally responsive and knowledgeable, and most issues are addressed within acceptable timelines. Documentation and standard guidance are helpful for common scenarios.
One of the best training sessions I have attended. They covered most of the topics and answered all our questions. Participants joined from different regions; in fact, they all had different questions and different perspectives, which helped me learn better. Even though I was traveling, I was able to attend the session.
I have implemented models both in traditional BW and using BDC. The integration of BDC with S/4HANA for creating SAP data products is seamless and reduces a lot of implementation effort. The intelligent app feature in BDC also eases the implementation effort. Comparing the previous world with the new BDC, a large share of the implementation effort is saved.
Besides Hive, I have used Google BigQuery, which is costly but has very high computation speed. Amazon Redshift is another product, which I used in my most recent organisation. Both Redshift and BigQuery are managed solutions, whereas Hive needs to be managed by the user.
With an S4 backend, a lot of core functionality is made simpler: authorization, data types, currency conversion. This is particularly true if the front-end choice is SAP Analytics Cloud. The lack of a good connection from Power BI to the Datasphere application (instead of the underlying HANA Cloud) is a major drawback in that scenario.