Apache Flume vs. Hortonworks Data Platform

Apache Flume

Apache Flume

9 Reviews and Ratings

Hortonworks Data Platform

Hortonworks Data Platform

37 Reviews and Ratings

Overview
Product	Rating	Most Used By	Product Summary	Starting Price
Apache Flume	Score 7.1 out of 10	N/A	Apache Flume is a product enabling the flow of logs and other data into a Hadoop environment.	N/A
Hortonworks Data Platform	Score 5.0 out of 10	N/A	Hortonworks Data Platform (HDP) is an open source framework for distributed storage and processing of large, multi-source data sets. HDP modernizes IT infrastructure and keeps data secure—in the cloud or on-premises—while helping to drive new revenue streams, improve customer experience, and control costs. Hortonworks merged with Cloudera in eary 2019.	N/A

Pricing

Apache Flume

Hortonworks Data Platform

Editions & Modules

No answers on this topic

No answers on this topic

Offerings

Pricing Offerings
Apache Flume	Hortonworks Data Platform
Free Trial
No	No
Free/Freemium Version
No	No
Premium Consulting/Integration Services
No	No

Entry-level Setup Fee

No setup fee

No setup fee

Additional Details

—

—

More Pricing Information

Community Pulse
	Apache Flume	Hortonworks Data Platform
Considered Both Products	Apache Flume Verified User Anonymous Chose Apache Flume Apache Flume is on par with Scribe with similar functions. Apache Kafka is a generation purpose while Apache Flume is specific to log aggregation. Google Pub/Sub and IBM MQ are costlier than Apache Flume ( open source ) and have a lot more cost associated with them. Apama … Incentivized Helpful? Juan Francisco Tavira Global Technology Centre - Middleware Chose Apache Flume Apache Flume is a very good solution when your project is not very complex at transformation and enrichment, and good if you have an external management suite like Cloudera, Hortonworks, etc. But it is not a real EAI or ETL like AB Initio or Attunity so you need to know exactly … Incentivized Helpful?	Hortonworks Data Platform Andrea Bardone Project List 2018 - 2012 Chose Hortonworks Data Platform We chose [Hortonworks Data Platform] because it's free and because [it] was an IBM partner, suggested as big data platform after biginsights platform. You can install in more physical computer without high specs, then you can use it in order to learn how to deploy, configure a … Incentivized Helpful? Verified User Anonymous Chose Hortonworks Data Platform Hortonworks Data Platform is on par with, if not better than, Cloudera or Mapr. It provides a big list of components (25-30) that you can pick and use based on your needs. It provides an easy and convenient way to add/remove any of those. It provides a good way of integrating … Incentivized Helpful? Verified User Anonymous Chose Hortonworks Data Platform Cloudera has been often compared to Hortonworks. We considered the both products and decided to try Hortonworks data platform, by several reasons. One of them was pricing and technical support. Generally speaking Cloudera outperforms Hortonworks in terms of functionalities, but … Incentivized Helpful? Fernando López Bello Big Data & Cognitive Computing Practice Leader Chose Hortonworks Data Platform There are many alternatives, but in order to provide a short list: - Cloudera CDP is the obvious contendant or alternative, being a leader in big data platforms - MapR Incentivized Helpful? Steve Gonzales Principal Manager Chose Hortonworks Data Platform Cloudera is a more mature platform. It does not require upgrades as often. However, if you need advanced capabilities, you might be lacking with the Cloudera distribution platform. Many of the other tools in the ecosystem are the same or similar. Cost might also be a factor; … Incentivized Helpful? Verified User Anonymous Chose Hortonworks Data Platform With its great performance and other benefits, we eventually moved from Cloudera to the Hortonworks platform. Incentivized Helpful? Bharadwaj (Brad) Chivukula Sr.Technical Manager/Delivery Manager Chose Hortonworks Data Platform Licensing cost is high when compared to other distribution partners VM setup - It's not as good as what Cloudera provides Incentivized Helpful? Piyush Routray Senior Software Developer Chose Hortonworks Data Platform While Apache Hadoop is completely open sourced, Hortonworks Data Platform offers support as well as keeps pace with the open source versions. Also, the HDP open sources its own products, thus giving back to the community. I find using the Hortonworks Data Platform more … Incentivized Helpful? Wonoh Kim Principal Software Engineer Chose Hortonworks Data Platform Apache, Cloudera, MapR, and IBM. Hortonworks Data Platform is more efficient to use than Apache since you don't need to configure everything by yourself. Again, Cloudera, MapR, and IBM is proprietary software. Incentivized Helpful?

Best Alternatives
	Apache Flume	Hortonworks Data Platform
Small Businesses	No answers on this topic	No answers on this topic
Medium-sized Companies	Cloudera Manager Score 9.9 out of 10	Cloudera Manager Score 9.9 out of 10
Enterprises	IBM Analytics Engine Score 7.1 out of 10	IBM Analytics Engine Score 7.1 out of 10
All Alternatives	View all alternatives	View all alternatives

User Ratings
	Apache Flume	Hortonworks Data Platform
Likelihood to Recommend	8.0 (0 ratings)	7.0 (0 ratings)
Support Rating	5.0 (0 ratings)	- (0 ratings)
Implementation Rating	- (0 ratings)	9.0 (0 ratings)

User Testimonials
	Apache Flume	Hortonworks Data Platform
Likelihood to Recommend	Apache Flume is well suited in small batch and near real time processing projects, taking data from one point to another with local processing (I mean not external enrichment). Filtering, transforming and multiple push destinations are common grounds for Flume. It is not so nice to use if your data needs external enrichment (taking data from external databases or web services), as transactions and (micro)batches may lead to reprocessing and it relies upon the application to avoid duplicates. Incentivized Juan Francisco Tavira Global Technology Centre - Middleware Read full review	I recommend [Hortonworks Data Platform] as Big Data platform in order to start your developments. It's free. It's easy to use. You can install in more server or use a sandbox with you favorite virtualization platform ( vmware or oracle virtualbox). There is also a containerized version. Manage our data in hdfs is simple; you can interact with server with REST API. Incentivized Andrea Bardone Project List 2018 - 2012 Read full review
Pros	Multiple sources of data (sources) and destinations (sinks) that allows you to move data form and to any relevant data storage It is very easy to setup and run Very open to personalization, you can create filters, enrichment, new sources and destinations Incentivized Juan Francisco Tavira Global Technology Centre - Middleware Read full review	It is a well suited data platform to support big data storage and analysis, with computational efficiency, good performance, and stability. It is free to use. Online development community is well supported. Hortonworks engineers seem to have good experience and skill sets. It is easy and fast to integrate with other tools or components for big data handling and analysis. Incentivized Verified User Anonymous Read full review
Cons	It is very specific for log data ingestion so it is pretty hard to use for anything else besides log data Data replication is not built in and needs to be added on top of Apache Flume (not a hard job to do though) Incentivized Verified User Anonymous Read full review	As an open source project collection, it relies strongly on community activity. You still have the option to contract premium consulting or training services. Altough it is quickly evolving into Data Science tools availability (eg. Tensorflow incorporate in HDP 3), it can be cumbersome from a developer transitioning from a traditional IDE, into the notebook vs. datalake metaphore. As expected for a big data infranstructure, the resource requirements base line is rather high. This means that if used on premise, you need to think of about 10 machines for a minimal reasonable deploy. Incentivized Fernando López Bello Big Data & Cognitive Computing Practice Leader Read full review
Support Rating	Apache Flume is open-source so support is limited. Never the less, it has great documentation and best practices documents from their end-users so it is not hard to use, setup and configure. Incentivized Verified User Anonymous Read full review	No answers on this topic
Implementation Rating	No answers on this topic	Try not to change variable names. Incentivized Wonoh Kim Principal Software Engineer Read full review
Alternatives Considered	Apache Flume is on par with Scribe with similar functions. Apache Kafka is a generation purpose while Apache Flume is specific to log aggregation. Google Pub/Sub and IBM MQ are costlier than Apache Flume ( open source ) and have a lot more cost associated with them. Apama Streaming Analytics and Tibco Steaming are more comprehensive streaming solutions than Apache Flume so for deeper performance guarantees, it is easier to use Apache Flume. Incentivized Verified User Anonymous Read full review	While Apache Hadoop is completely open sourced, Hortonworks Data Platform offers support as well as keeps pace with the open source versions. Also, the HDP open sources its own products, thus giving back to the community. I find using the Hortonworks Data Platform more intuitive than Cloudera or MapR versions. Incentivized Piyush Routray Senior Software Developer Read full review
Return on Investment	Positive impact on ROI due to a reduction in manual labor to generate and maintain compliance reports based on logs. Positive impact on the business objective by reducing the need for provisioning compute for log aggregate IT stack in advance but adding on an as-needed basis. Incentivized Verified User Anonymous Read full review	It provides a convenient way of quickly setting up a big data environment, easily setting up clusters with different configurations. It provides several security architectures that can be used as well. Since it provides a big list of components and packaged together, it is a great tool for companies to get set and utilize it for their use cases. Since it uses Ambari extensively to install, upgrade and manage software, it is very convenient and easy to support and operationalize the components. Alerting and notifications, ability to create custom alerts give you the capability to add any number of alerts to meet your custom needs. It provides a great way to maintain other software by creating mpacks and the ability to add custom code, and you can add other software to be managed in a centralized tool. The use and support of popular and useful open source software and the company's contribution to the community makes HDP a very useful tool that enables a quick, secure, easily maintainable suite of components that can help companies meet the needs of the business. What is great is that new components keep getting added based on any new useful tool that comes available, like Druid, and made available as part of the suite of components. That helps businesses keep up with new capabilities as they become available, and use them to solve their problems. Incentivized Verified User Anonymous Read full review
ScreenShots