Apache Flume

Apache Flume

Score 7.1 out of 10
Apache Flume


What is Apache Flume?

Apache Flume is a product enabling the flow of logs and other data into a Hadoop environment.

Read more

Recent Reviews

Read all reviews
Return to navigation

Product Details

What is Apache Flume?

Apache Flume is a product enabling the flow of logs and other data into a Hadoop environment.

Apache Flume Technical Details

Operating SystemsUnspecified
Mobile ApplicationNo
Return to navigation


View all alternatives
Return to navigation

Reviews and Ratings


Attribute Ratings


(1-1 of 1)
Companies can't remove reviews or game the system. Here's why
Score 8 out of 10
Vetted Review
Verified User
Apache Flume is used for aggregating and analyzing log data in near-real-time across the organization for compliance purposes with a goal to generate monthly compliance reports based on log data.
  • Apache Flume being a log-centric system, it is able to parse and aggregate log data very well.
  • It is easy to customize it for different source (producers) for log data ingestion as well as for sinks (consumers).
  • It is very specific for log data ingestion so it is pretty hard to use for anything else besides log data
  • Data replication is not built in and needs to be added on top of Apache Flume (not a hard job to do though)
Apache Flume is well suited when the use case is log data ingestion and aggregate only, for example for compliance of configuration management. It is not well suited where you need a general-purpose real-time data ingestion pipeline that can receive log data and other forms of data streams (eg IoT, messages).
  • Positive impact on ROI due to a reduction in manual labor to generate and maintain compliance reports based on logs.
  • Positive impact on the business objective by reducing the need for provisioning compute for log aggregate IT stack in advance but adding on an as-needed basis.
Apache Flume is on par with Scribe with similar functions. Apache Kafka is a generation purpose while Apache Flume is specific to log aggregation. Google Pub/Sub and IBM MQ are costlier than Apache Flume ( open source ) and have a lot more cost associated with them. Apama Streaming Analytics and Tibco Steaming are more comprehensive streaming solutions than Apache Flume so for deeper performance guarantees, it is easier to use Apache Flume.
Apache Flume is open-source so support is limited. Never the less, it has great documentation and best practices documents from their end-users so it is not hard to use, setup and configure.
Return to navigation