Multiple data sources and destinations (sinks) that let you move data from and to any relevant data store
It is very easy to set up and run
Very open to customization: you can create filters, enrichment steps, new sources and new destinations

Apache Flume adds new functionality at a slower pace than other open-source projects; it is well behind Kafka and has some compatibility issues with the latest releases
It lacks high availability (HA) and fault tolerance (FT); it relies on third-party management software such as Hortonworks or Cloudera

Flume has greatly simplified many of our ingest procedures; it is easier to deploy and integrate than a classical EAI, which reduces time to market
But unlike an EAI, Apache Flume may not be as suitable if the project starts to grow in complexity

Logstash

Apache Kafka, Logstash, TIBCO BusinessWorks, TIBCO Enterprise Message Service

Verified User

Analyst in Information Technology (51-200 employees)

Because Apache Flume is a log-centric system, it parses and aggregates log data very well.
It is easy to customize for different sources (producers) of log data as well as for sinks (consumers).

It is very specific to log data ingestion, so it is pretty hard to use for anything other than log data
Data replication is not built in and needs to be added on top of Apache Flume (not a hard job, though)

Positive impact on ROI due to a reduction in the manual labor needed to generate and maintain compliance reports based on logs.
Positive impact on the business objective: compute for the log-aggregation IT stack no longer has to be provisioned in advance and can instead be added on an as-needed basis.

TIBCO Streaming (StreamBase), Apache Kafka, Google Cloud Pub/Sub, IBM MQ and Apama Streaming Analytics

Apama Streaming Analytics, TIBCO Streaming (StreamBase)

Apache Flume

What is Apache Flume?