What users are saying about
8 Ratings
59 Ratings
8 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow noopener noreferrer'>trScore algorithm: Learn more.</a>
Score 8 out of 100
59 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow noopener noreferrer'>trScore algorithm: Learn more.</a>
Score 8.9 out of 100

Likelihood to Recommend

Apache Flume

Apache Flume is well suited when the use case is log data ingestion and aggregate only, for example for compliance of configuration management. It is not well suited where you need a general-purpose real-time data ingestion pipeline that can receive log data and other forms of data streams (eg IoT, messages).
Anonymous | TrustRadius Reviewer

Apache Kafka

Despite the disadvantages I list, I really believe that Kafka is the right choice whenever you need a queueing or message broker system. Kafka is way too battle-tested and scales too well to ever not consider it. The only exception is if your use case requires many, many small topics. Also, Kafka doesn't support delay queues out of the box and so you will need to "hack" it through special code on the consumer side.
Anonymous | TrustRadius Reviewer

Pros

Apache Flume

  • Multiple sources of data (sources) and destinations (sinks) that allows you to move data form and to any relevant data storage
  • It is very easy to setup and run
  • Very open to personalization, you can create filters, enrichment, new sources and destinations
Juan Francisco Tavira | TrustRadius Reviewer

Apache Kafka

  • Apache Kafka is able to handle a large number of I/Os (writes) using 3-4 cheap servers.
  • It scales very well over large workloads and can handle extreme-scale deployments (eg. Linkedin with 300 billion user events each day).
  • The same Kafka setup can be used as a messaging bus, storage system or a log aggregator making it easy to maintain as one system feeding multiple applications.
Anonymous | TrustRadius Reviewer

Cons

Apache Flume

  • It is very specific for log data ingestion so it is pretty hard to use for anything else besides log data
  • Data replication is not built in and needs to be added on top of Apache Flume (not a hard job to do though)
Anonymous | TrustRadius Reviewer

Apache Kafka

  • Still a bit inmature, some clients have required recoding in the last few versions
  • New feaures coming very fast, several upgrades a year may be required
  • Not many commercial companies provide support
Juan Francisco Tavira | TrustRadius Reviewer

Likelihood to Renew

Apache Flume

No score
No answers yet
No answers on this topic

Apache Kafka

Apache Kafka 9.0
Based on 1 answer
Kafka is quickly becoming core product of the organization, indeed it is replacing older messaging systems. No better alternatives found yet
Juan Francisco Tavira | TrustRadius Reviewer

Support Rating

Apache Flume

Apache Flume 5.0
Based on 2 answers
Apache Flume is open-source so support is limited. Never the less, it has great documentation and best practices documents from their end-users so it is not hard to use, setup and configure.
Anonymous | TrustRadius Reviewer

Apache Kafka

Apache Kafka 8.8
Based on 6 answers
We are using the Apache open source version of Kafka. The community is a good place to ask questions. and we can get most of our problems resolved there.
Anonymous | TrustRadius Reviewer

Alternatives Considered

Apache Flume

Apache Flume is a very good solution when your project is not very complex at transformation and enrichment, and good if you have an external management suite like Cloudera, Hortonworks, etc. But it is not a real EAI or ETL like AB Initio or Attunity so
you need to know exactly what you want.On the other hand being an opensource project give Apache a lot of room to personalize thanks to its plug-able architecture and has a very nice performance having a very low CPU and Memory footprint, a single server can do the job on many occasions, as opposed to the multi-server architecture of paid products.
Juan Francisco Tavira | TrustRadius Reviewer

Apache Kafka

I would only use RabbitMQ over Kafka when you need to have delay queues or tons of small topics/queues around.I don't know too much about Pulsar - currently evaluating it - but it's supposed to have the same or better throughput while allowing for tons of queues. Stay tuned - I might update this review after we finish evaluating Pulsar. It's much less battle-tested though.
Anonymous | TrustRadius Reviewer

Return on Investment

Apache Flume

  • Flume has simplified a lot many of our ingest procedures, easier to deploy and integrate than a classical EAI, reducing the time to market
  • But opposed to EAIs if the project starts to grow in complexity Apache Flume project may not be as suitable
Juan Francisco Tavira | TrustRadius Reviewer

Apache Kafka

  • Positive impact on ROI since now we can use one large deployment of Apache Kafka that can be used for multiple scenarios ( storage systems, log aggregate, messaging queue ).
  • It is open-source so there are no licenses or subscription fees reducing the cost of deployment.
  • Data can now be ingested and analyzed in real-time making it easy to fine-tune the customer experience and decision making for internal IT.
Anonymous | TrustRadius Reviewer

Pricing Details

Apache Flume

General

Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No

Apache Kafka

General

Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No

Add comparison