Apache Kafka vs. Apache Lucene

Apache Kafka

Apache Kafka

152 Reviews and Ratings

Apache Lucene

Apache Lucene

9 Reviews and Ratings

Overview
Product	Rating	Most Used By	Product Summary	Starting Price
Apache Kafka	Score 8.8 out of 10	N/A	Apache Kafka is an open-source stream processing platform developed by the Apache Software Foundation written in Scala and Java. The Kafka event streaming platform is used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications.	N/A
Apache Lucene	Score 10.0 out of 10	N/A	Apache Lucene is an open source and free text search engine library written in Java. It is a technology suitable for applications that requires full-text search, and is available cross-platform.	N/A

Pricing

Apache Kafka

Apache Lucene

Editions & Modules

No answers on this topic

No answers on this topic

Offerings

Pricing Offerings
Apache Kafka	Apache Lucene
Free Trial
No	No
Free/Freemium Version
No	Yes
Premium Consulting/Integration Services
No	No

Entry-level Setup Fee

No setup fee

No setup fee

Additional Details

—

A free and open source product.

More Pricing Information

Community Pulse
	Apache Kafka	Apache Lucene
Considered Both Products	Apache Kafka AS Ankit Singh Senior Engineering Manager Chose Apache Kafka It has very minimal overhead and doesn't have a steep learning curve. Incentivized Helpful? VT Victor Tay Engineer Chose Apache Kafka Apache Kafka is built for scale. From high throughput and real-time data streaming, it has a strong advantage over RabbitMQ with its low latency. This put Apache Kafka at the forefront as the platform of choice for large datasets messaging and ensuring scalability when data … Incentivized Helpful? Verified User Anonymous Chose Apache Kafka It had the clustering functionality and gave tolerance against machine failure. Incentivized Helpful? Alok Pabalkar Co-Founder & CTO Chose Apache Kafka - The biggest advantage of using Apache Kafka is that it is cloud agnostic - It handles super high volume, is fault tolerance, high performance Incentivized Helpful? Animesh Kumar Senior Member of Technical Staff Chose Apache Kafka Apache Kafka can work at a higher scale as compared to SQS. It can work with higher size per message and millions of messages per second. Moreover it can be scaled horizontally by adding more brokers to the cluster. SQS is good enough for simple use cases like making a task … Incentivized Helpful? Verified User Anonymous Chose Apache Kafka I used other messaging/queue solutions that are a lot more basic than Confluent Kafka, as well as another solution that is no longer in the market called Xively, which was bought and "buried" by Google. In comparison, these solutions offer way fewer functionalities and respond … Incentivized Helpful? Verified User Anonymous Chose Apache Kafka Apache Kafka is open-sourced, scales great has cloud agnostics and performs better than Amazon Kinesis [in my view]. Amazon Kinesis has some limitations and vendor lockin is not something I [like]. With Confluent operators you can easily install it on a kubernetes cluster. Incentivized Helpful? Tyler Twitchell Senior System Engineer Chose Apache Kafka We really needed to get away from using a SQL database to act as a queue for processing records, so a new solution was needed. Kafka is a leading software application initially designed for queuing messages which is essentially what we were looking for. It has a great user … Incentivized Helpful? Verified User Anonymous Chose Apache Kafka Kafka is simple and lower in price. Incentivized Helpful? Borislav Traykov DevOps Team Leader Chose Apache Kafka For us, Kafka really doesn't have a 1:1 alternative. We have used ActiveMQ extensively and we still use it as a lighter option for small messages. The situation is similar with Redis - although it could be used like a Kafka alternative, we do use it just as a per-component … Incentivized Helpful? Verified User Anonymous Chose Apache Kafka Apache Kafka is much more scalable and more reliable. Does not depend on memory, works well on rotational disks and that makes it a cheaper to use solution on low hardware requirements. Running multiple consumers on the same topic can also mean processing the same data again … Incentivized Helpful? Verified User Anonymous Chose Apache Kafka All stack tech helps our app and system. These technologies allow us to have the data available faster between different regions (due to our particular configuration) and thus the data and processing load of each system is lower. This allows the systems to be used more … Incentivized Helpful? Viral Patel Senior Software Engineer Chose Apache Kafka We had lots of problems with active mq. That is why we started using Apache Kafka. Incentivized Helpful? Verified User Anonymous Chose Apache Kafka Kafka is not a real messaging broker implementation as RabbitMQ or TIBCO EMS/JMS are. Although it can be used as messaging, we like the idea behind the Kafka (data isn't "passing by," instead it remains centra, so the client can revisit the data if necessary). This also … Incentivized Helpful? Verified User Anonymous Chose Apache Kafka Confluent Cloud is still based on Apache Kafka but it has a subscription fee so, from a long term perspective, it is wiser to deploy your own Kafka instance that spans public and private cloud. Amazon Kinesis, Google Cloud Pub/Sub do not do well for a very number of messages … Incentivized Helpful? Verified User Anonymous Chose Apache Kafka I would only use RabbitMQ over Kafka when you need to have delay queues or tons of small topics/queues around. I don't know too much about Pulsar - currently evaluating it - but it's supposed to have the same or better throughput while allowing for tons of queues. Stay tuned - I … Incentivized Helpful? Juan Francisco Tavira Global Technology Centre - Middleware Chose Apache Kafka Kafka is faster and more scalable, also "free" as opensource (albeit we deploy using a commercial distribution). Infrastructure tends to be cheaper. On the other hand, projects must adapt to Kafka APIs that sometimes change and BAU increases until a major 1.x version comes out … Incentivized Helpful?	Apache Lucene Sirish Vadala Applications Developer Information Technology Specialist Chose Apache Lucene The search and index performance of [Apache] Lucene is excellent and the quality of results is good, if not better. For implementing it with small scale applications it is a no brainer, Lucene is the best and most cost effective solution. Learning curve is not too steep either. Incentivized Helpful? Verified User Anonymous Chose Apache Lucene Apache Solr, Apache Spark, Apache Kafka, Apache Tomcat, Apache Cordova, Apache Derby and Apache Web Server Incentivized Helpful? Craig J. Stadler Search Engineer Chose Apache Lucene I have tried Elastic and Sphinx, each has their benefits but I feel like Apache Lucene overall is the best performing and easiest to setup and maintain. Incentivized Helpful?

Best Alternatives
	Apache Kafka	Apache Lucene
Small Businesses	No answers on this topic	Yext Score 7.4 out of 10
Medium-sized Companies	IBM MQ Score 8.8 out of 10	Guru Score 9.3 out of 10
Enterprises	IBM MQ Score 8.8 out of 10	Guru Score 9.3 out of 10
All Alternatives	View all alternatives	View all alternatives

User Ratings
	Apache Kafka	Apache Lucene
Likelihood to Recommend	8.0 (0 ratings)	10.0 (0 ratings)
Likelihood to Renew	9.0 (0 ratings)	- (0 ratings)
Usability	8.0 (0 ratings)	- (0 ratings)
Support Rating	8.4 (0 ratings)	- (0 ratings)

User Testimonials
	Apache Kafka	Apache Lucene
Likelihood to Recommend	For brokering messages, Confluent Kafka is well suited since it offers a managed solution ready to use. Scenarios where the solution is not very well suited are for example, where pricing is an issue. The solution costs quite a lot for basic usage (for example: for 3 clusters, pricing is above 100k$ a year). Incentivized Verified User Anonymous Read full review	Apache Lucene offers great full-text search library that makes it easy to add search functionality to a website or other applications. Lucene is ideal if you want low-level access to the indexes and its APIs. For general purposes, Apache Solr, the web application built atop of Lucene can be used instead. Apache Solr comes with caching, HTTP/ JSON APIs and a simple web administration console. Incentivized Verified User Anonymous Read full review
Pros	Apache Kafka is able to handle a large number of I/Os (writes) using 3-4 cheap servers. It scales very well over large workloads and can handle extreme-scale deployments (eg. Linkedin with 300 billion user events each day). The same Kafka setup can be used as a messaging bus, storage system or a log aggregator making it easy to maintain as one system feeding multiple applications. Incentivized Verified User Anonymous Read full review	Fast indexing, with proper optimization I can index a Gig of data in 2 mins. Easy integration with web crawlers Quick and Accurate Results Flexible sorting option for results based on the search field and relevance Incentivized Sirish Vadala Applications Developer Information Technology Specialist Read full review
Cons	The Kafka Tool is a community-made Java application that looks and feels from the past century. Logging can be confusing. This certainly shows when we have to do troubleshooting. Hybrid scenarios - pub/sub, but there are services in and outside a Kubernetes cluster. Then there are a ~3 options, but only 2 (the harder ones) are production-safe. Incentivized Borislav Traykov DevOps Team Leader Read full review	We had difficulty porting the project to a cluster based environment on the cloud. For our particular use case of retrieving documents based on text pattern matching, the program worked efficiently however, we did not find many resources for image pattern recognition based on their metadata. Incentivized Verified User Anonymous Read full review
Likelihood to Renew	Kafka has suited our use case very well so far. Going forward we are planning to expand our platform manifold so the load on Kafka and our reliance on Kafka is going to increase only. Animesh Kumar Senior Member of Technical Staff Read full review	No answers on this topic
Usability	Apache Kafka is highly recommended to develop loosely coupled, real-time processing applications. Also, Apache Kafka provides property based configuration. Producer, Consumer and broker contain their own separate property file Incentivized JV Jimesh V Shah Senior Software Engineer Read full review	No answers on this topic
Support Rating	Support for Apache Kafka (if willing to pay) is available from Confluent that includes the same time that created Kafka at Linkedin so they know this software in and out. Moreover, Apache Kafka is well known and best practices documents and deployment scenarios are easily available for download. For example, from eBay, Linkedin, Uber, and NYTimes. Incentivized Verified User Anonymous Read full review	No answers on this topic
Alternatives Considered	Apache Kafka is built for scale. From high throughput and real-time data streaming, it has a strong advantage over RabbitMQ with its low latency. This put Apache Kafka at the forefront as the platform of choice for large datasets messaging and ensuring scalability when data scale up tremendously. RabbitMQ however has its strengths in traditional messaging. Routing and message delivery reliability are the bedrock of RabbitMQ and this is where RabbitMQ excels. In my previous workplace, RabbitMQ was of choice as reliability matters more than scale. In two words. Apache Kafka for scale, RabbitMQ for reliability. And for cloud deployment and large dataset messaging in what I am doing now, Apache Kafka is the default choice. VT Victor Tay Engineer Read full review	The search and index performance of [Apache] Lucene is excellent and the quality of results is good, if not better. For implementing it with small scale applications it is a no brainer, Lucene is the best and most cost effective solution. Learning curve is not too steep either. Incentivized Sirish Vadala Applications Developer Information Technology Specialist Read full review
Return on Investment	Positive: bursts of traffic on special holidays are easy to handle because Kafka can absorb and buffer all the messages we need to process long enough to let an understaffed set of back-end services catch up on processing. Hard to put a number to it but we probably save $5k a month having fewer machines running. Positive: makes decoupling the web and API services from the deeper back-end services easier by providing topics as an interface. This allowed us to split up our teams and have them develop independently of each other, speeding up software development. Negative: our engineers have made mistakes such as accidentally dropping a few thousand messages due to the CLI being confusing to use, and as a result a customer lost some of their precious data. I'd say that was more our fault than Kafka's though. Incentivized Verified User Anonymous Read full review	Very good at using minimal hardware sets saving money on hosting. Very good at housing multiple cores or instances. Incentivized Craig J. Stadler Search Engineer Read full review
ScreenShots		Apache Lucene Screenshots