Apache Kafka vs. IBM InfoSphere Information Server

Overview
Product	Rating	Most Used By	Product Summary	Starting Price
Apache Kafka	Score 8.8 out of 10	N/A	Apache Kafka is an open-source stream processing platform developed by the Apache Software Foundation written in Scala and Java. The Kafka event streaming platform is used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications.	N/A
IBM InfoSphere Information Server	Score 8.0 out of 10	N/A	IBM InfoSphere Information Server is a data integration platform used to understand, cleanse, monitor and transform data. The offerings provide massively parallel processing (MPP) capabilities.	N/A

Pricing

Editions & Modules

No answers on this topic

Offerings

Pricing Offerings	Apache Kafka	IBM InfoSphere Information Server
Free Trial	No	No
Free/Freemium Version	No	No
Premium Consulting/Integration Services	No	No

Entry-level Setup Fee

No setup fee

Additional Details

—

Community Pulse
Considered Both Products

Reviewers who chose Apache Kafka (all reviews incentivized)

Ankit Singh, Senior Engineering Manager: It has very minimal overhead and doesn't have a steep learning curve.

Victor Tay, Engineer: Apache Kafka is built for scale. From high throughput and real-time data streaming, it has a strong advantage over RabbitMQ with its low latency. This put Apache Kafka at the forefront as the platform of choice for large datasets messaging and ensuring scalability when data …

Verified User, Anonymous: It had the clustering functionality and gave tolerance against machine failure.

Alok Pabalkar, Co-Founder & CTO: The biggest advantage of using Apache Kafka is that it is cloud agnostic. It handles super high volume, is fault tolerant, and is high performance.

Animesh Kumar, Senior Member of Technical Staff: Apache Kafka can work at a higher scale as compared to SQS. It can work with higher size per message and millions of messages per second. Moreover it can be scaled horizontally by adding more brokers to the cluster. SQS is good enough for simple use cases like making a task …

Verified User, Anonymous: I used other messaging/queue solutions that are a lot more basic than Confluent Kafka, as well as another solution that is no longer in the market called Xively, which was bought and "buried" by Google. In comparison, these solutions offer way fewer functionalities and respond …

Verified User, Anonymous: Apache Kafka is open-sourced, scales great, is cloud agnostic, and performs better than Amazon Kinesis [in my view]. Amazon Kinesis has some limitations and vendor lock-in is not something I [like]. With Confluent operators you can easily install it on a Kubernetes cluster.

Tyler Twitchell, Senior System Engineer: We really needed to get away from using a SQL database to act as a queue for processing records, so a new solution was needed. Kafka is a leading software application initially designed for queuing messages which is essentially what we were looking for. It has a great user …

Verified User, Anonymous: Kafka is simple and lower in price.

Borislav Traykov, DevOps Team Leader: For us, Kafka really doesn't have a 1:1 alternative. We have used ActiveMQ extensively and we still use it as a lighter option for small messages. The situation is similar with Redis - although it could be used like a Kafka alternative, we do use it just as a per-component …

Verified User, Anonymous: Apache Kafka is much more scalable and more reliable. Does not depend on memory, works well on rotational disks and that makes it a cheaper-to-use solution on low hardware requirements. Running multiple consumers on the same topic can also mean processing the same data again …

Verified User, Anonymous: All stack tech helps our app and system. These technologies allow us to have the data available faster between different regions (due to our particular configuration) and thus the data and processing load of each system is lower. This allows the systems to be used more …

Viral Patel, Senior Software Engineer: We had lots of problems with ActiveMQ. That is why we started using Apache Kafka.

Verified User, Anonymous: Kafka is not a real messaging broker implementation as RabbitMQ or TIBCO EMS/JMS are. Although it can be used as messaging, we like the idea behind Kafka (data isn't "passing by," instead it remains central, so the client can revisit the data if necessary). This also …

Verified User, Anonymous: Confluent Cloud is still based on Apache Kafka but it has a subscription fee so, from a long term perspective, it is wiser to deploy your own Kafka instance that spans public and private cloud. Amazon Kinesis, Google Cloud Pub/Sub do not do well for a very number of messages …

Verified User, Anonymous: I would only use RabbitMQ over Kafka when you need to have delay queues or tons of small topics/queues around. I don't know too much about Pulsar - currently evaluating it - but it's supposed to have the same or better throughput while allowing for tons of queues. Stay tuned - I …

Juan Francisco Tavira, Global Technology Centre - Middleware: Kafka is faster and more scalable, also "free" as open source (albeit we deploy using a commercial distribution). Infrastructure tends to be cheaper. On the other hand, projects must adapt to Kafka APIs that sometimes change and BAU increases until a major 1.x version comes out …

Reviewers who chose IBM InfoSphere Information Server (all reviews incentivized)

Karina Gonzalez, Manager Data Management: I particularly believe that Information Server, especially DataStage, is superior in many aspects to the Oracle Data Integrator tool. Several market analysts such as Gartner and / or Forrester better position DataStage on the Oracle solution.

Verified User, Anonymous: DataStage is more robust and stable than ODI. The ability to perform complex transformations or implement business rules is much more developed in DS.

Gunes INAL, Data Warehouse Consultant: Denodo is not an ETL tool, but you can manage your data with it as well as with InfoSphere.

Gonzalo Angeleri, Regional Product & Solution Architect Manager: Information Server and Informatica PowerCenter are the two leading integration platforms worldwide. Information Server has a better integration with other IBM products such as MDM or Cognos but the decision to use one or another platform is more a price decision and quantity of …

Verified User, Anonymous: Information can be used for large but simple implementation. InfoSphere provides more integration options with outside world. Abinitio is great product but stopped innovating and is not kept up to date with market needs and changes. Talend is good new product. Good Big Data …
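
Several reviewers above describe Kafka as keeping data in place so that a client can revisit it, and note that multiple consumers on the same topic may process the same data again. A minimal sketch of that idea follows, assuming a toy in-memory log: `ToyLog` and `ToyConsumer` are hypothetical names invented for illustration and are not part of any Kafka client API.

```python
from dataclasses import dataclass, field


@dataclass
class ToyLog:
    """In-memory stand-in for one topic partition (hypothetical, not a Kafka API)."""
    records: list = field(default_factory=list)

    def append(self, value):
        """Store a record and return the offset it was written at."""
        self.records.append(value)
        return len(self.records) - 1

    def read(self, offset, max_records=10):
        """Return records starting at offset; reading never removes them."""
        return self.records[offset:offset + max_records]


@dataclass
class ToyConsumer:
    """Each consumer tracks its own position in the shared log."""
    log: ToyLog
    offset: int = 0

    def poll(self, max_records=10):
        batch = self.log.read(self.offset, max_records)
        self.offset += len(batch)
        return batch

    def seek(self, offset):
        """Move back (or forward) to re-read from a chosen offset."""
        self.offset = offset


log = ToyLog()
for event in ["order-1", "order-2", "order-3"]:
    log.append(event)

billing = ToyConsumer(log)
audit = ToyConsumer(log)

first_pass = billing.poll()   # billing reads every record
audit_pass = audit.poll()     # audit independently reads the same records
billing.seek(0)               # rewind billing to the start of the log
replayed = billing.poll()     # the same records are delivered again
```

In a traditional queue, a message is typically gone once it has been consumed; in this sketch the log keeps every record and only the consumer's offset moves, which is why replay and independent readers are possible.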

Features

Data Source Connection

Comparison of Data Source Connection features of Apache Kafka and IBM InfoSphere Information Server
	Apache Kafka (no ratings)	IBM InfoSphere Information Server (8.7, 5% above category average)
Connect to traditional data sources	0 (0 ratings)	9.9 (0 ratings)
Connect to Big Data and NoSQL	0 (0 ratings)	7.5 (0 ratings)

Data Transformations

Comparison of Data Transformations features of Apache Kafka and IBM InfoSphere Information Server
	Apache Kafka (no ratings)	IBM InfoSphere Information Server (9.6, 17% above category average)
Simple transformations	0 (0 ratings)	10.0 (0 ratings)
Complex transformations	0 (0 ratings)	9.2 (0 ratings)

Data Modeling

Comparison of Data Modeling features of Apache Kafka and IBM InfoSphere Information Server
	Apache Kafka (no ratings)	IBM InfoSphere Information Server (8.0, 2% above category average)
Data model creation	0 (0 ratings)	8.7 (0 ratings)
Metadata management	0 (0 ratings)	7.7 (0 ratings)
Business rules and workflow	0 (0 ratings)	8.4 (0 ratings)
Collaboration	0 (0 ratings)	8.0 (0 ratings)
Testing and debugging	0 (0 ratings)	7.1 (0 ratings)

Data Governance

Comparison of Data Governance features of Apache Kafka and IBM InfoSphere Information Server
	Apache Kafka (no ratings)	IBM InfoSphere Information Server (9.7, 19% above category average)
Integration with data quality tools	0 (0 ratings)	10.0 (0 ratings)
Integration with MDM tools	0 (0 ratings)	9.5 (0 ratings)

Best Alternatives
	Apache Kafka	IBM InfoSphere Information Server
Small Businesses	No answers on this topic	Skyvia Score 10.0 out of 10
Medium-sized Companies	IBM MQ Score 9.0 out of 10	dbt Score 9.1 out of 10
Enterprises	IBM MQ Score 9.0 out of 10	InterSystems IRIS Score 8.1 out of 10

User Ratings
	Apache Kafka	IBM InfoSphere Information Server
Likelihood to Recommend	8.0 (0 ratings)	8.9 (0 ratings)
Likelihood to Renew	9.0 (0 ratings)	8.0 (0 ratings)
Usability	8.0 (0 ratings)	- (0 ratings)
Support Rating	8.4 (0 ratings)	- (0 ratings)

User Testimonials
	Apache Kafka	IBM InfoSphere Information Server
Likelihood to Recommend	For brokering messages, Confluent Kafka is well suited since it offers a managed solution ready to use. It is less well suited to scenarios where pricing is an issue: the solution costs quite a lot for basic usage (for example, pricing for 3 clusters is above $100k a year). (Verified User, Anonymous; incentivized)	You can use InfoSphere if you have multiple target and source systems that differ from each other; if your infrastructure is so big and poorly planned that you can't find what you want to see; or if your databases are not strong enough to process your data. (Gunes INAL, Data Warehouse Consultant; incentivized)
Pros	Apache Kafka is able to handle a large number of I/Os (writes) using 3-4 cheap servers. It scales very well over large workloads and can handle extreme-scale deployments (e.g., LinkedIn with 300 billion user events each day). The same Kafka setup can be used as a messaging bus, storage system or a log aggregator, making it easy to maintain as one system feeding multiple applications. (Verified User, Anonymous; incentivized)	It is very strong for making transformations and data derivations. It is very easy to connect to various external data sources. It has an interface (stages) for each connection that simplifies the task. It is a stable platform, and its parallelism helps make it fast for loading if the process is well designed. (Verified User, Anonymous)
Cons	The Kafka Tool is a community-made Java application that looks and feels like it is from the past century. Logging can be confusing. This certainly shows when we have to do troubleshooting. Hybrid scenarios: pub/sub where there are services both inside and outside a Kubernetes cluster. There are ~3 options, but only 2 (the harder ones) are production-safe. (Borislav Traykov, DevOps Team Leader; incentivized)	Lack of a strong web development environment. Metadata propagation in Jobs is somewhat complex. The possibility to develop jobs in Parallel and/or Server Engines is confusing. (Karina Gonzalez, Manager Data Management)
Likelihood to Renew	Kafka has suited our use case very well so far. Going forward we are planning to expand our platform manifold, so the load on Kafka and our reliance on Kafka are only going to increase. (Animesh Kumar, Senior Member of Technical Staff)	Scale of implementation. IBM tech support. (Verified User, Anonymous; incentivized)
Usability	Apache Kafka is highly recommended to develop loosely coupled, real-time processing applications. Also, Apache Kafka provides property-based configuration. Producer, consumer and broker each have their own separate property file. (Jimesh V Shah, Senior Software Engineer; incentivized)	No answers on this topic
Support Rating	Support for Apache Kafka (if willing to pay) is available from Confluent, which includes the same team that created Kafka at LinkedIn, so they know this software in and out. Moreover, Apache Kafka is well known, and best-practice documents and deployment scenarios are easily available for download, for example from eBay, LinkedIn, Uber, and NYTimes. (Verified User, Anonymous; incentivized)	No answers on this topic
Alternatives Considered	Apache Kafka is built for scale. From high throughput and real-time data streaming, it has a strong advantage over RabbitMQ with its low latency. This put Apache Kafka at the forefront as the platform of choice for large datasets messaging and ensuring scalability when data scale up tremendously. RabbitMQ however has its strengths in traditional messaging. Routing and message delivery reliability are the bedrock of RabbitMQ and this is where RabbitMQ excels. In my previous workplace, RabbitMQ was of choice as reliability matters more than scale. In two words. Apache Kafka for scale, RabbitMQ for reliability. And for cloud deployment and large dataset messaging in what I am doing now, Apache Kafka is the default choice. (Victor Tay, Engineer)	I particularly believe that Information Server, especially DataStage, is superior in many aspects to the Oracle Data Integrator tool. Several market analysts such as Gartner and / or Forrester better position DataStage on the Oracle solution. (Karina Gonzalez, Manager Data Management)
Return on Investment	Positive: bursts of traffic on special holidays are easy to handle because Kafka can absorb and buffer all the messages we need to process long enough to let an understaffed set of back-end services catch up on processing. Hard to put a number to it but we probably save $5k a month having fewer machines running. Positive: makes decoupling the web and API services from the deeper back-end services easier by providing topics as an interface. This allowed us to split up our teams and have them develop independently of each other, speeding up software development. Negative: our engineers have made mistakes such as accidentally dropping a few thousand messages due to the CLI being confusing to use, and as a result a customer lost some of their precious data. I'd say that was more our fault than Kafka's though. (Verified User, Anonymous; incentivized)	Information Server can positively impact the costs of companies by increasing the productivity of development and therefore reduce their time and costs. It is estimated that DataStage can increase a developer's productivity by 40% on average. Better data governance. Improve data quality and reduce bad data impacts. (Gonzalo Angeleri, Regional Product & Solution Architect Manager; incentivized)
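
The Usability testimonial above mentions that the producer, consumer and broker each have their own property file. As a rough illustration of what that looks like, the sketch below parses two small `key=value` snippets. The keys (`bootstrap.servers`, `acks`, `group.id`, `auto.offset.reset`) are standard Kafka client settings, but the values, the `parse_properties` helper, and the simplified parsing rules are assumptions made for this example; real `.properties` files support more syntax than is handled here.

```python
def parse_properties(text):
    """Parse simple key=value lines, skipping blank lines and # comments.

    Simplified on purpose: no line continuations, escapes, or ':' separators.
    """
    config = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        config[key.strip()] = value.strip()
    return config


# Illustrative contents of a producer property file (values are assumptions).
PRODUCER_PROPERTIES = """
# producer.properties
bootstrap.servers=localhost:9092
acks=all
"""

# Illustrative contents of a consumer property file (values are assumptions).
CONSUMER_PROPERTIES = """
# consumer.properties
bootstrap.servers=localhost:9092
group.id=example-group
auto.offset.reset=earliest
"""

producer_config = parse_properties(PRODUCER_PROPERTIES)
consumer_config = parse_properties(CONSUMER_PROPERTIES)
```

Keeping the settings in separate files means a consumer-only setting such as `group.id` never leaks into the producer's configuration, and each component can be tuned without touching the others.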