TrustRadius: an HG Insights company

IBM StreamSets

Score8 out of 10

17 Reviews and Ratings

What is IBM StreamSets?

IBM® StreamSets enables users to create and manage smart streaming data pipelines through a graphical interface, facilitating data integration across hybrid and multicloud environments. IBM StreamSets can support millions of data pipelines for analytics, applications and hybrid integration.

Categories & Use Cases

IBM StreamSets - An Ideal Solution for Hybrid Data Integration

Use Cases and Deployment Scope

At Unify Logistics Solutions, we use IBM StreamSets to manage real-time data pipelines across our hybrid cloud and on-prem systems. It helps us solve key challenges like data drift, inconsistent formats, and latency in logistics tracking. With StreamSets, we can ingest, transform, and deliver data from IoT sensors, warehouse systems, and partner APIs into our analytics platforms with minimal manual effort.

Pros

  • Seamlessly pulls data from IoT sensors, warehouse systems, and external APIs into our analytics platforms with minimal latency.
  • Connects cloud and on-prem sources effortlessly, enabling unified data flow across our logistics ecosystem.
  • Automatically identifies changes in data structure or format, helping us avoid pipeline failures and maintain data integrity.

Cons

  • Some niche data sources require custom development or third-party plugins, which slows down integration.
  • The interface sometimes lags when handling large data flows or complex pipeline designs.
  • Certain advanced features aren’t well documented, leading to trial-and-error during setup.

Return on Investment

  • Time savings – Automated data pipelines reduced manual processing by 60%, freeing up engineering hours.
  • Easily scaled pipelines across cloud and on-prem systems, supporting business growth without major rework.

Usability

Alternatives Considered

Fivetran

Other Software Used

IBM FileNet Content Manager, Canva, IBM Cloud Object Storage

IBM StreamSets does the job

Use Cases and Deployment Scope

So in my organisation we majorly use IBM StreamSets to automate data flows between our CRM and analytics tools. Before it, we used to do it manually/ some other non effective tool and spend hours moving and cleaning data which was quite frustrating to be honest. Now we can set up pipelines that run quite smoothly and also keeps the reports accurate.

Pros

  • It makes building data pipelines quite super intuitive even for non coders.
  • Ir also handles real time data ingestion effortlessly so I always have up to date information for my reports.
  • It's great at monitoring data quality as well.

Cons

  • The error messages I feel aren t always very descriptive so troubleshooting can take longer
  • Maybe more customisation options for scheduling can be done, rest it works pretty well.

Return on Investment

  • Reduced manual handling, cutting down operational costs for our team.
  • It also accelerated our time to Insight, which has eventually led to faster decision making.
  • Data quality is improved.

Usability

Other Software Used

Snowflake, IBM Planning Analytics

Real-Time Data Pipelines Made Simple with IBM StreamSets

Use Cases and Deployment Scope

I mainly use IBM StreamSets to stream data from our on-prem systems to cloud applications and use them in real-time user applications to give them the latest information of various business reports that users create on different systems like client onboarding applications etc which then gets streamed to advisor applications where the advisor users create reports out of this available data and use it in their regular day to day work activities.

Pros

  • It helps streaming huge data that we have in our Teradata database to various reporting applications that runs on cloud seamlessly.
  • We also use IBM StreamSets to power few BI dashboards that our product managers use on regular basis to showcase various data with clients.
  • I think the data quality is way better compared to Informatica tool.

Cons

  • IBM should make things easy for beginners to get started with IBM StreamSets tool. Most new joinees in my team always find it difficult to do debugging in existing pipelines that we have.
  • The integration limitations are there. Like compared to Java where it integrates well but other frameworks like Python, .NET etc, the support is not so good.
  • The UI/UX interface, while intuitive for simple pipelines, sometime becomes cluttered and hard to navigate when managing complex pipelines involving more data streams.

Return on Investment

  • With IBM StreamSets we are able to do realtime data integration from our data warehouse to various user facing applications in cloud without much overheads.
  • Also, the low code pipeline design and reusable templates helps me create more pipelines at faster pace and make things prod ready in short time.
  • One negative thing that I can say is the licensing cost. Initially when we started using we had very less data pipeline so our usage was very minimal during the POC stage. Later when we started creating pipelines, we started seeing the licensing cost go up huge. So this is the only negative thing I feel.

Usability

Alternatives Considered

Informatica Cloud API & App Integration

Other Software Used

Snowflake, Liquibase, Denodo

IBM StreamSets the data giant

Use Cases and Deployment Scope

I used IBM StreamSets for data analysis. It is a brilliant tool for monitoring data for analysis and provide pie charts and graphs in an easily readable format which lets even a not so well trained but knows enough person it read it efficiently and accurately. The charts and graphs give thorough information about the data without missing any key points.

Pros

  • Graphs and charts are designed well
  • Data summation is amazing
  • Easy to read and understand the summed up information

Cons

  • Where the person's skillsets in data analysis is not of an expert.
  • Data monitoring and analysis.
  • Customer data for better customer acquisition

Return on Investment

  • Good for the Analysis and the product team to decide what to work on next. So, we have a better video watch time.
  • What are the pages, sections of the websites that are less touched, so they can be changed or removed altogether to save company's money.
  • What are the areas we need to improve on, on the basis of categories and sub-categories that are less watched.

Usability

Alternatives Considered

Fivetran, AWS Glue and Informatica PowerCenter (legacy)

Other Software Used

IBM API Connect, TemplateToaster, AWS Chatbot

good tool for creation and monitoring of live data pipelines

Use Cases and Deployment Scope

the solution allows a live ingestion of data from different sources and create customized pipelines, monitor them and use them for many different scopes. data are real time and can have various format, are then integrated and collected into one unique layer from which are available for control and execution

Pros

  • science and analysis of real time data
  • create reaction chains to specific events
  • unification of data into one unique layer

Cons

  • design of the pipeline is complicated
  • integration of AI to support the user
  • reporting and monitor dashboards

Return on Investment

  • time saving for automatic collection and integration of data
  • time saving thanks to live monitoring and reaction
  • time saving for standardization of data

Usability

Alternatives Considered

AWS Glue

Other Software Used

DELMIA Apriso, FactoryTalk by Rockwell Automation, SAP Manufacturing Execution