dbt vs. StreamSets DataOps Platform

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
dbt
Score 9.4 out of 10
N/A
dbt is an SQL development environment, developed by Fishtown Analytics, now known as dbt Labs. The vendor states that with dbt, analysts take ownership of the entire analytics engineering workflow, from writing data transformation code to deployment and documentation. dbt Core is distributed under the Apache 2.0 license, and paid Teams and Enterprise editions are available.
$0
per month per seat
StreamSets
Score 8.4 out of 10
N/A
StreamSets in San Francisco offers their DataOps Platform, a subscription based streaming analytics platform including StreamSets Data Collector data source management, Control Hub for data movement architecture management, StreamSets Data Collector Edge IoT manager, DataFlow Performance Manager (DPM), and StreamSets Data Protector compliance (e.g. GDPR) compliance module.N/A
Pricing
dbtStreamSets DataOps Platform
Editions & Modules
No answers on this topic
No answers on this topic
Offerings
Pricing Offerings
dbtStreamSets
Free Trial
YesNo
Free/Freemium Version
YesNo
Premium Consulting/Integration Services
YesNo
Entry-level Setup FeeNo setup feeNo setup fee
Additional Details——
More Pricing Information
Community Pulse
dbtStreamSets DataOps Platform
Top Pros
Top Cons
Features
dbtStreamSets DataOps Platform
Data Transformations
Comparison of Data Transformations features of Product A and Product B
dbt
9.7
5 Ratings
15% above category average
StreamSets DataOps Platform
-
Ratings
Simple transformations9.55 Ratings00 Ratings
Complex transformations9.95 Ratings00 Ratings
Data Modeling
Comparison of Data Modeling features of Product A and Product B
dbt
9.0
5 Ratings
10% above category average
StreamSets DataOps Platform
-
Ratings
Data model creation9.15 Ratings00 Ratings
Metadata management8.65 Ratings00 Ratings
Business rules and workflow8.05 Ratings00 Ratings
Collaboration9.83 Ratings00 Ratings
Testing and debugging9.55 Ratings00 Ratings
Streaming Analytics
Comparison of Streaming Analytics features of Product A and Product B
dbt
-
Ratings
StreamSets DataOps Platform
9.0
1 Ratings
11% above category average
Visualization Dashboards00 Ratings7.01 Ratings
Low Latency00 Ratings8.01 Ratings
Integrated Development Tools00 Ratings10.01 Ratings
Data wrangling and preparation00 Ratings10.01 Ratings
Data Enrichment00 Ratings10.01 Ratings
Best Alternatives
dbtStreamSets DataOps Platform
Small Businesses
Skyvia
Skyvia
Score 9.6 out of 10
IBM Streams
IBM Streams
Score 9.0 out of 10
Medium-sized Companies
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.1 out of 10
Confluent
Confluent
Score 7.4 out of 10
Enterprises
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.1 out of 10
Spotfire Streaming
Spotfire Streaming
Score 8.0 out of 10
All AlternativesView all alternativesView all alternatives
User Ratings
dbtStreamSets DataOps Platform
Likelihood to Recommend
9.6
(7 ratings)
9.0
(1 ratings)
User Testimonials
dbtStreamSets DataOps Platform
Likelihood to Recommend
dbt Labs
If you can load your data first into your warehouse, dbt is excellent. It does the T(ransformation) part of ELT brilliantly but does not do the E(xtract) or L(oad) part. If you know SQL or your development team knows SQL, it's a framework and extension around that. So, it's easy to learn and easy to hire people with that technical skill (as opposed to specific Informatica, SnapLogic, etc. experience). dbt uses plain text files and integrates with GitHub. You can easily see the changes made between versions. In GUI-based UIs it was always hard to tell what someone had changed. Each "model" is essentially a "SELECT" statement. You never need to do a "CREATE TABLE" or "CREATE VIEW" - it's all done for you, leaving you to work on the business logic. Instead of saying "FROM specific_db.schema.table" you indicate "FROM ref('my_other_model')". It creates an internal dependency diagram you can view in a DAG. When you deploy, the dependencies work like magic in your various environments. They also have great documentation, an active slack community, training, and support. I like the enhancements they have been making and I believe they are headed in a good direction.
Read full review
StreamSets
Majorly for all Batch and Streaming Scenarios we are designing StreamSets pipelines, few best suited and tried out use cases below : 1. JDBC to ADLS data transfer based on source refresh frequency. 2. Kafka to GCS. 3. Kafka to Azure Event. 4. Hub HDFS to ADLS data transfer. 5. Schema generation to generate Avro. The easy to design Canvas, Scheduling Jobs, Fragment creation and utilization, an inbuilt wide range of Stage availability makes it an even more favorable tool for me to design data engineering pipelines.
Read full review
Pros
dbt Labs
  • user experience makes it easy to work with SQL and version control
  • customer success team and the dbt (data build tool) community help establish best practices
  • thorough and clear documentation
Read full review
StreamSets
  • A easy to use canvas to create Data Engineering Pipeline.
  • A wide range of available Stages ie. Sources, Processors, Executors, and Destinations.
  • Supports both Batch and Streaming Pipelines.
  • Scheduling is way easier than cron.
  • Integration with Key-Vaults for Secrets Fetching.
Read full review
Cons
dbt Labs
  • Slow load times of the dbt cloud environment (they're working on it via a new UI though)
  • More out-of-the-box solutions for managing procedures, functions, etc would be nice to have, but honestly, it's pretty easy to figure out how to adapt dbt macros
Read full review
StreamSets
  • Monitoring/Visualization can be improvised and enhanced a lot (e.g. to monitor a Job to see what happened 7 days back with data transfer).
  • The logging mechanism can be simplified (Logs can be filtered with "ERROR", "DEBUG", "ALL" etc but still takes some time to get familiar for understanding).
  • Auto Scalability for heavy load transfer (Taking much time for >5 million record transfer from JDBC to ADLS destination in Avro file transfer).
  • There should be a concept of creating Global variables which is missing.
Read full review
Alternatives Considered
dbt Labs
Most ETL pipeline products have a T layer, but dbt just does it better. The transformation is on steroids compared to the others. Also, just allows much more Adhoc solutions for very specific projects. Those ETL tools are probably better on the T part if you don't need too many transforms - also dbt is pretty much free dependent on how you work it, also extremely scalable.
Read full review
StreamSets
StreamSets is a one-stop solution to design Data engineering Pipelines and doesn't require deep Programming knowledge, It's so user-friendly that anyone in Team can contribute to the Idea of pipeline design. In Hadoop One has to be programming proficient to use its various components like Hive, HDFS, Kafka, etc but in StreamSets all these stages are built-in and ready to use with minor configuration.
Read full review
Return on Investment
dbt Labs
  • Simplified our BI layer for faster load times
  • Increased the quality of data reaching our end users
  • Makes complex transformations manageable
Read full review
StreamSets
  • Simplified Improvised Overall data ingestion and Integration Process.
  • Support to various Hetrogenous Source systems like RDBMS< Kafka, Salesforce, Key Vault.
  • Secure, easy to launch Integration tool.
Read full review
ScreenShots