dbt vs. StreamSets DataOps Platform

dbt

StreamSets DataOps Platform

Overview
Product	Rating	Most Used By	Product Summary	Starting Price
dbt	Score 9.4 out of 10	N/A	dbt is an SQL development environment, developed by Fishtown Analytics, now known as dbt Labs. The vendor states that with dbt, analysts take ownership of the entire analytics engineering workflow, from writing data transformation code to deployment and documentation. dbt Core is distributed under the Apache 2.0 license, and paid Teams and Enterprise editions are available.	$0 per month per seat
StreamSets	Score 8.4 out of 10	N/A	StreamSets in San Francisco offers their DataOps Platform, a subscription based streaming analytics platform including StreamSets Data Collector data source management, Control Hub for data movement architecture management, StreamSets Data Collector Edge IoT manager, DataFlow Performance Manager (DPM), and StreamSets Data Protector compliance (e.g. GDPR) compliance module.	N/A

Pricing

dbt

StreamSets DataOps Platform

Editions & Modules

No answers on this topic

Offerings

Pricing Offerings
dbt	StreamSets
Free Trial
Yes	No
Free/Freemium Version
Yes	No
Premium Consulting/Integration Services
Yes	No

Entry-level Setup Fee

No setup fee

Additional Details

—

More Pricing Information

Community Pulse
	dbt	StreamSets DataOps Platform
Top Pros	Pro Version control Pro Customer success Pro Success team	Pro Easy to use Pro Create data Pro Way easier
Top Cons	Minus Test results Minus Data sets	Minus Data transfer Minus Takes some time Minus File transfer

Features

dbt

StreamSets DataOps Platform

Data Transformations

Comparison of Data Transformations features of Product A and Product B
	dbt 9.7 5 Ratings 15% above category average	StreamSets DataOps Platform - Ratings
Simple transformations	9.55 Ratings	00 Ratings
Complex transformations	9.95 Ratings	00 Ratings

Data Modeling

Comparison of Data Modeling features of Product A and Product B
	dbt 9.0 5 Ratings 10% above category average	StreamSets DataOps Platform - Ratings
Data model creation	9.15 Ratings	00 Ratings
Metadata management	8.65 Ratings	00 Ratings
Business rules and workflow	8.05 Ratings	00 Ratings
Collaboration	9.83 Ratings	00 Ratings
Testing and debugging	9.55 Ratings	00 Ratings

Streaming Analytics

Comparison of Streaming Analytics features of Product A and Product B
	dbt - Ratings	StreamSets DataOps Platform 9.0 1 Ratings 11% above category average
Visualization Dashboards	00 Ratings	7.01 Ratings
Low Latency	00 Ratings	8.01 Ratings
Integrated Development Tools	00 Ratings	10.01 Ratings
Data wrangling and preparation	00 Ratings	10.01 Ratings
Data Enrichment	00 Ratings	10.01 Ratings

Best Alternatives
	dbt	StreamSets DataOps Platform
Small Businesses	Skyvia Score 9.6 out of 10	IBM Streams Score 9.0 out of 10
Medium-sized Companies	IBM InfoSphere Information Server Score 8.1 out of 10	Confluent Score 7.4 out of 10
Enterprises	IBM InfoSphere Information Server Score 8.1 out of 10	Spotfire Streaming Score 8.0 out of 10
All Alternatives	View all alternatives	View all alternatives

User Ratings
	dbt	StreamSets DataOps Platform
Likelihood to Recommend	9.6 (7 ratings)	9.0 (1 ratings)

User Testimonials
	dbt	StreamSets DataOps Platform
Likelihood to Recommend	dbt Labs If you can load your data first into your warehouse, dbt is excellent. It does the T(ransformation) part of ELT brilliantly but does not do the E(xtract) or L(oad) part. If you know SQL or your development team knows SQL, it's a framework and extension around that. So, it's easy to learn and easy to hire people with that technical skill (as opposed to specific Informatica, SnapLogic, etc. experience). dbt uses plain text files and integrates with GitHub. You can easily see the changes made between versions. In GUI-based UIs it was always hard to tell what someone had changed. Each "model" is essentially a "SELECT" statement. You never need to do a "CREATE TABLE" or "CREATE VIEW" - it's all done for you, leaving you to work on the business logic. Instead of saying "FROM specific_db.schema.table" you indicate "FROM ref('my_other_model')". It creates an internal dependency diagram you can view in a DAG. When you deploy, the dependencies work like magic in your various environments. They also have great documentation, an active slack community, training, and support. I like the enhancements they have been making and I believe they are headed in a good direction. Incentivized Judy Campion Data Architect Read full review	StreamSets Majorly for all Batch and Streaming Scenarios we are designing StreamSets pipelines, few best suited and tried out use cases below : 1. JDBC to ADLS data transfer based on source refresh frequency. 2. Kafka to GCS. 3. Kafka to Azure Event. 4. Hub HDFS to ADLS data transfer. 5. Schema generation to generate Avro. The easy to design Canvas, Scheduling Jobs, Fragment creation and utilization, an inbuilt wide range of Stage availability makes it an even more favorable tool for me to design data engineering pipelines. Incentivized Abhishek Katara Assistant Consultant Read full review
Pros	dbt Labs user experience makes it easy to work with SQL and version control customer success team and the dbt (data build tool) community help establish best practices thorough and clear documentation Obed Espina Senior Data Engineer Read full review	StreamSets A easy to use canvas to create Data Engineering Pipeline. A wide range of available Stages ie. Sources, Processors, Executors, and Destinations. Supports both Batch and Streaming Pipelines. Scheduling is way easier than cron. Integration with Key-Vaults for Secrets Fetching. Incentivized Abhishek Katara Assistant Consultant Read full review
Cons	dbt Labs Slow load times of the dbt cloud environment (they're working on it via a new UI though) More out-of-the-box solutions for managing procedures, functions, etc would be nice to have, but honestly, it's pretty easy to figure out how to adapt dbt macros Incentivized Verified User Anonymous Read full review	StreamSets Monitoring/Visualization can be improvised and enhanced a lot (e.g. to monitor a Job to see what happened 7 days back with data transfer). The logging mechanism can be simplified (Logs can be filtered with "ERROR", "DEBUG", "ALL" etc but still takes some time to get familiar for understanding). Auto Scalability for heavy load transfer (Taking much time for >5 million record transfer from JDBC to ADLS destination in Avro file transfer). There should be a concept of creating Global variables which is missing. Incentivized Abhishek Katara Assistant Consultant Read full review
Alternatives Considered	dbt Labs Most ETL pipeline products have a T layer, but dbt just does it better. The transformation is on steroids compared to the others. Also, just allows much more Adhoc solutions for very specific projects. Those ETL tools are probably better on the T part if you don't need too many transforms - also dbt is pretty much free dependent on how you work it, also extremely scalable. Incentivized YP Yoshlin Pillay Senior Data Entry Read full review	StreamSets StreamSets is a one-stop solution to design Data engineering Pipelines and doesn't require deep Programming knowledge, It's so user-friendly that anyone in Team can contribute to the Idea of pipeline design. In Hadoop One has to be programming proficient to use its various components like Hive, HDFS, Kafka, etc but in StreamSets all these stages are built-in and ready to use with minor configuration. Incentivized Abhishek Katara Assistant Consultant Read full review
Return on Investment	dbt Labs Simplified our BI layer for faster load times Increased the quality of data reaching our end users Makes complex transformations manageable Incentivized Verified User Anonymous Read full review	StreamSets Simplified Improvised Overall data ingestion and Integration Process. Support to various Hetrogenous Source systems like RDBMS< Kafka, Salesforce, Key Vault. Secure, easy to launch Integration tool. Incentivized Abhishek Katara Assistant Consultant Read full review
ScreenShots