What users are saying about
127 Ratings
trScore algorithm: https://www.trustradius.com/static/about-trustradius-scoring
Score 8.7 out of 10
12 Ratings
Score 8.5 out of 10

Likelihood to Recommend

Apache Spark

The software appears to run more efficiently than other big data tools, such as Hadoop. Given that, Apache Spark is well-suited for querying and making sense of very large data sets. The software offers many advanced machine learning and econometrics tools, although these tools are used only partially because they take too long to run once the data sets get very large. The software is not well-suited for projects that are not big data in size. The graphics and analytical output are subpar compared to other tools.
Thomas Young | TrustRadius Reviewer

SAS Data Integration Studio

It is well suited when data sits in a system and needs a complex transformation to be usable by an average user, or when data resides in systems that have very different connection speeds. After passing through SAS Data Integration Studio, the data can be integrated and used together, removing timing issues from the users' worries. It is perhaps less appropriate to have users who are not familiar with the source data set up the load processes.
Donald Wildeboer | TrustRadius Reviewer

Feature Rating Comparison

Data Source Connection

Category score: Apache Spark no score, SAS Data Integration Studio 7.0
Connect to traditional data sources: Apache Spark no score, SAS Data Integration Studio 7.0
Connect to Big Data and NoSQL: Apache Spark no score, SAS Data Integration Studio 7.0

Data Transformations

Category score: Apache Spark no score, SAS Data Integration Studio 8.5
Simple transformations: Apache Spark no score, SAS Data Integration Studio 9.0
Complex transformations: Apache Spark no score, SAS Data Integration Studio 8.0

Data Modeling

Category score: Apache Spark no score, SAS Data Integration Studio 7.6
Data model creation: Apache Spark no score, SAS Data Integration Studio 7.0
Metadata management: Apache Spark no score, SAS Data Integration Studio 7.0
Business rules and workflow: Apache Spark no score, SAS Data Integration Studio 9.0
Collaboration: Apache Spark no score, SAS Data Integration Studio 8.0
Testing and debugging: Apache Spark no score, SAS Data Integration Studio 7.0

Data Governance

Category score: Apache Spark no score, SAS Data Integration Studio 7.0
Integration with data quality tools: Apache Spark no score, SAS Data Integration Studio 7.0
Integration with MDM tools: Apache Spark no score, SAS Data Integration Studio 7.0
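
The category-level scores listed for SAS Data Integration Studio in this comparison match the simple mean of the feature scores under each category. The page does not say how category scores are derived, so the sketch below only checks that observation under the assumption of plain averaging rounded to one decimal place. The dictionary and function names are illustrative, not part of any TrustRadius API.

```python
from statistics import mean

# Feature scores for SAS Data Integration Studio, copied from this comparison.
feature_scores = {
    "Data Source Connection": [7.0, 7.0],
    "Data Transformations": [9.0, 8.0],
    "Data Modeling": [7.0, 7.0, 9.0, 8.0, 7.0],
    "Data Governance": [7.0, 7.0],
}

# Category-level scores as listed in this comparison.
listed_category_scores = {
    "Data Source Connection": 7.0,
    "Data Transformations": 8.5,
    "Data Modeling": 7.6,
    "Data Governance": 7.0,
}


def category_score(scores):
    """Assumed rule: simple mean of feature scores, rounded to one decimal."""
    return round(mean(scores), 1)


# Compute a score per category and compare it with the listed value.
computed = {name: category_score(s) for name, s in feature_scores.items()}
for name, score in computed.items():
    print(f"{name}: computed {score}, listed {listed_category_scores[name]}")
```

Under that assumption, each computed value agrees with the listed category score; a weighted scheme could produce the same numbers here, so this is a consistency check rather than a confirmation of the scoring method.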

Pros

Apache Spark

  • Rich APIs for data transformation that make it very easy to transform and prepare data in a distributed environment without worrying about memory issues
  • Faster execution times compared to Hadoop and Pig Latin
  • Easy SQL interface to the same data set for people who are comfortable exploring data in a declarative manner
  • Interoperability between SQL and Scala / Python style of munging data
Nitin Pasumarthy | TrustRadius Reviewer

SAS Data Integration Studio

  • Features to embed user-defined SAS code to extract and transform data as per business requirements
  • Good security standards to manage user access and maintain different repositories
  • Enhanced job scheduling
  • Version control
Anonymous | TrustRadius Reviewer

Cons

Apache Spark

  • Memory management is very weak.
  • PySpark is not as robust as Scala with Spark.
  • Spark master high availability (HA) is needed; it is not as highly available as it should be.
  • Data locality should not be a necessity, although it does help performance. I would prefer not to depend on locality.
Anson Abraham | TrustRadius Reviewer

SAS Data Integration Studio

  • Sometimes parts of the data are not available, although this is generally because it is connected to so many different systems.
  • Price: it's not the cheapest software; however, the value for dollars spent does seem to be good.
  • Sometimes all the different systems and platforms and folders and so forth get a bit overwhelming for new users.
Donald Wildeboer | TrustRadius Reviewer

Usability

Apache Spark

Apache Spark 8.7
Based on 3 answers
Apache Spark integrates with multiple big data frameworks. It does not exert too much load on the disks. Moreover, it is easy to program and use. Its high-level APIs reduce the headache of using different applications separately. Big data processing has never been as easy as it is with Apache Spark.
Partha Protim Pegu | TrustRadius Reviewer

SAS Data Integration Studio

No score
No answers yet
No answers on this topic

Support Rating

Apache Spark

Apache Spark 8.2
Based on 6 answers
1. It integrates very well with Scala or Python.
2. Its SQL interoperability is very easy to understand.
3. Apache Spark is much faster than competing technologies.
4. The support from the Apache community for Spark is huge.
5. Execution times are faster compared to others.
6. There are a large number of forums available for Apache Spark.
7. Code for Apache Spark is simple and easy to gain access to.
8. Many organizations use Apache Spark, so many solutions are available for existing applications.
Yogesh Mhasde | TrustRadius Reviewer

SAS Data Integration Studio

SAS Data Integration Studio 9.0
Based on 1 answer
Good technical support from SAS.
Anonymous | TrustRadius Reviewer

Alternatives Considered

Apache Spark

Spark, in comparison to similar technologies, ends up being a one-stop shop. You can achieve so much with this one framework instead of having to stitch and weave together multiple technologies from the Hadoop stack, all while getting incredible performance, minimal boilerplate, and the ability to write your application in the language of your choosing.
Anonymous | TrustRadius Reviewer

SAS Data Integration Studio

Because SAS Data Integration Studio is a third-party tool, it seems to work equally well with all our systems. That is to say, it doesn't work better with Microsoft or Oracle but seems to work equally well with all of them. It has a very powerful back-end that allows us to transform and load our data quickly and efficiently in terms of programmer time.
Donald Wildeboer | TrustRadius Reviewer

Return on Investment

Apache Spark

  • It has had a very positive impact, as it helps reduce the data processing time and thus helps us achieve our goals much faster.
  • Being easy to use, it allows us to adapt to the tool much faster than with others. It runs on Hadoop, Apache Mesos, Kubernetes, standalone, or in the cloud, and lets us access various data sources. This makes it very useful.
  • It was very easy for me to use Apache Spark and learn it since I come from a background of Java and SQL, and it shares those basic principles and uses a very similar logic.
Carla Borges | TrustRadius Reviewer

SAS Data Integration Studio

  • Can be used to automate SAS jobs
  • To extract and load data from multiple sources
  • Very high licensing cost
Anonymous | TrustRadius Reviewer

Pricing Details

Apache Spark

General

Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No

SAS Data Integration Studio

General

Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No

Rating Summary

Likelihood to Recommend

Apache Spark
8.6
SAS Data Integration Studio
8.0

Usability

Apache Spark
8.7
SAS Data Integration Studio

Support Rating

Apache Spark
8.2
SAS Data Integration Studio
9.0
