What users are saying about
115 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow noopener noreferrer'>trScore algorithm: Learn more.</a>
Score 8.3 out of 100
48 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow noopener noreferrer'>trScore algorithm: Learn more.</a>
Score 7.2 out of 100

Likelihood to Recommend

Apache Spark

The software appears to run more efficiently than other big data tools, such as Hadoop. Given that, Apache Spark is well-suited for querying and trying to make sense of very, very large data sets. The software offers many advanced machine learning and econometrics tools, although these tools are used only partially because very large data sets require too much time when the data sets get too large. The software is not well-suited for projects that are not big data in size. The graphics and analytical output are subpar compared to other tools.
Thomas Young | TrustRadius Reviewer

Talend Data Integration

Talend Data Integration is easy to use and learn. Talend Data Integration can perform bulk data migrations of up to a few million records without any issues. We have not tried it or had use case which required more than a few million records at a time. It works particularly well with databases and CRM software.
Anonymous | TrustRadius Reviewer

Feature Rating Comparison

Data Source Connection

Apache Spark
Talend Data Integration
8.4
Connect to traditional data sources
Apache Spark
Talend Data Integration
9.9
Connecto to Big Data and NoSQL
Apache Spark
Talend Data Integration
6.8

Data Transformations

Apache Spark
Talend Data Integration
9.9
Simple transformations
Apache Spark
Talend Data Integration
9.9
Complex transformations
Apache Spark
Talend Data Integration
9.8

Data Modeling

Apache Spark
Talend Data Integration
7.3
Data model creation
Apache Spark
Talend Data Integration
8.1
Metadata management
Apache Spark
Talend Data Integration
8.8
Business rules and workflow
Apache Spark
Talend Data Integration
9.0
Collaboration
Apache Spark
Talend Data Integration
5.4
Testing and debugging
Apache Spark
Talend Data Integration
5.3

Data Governance

Apache Spark
Talend Data Integration
7.7
Integration with data quality tools
Apache Spark
Talend Data Integration
7.3
Integration with MDM tools
Apache Spark
Talend Data Integration
8.0

Pros

Apache Spark

  • Rich APIs for data transformation making for very each to transform and prepare data in a distributed environment without worrying about memory issues
  • Faster in execution times compare to Hadoop and PIG Latin
  • Easy SQL interface to the same data set for people who are comfortable to explore data in a declarative manner
  • Interoperability between SQL and Scala / Python style of munging data
Nitin Pasumarthy | TrustRadius Reviewer

Talend Data Integration

  • JSON parsing, if you are into highly nested JSON object parsing Talend has the ability to display the structure of JSON and allows you to define the extraction logic in the metadata. Same with XML source.
  • Customization, For mostly all ETL work you will have a component in Talend. And in case you have a very specific requirement you can end up easily designing a custom component that you can code and reuse and share with others.
  • Talend has a connectivity option to almost all the databases (relational or NOSQ)L and sources available. It also has generic JDBC/ ODBC drivers in case you need it.
  • I like the ease of deployment across environment (DEV and PROD) with the use of context variables.
Anonymous | TrustRadius Reviewer

Cons

Apache Spark

  • Memory management. Very weak on that.
  • PySpark not as robust as scala with spark.
  • spark master HA is needed. Not as HA as it should be.
  • Locality should not be a necessity, but does help improvement. But would prefer no locality
Anson Abraham | TrustRadius Reviewer

Talend Data Integration

  • Syncing with Git should be made easier
  • Can face issues with certain big data spaces
  • Support find it difficult to resolve complex issues
Anonymous | TrustRadius Reviewer

Usability

Apache Spark

No score
No answers yet
No answers on this topic

Talend Data Integration

Talend Data Integration 9.0
Based on 1 answer
We use Talend Data Integration day in and day out. It is the best and easiest tool to jump on to and use. We can build a basic integration super-fast. We could build basic integrations as fast as within the hour. It is also easy to build transformations and use Java to perform some operations.
Anonymous | TrustRadius Reviewer

Support

Apache Spark

Apache Spark 7.5
Based on 2 answers
1. It integrates very well with scala or python.2. It's very easy to understand SQL interoperability.3. Apache is way faster than the other competitive technologies.4. The support from the Apache community is very huge for Spark.5. Execution times are faster as compared to others.6. There are a large number of forums available for Apache Spark.7. The code availability for Apache Spark is simpler and easy to gain access to.8. Many organizations use Apache Spark, so many solutions are available for existing applications.
Yogesh Mhasde | TrustRadius Reviewer

Talend Data Integration

Talend Data Integration 8.0
Based on 2 answers
Good support, specially when it relates to PROD environment. The support team has access to the product development team. Things are internally escalated to development team if there is a bug encountered. This helps the customer to get quick fix or patch designed for problem exceptions. I have also seen support showing their willingness to help develop custom connector for a newly available cloud based big data solution
Anonymous | TrustRadius Reviewer

Alternatives Considered

Apache Spark

Spark in comparison to similar technologies ends up being a one stop shop. You can achieve so much with this one framework instead of having to stitch and weave multiple technologies from the Hadoop stack, all while getting incredibility performance, minimal boilerplate, and getting the ability to write your application in the language of your choosing.
Anonymous | TrustRadius Reviewer

Talend Data Integration

Talend is much versatile in assimilating various different business use cases. It covers more functionality and is packed with tons of features to explore. It has the ability to fine tune to each project, and not be a one-fits all solution. Problems with Excel, is that it is not suited for large projects, collaboration work. Alteryx is a very costly alternative.
Sanket Bharaswadkar | TrustRadius Reviewer

Return on Investment

Apache Spark

  • It has had a very positive impact, as it helps reduce the data processing time and thus helps us achieve our goals much faster.
  • Being easy to use, it allows us to adapt to the tool much faster than with others, which in turn allows us to access various data sources such as Hadoop, Apache Mesos, Kubernetes, independently or in the cloud. This makes it very useful.
  • It was very easy for me to use Apache Spark and learn it since I come from a background of Java and SQL, and it shares those basic principles and uses a very similar logic.
Carla Borges | TrustRadius Reviewer

Talend Data Integration

  • Easy to build complex data transformations.
  • Licensing model isn't that flexible.
  • Memory mangement for huge volumes of information. You have to modify ETL designs to handle it properly.
Josep Coves Barreiro | TrustRadius Reviewer

Pricing Details

Apache Spark

General

Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No

Talend Data Integration

General

Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No

Rating Summary

Likelihood to Recommend

Apache Spark
8.4
Talend Data Integration
9.8

Usability

Apache Spark
Talend Data Integration
9.0

Support

Apache Spark
7.5
Talend Data Integration
8.0

Add comparison