What users are saying about
112 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow noopener noreferrer'>trScore algorithm: Learn more.</a>
Score 8.4 out of 101
48 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow noopener noreferrer'>trScore algorithm: Learn more.</a>
Score 6.9 out of 101

Likelihood to Recommend

Apache Spark

The software appears to run more efficiently than other big data tools, such as Hadoop. Given that, Apache Spark is well-suited for querying and trying to make sense of very, very large data sets. The software offers many advanced machine learning and econometrics tools, although these tools are used only partially because very large data sets require too much time when the data sets get too large. The software is not well-suited for projects that are not big data in size. The graphics and analytical output are subpar compared to other tools.
Thomas Young profile photo

Talend Data Integration

If your organisation or department works regularly with ETL jobs or sources data from multiple locations and needs to integrate them to provide for better data compatibility, then Talend Data Integration is a great tool and I would strongly recommend it. This mostly applies to established corporations, and not startups as the licensing fee is quite high.
No photo available

Feature Rating Comparison

Data Source Connection

Apache Spark
Talend Data Integration
8.2
Connect to traditional data sources
Apache Spark
Talend Data Integration
9.6
Connecto to Big Data and NoSQL
Apache Spark
Talend Data Integration
6.8

Data Transformations

Apache Spark
Talend Data Integration
9.1
Simple transformations
Apache Spark
Talend Data Integration
9.4
Complex transformations
Apache Spark
Talend Data Integration
8.8

Data Modeling

Apache Spark
Talend Data Integration
7.5
Data model creation
Apache Spark
Talend Data Integration
8.6
Metadata management
Apache Spark
Talend Data Integration
7.6
Business rules and workflow
Apache Spark
Talend Data Integration
9.0
Collaboration
Apache Spark
Talend Data Integration
5.4
Testing and debugging
Apache Spark
Talend Data Integration
7.0

Data Governance

Apache Spark
Talend Data Integration
8.8
Integration with data quality tools
Apache Spark
Talend Data Integration
9.4
Integration with MDM tools
Apache Spark
Talend Data Integration
8.2

Pros

Apache Spark

  • Rich APIs for data transformation making for very each to transform and prepare data in a distributed environment without worrying about memory issues
  • Faster in execution times compare to Hadoop and PIG Latin
  • Easy SQL interface to the same data set for people who are comfortable to explore data in a declarative manner
  • Interoperability between SQL and Scala / Python style of munging data
Nitin Pasumarthy profile photo

Talend Data Integration

  • data integration
  • support for different data sources
  • documentation
Sanket Bharaswadkar profile photo

Cons

Apache Spark

  • Memory management. Very weak on that.
  • PySpark not as robust as scala with spark.
  • spark master HA is needed. Not as HA as it should be.
  • Locality should not be a necessity, but does help improvement. But would prefer no locality
Anson Abraham profile photo

Talend Data Integration

  • The Talend Administration Console TAC is a great place to schedule and monitor your jobs. Probably the interface can be improved.
No photo available

Support

Apache Spark

No score
No answers yet
No answers on this topic

Talend Data Integration

Talend Data Integration 9.0
Based on 1 answer
Good support, specially when it relates to PROD environment. The support team has access to the product development team. Things are internally escalated to development team if there is a bug encountered. This helps the customer to get quick fix or patch designed for problem exceptions. I have also seen support showing their willingness to help develop custom connector for a newly available cloud based big data solution
No photo available

Alternatives Considered

Apache Spark

Spark in comparison to similar technologies ends up being a one stop shop. You can achieve so much with this one framework instead of having to stitch and weave multiple technologies from the Hadoop stack, all while getting incredibility performance, minimal boilerplate, and getting the ability to write your application in the language of your choosing.
No photo available

Talend Data Integration

Compared to Microsoft SQL Server Integration Services (SSIS) talend gives developers much more tools and flexibility in order to achieve different ETL processes. For instance, SSIS, separates processing from data management, and Talend mixes both stages so that you can perform complex processes like iterating sub-jobs for each data row. It also provides a huge component list compared to SSIS which allows retrieving and saving data from many various sources. The administration part is also wider than what is offered from SSIS. In other words, SSIS is like a toy compared to Talend Integration's capabilities.When comparing Talend with Kettle (Pentaho) it's is easy to find similarities because they are both similar tools. In my experience, I'd rather [use] Talend because, in my opinion, it is more focused at data management. Kettle is a component provided from a wider BI tool, and Pentaho is not only focused at data management. I also found Talend gives better performance and manages connection sockets better than Kettle.
Josep Coves Barreiro profile photo

Return on Investment

Apache Spark

  • It has had a very positive impact, as it helps reduce the data processing time and thus helps us achieve our goals much faster.
  • Being easy to use, it allows us to adapt to the tool much faster than with others, which in turn allows us to access various data sources such as Hadoop, Apache Mesos, Kubernetes, independently or in the cloud. This makes it very useful.
  • It was very easy for me to use Apache Spark and learn it since I come from a background of Java and SQL, and it shares those basic principles and uses a very similar logic.
Carla Borges profile photo

Talend Data Integration

  • We decided to move away from a high cost commercial dData integration tool and opted to use Talend.
  • Talend has been able to serve all our data integration needs for last two and half years and we have never felt limited to any available feature in the tool.
  • Option to use local machine as a data integration service and run jobs on the desktop is usefull to carry out POCs and load personal data to the data warehouse.
No photo available

Pricing Details

Apache Spark

General

Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No

Talend Data Integration

General

Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No

Add comparison