What users are saying about
102 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow'>trScore algorithm: Learn more.</a>
Score 8.5 out of 101
40 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow'>trScore algorithm: Learn more.</a>
Score 7.9 out of 101

Add comparison

Likelihood to Recommend

Apache Spark

The software appears to run more efficiently than other big data tools, such as Hadoop. Given that, Apache Spark is well-suited for querying and trying to make sense of very, very large data sets. The software offers many advanced machine learning and econometrics tools, although these tools are used only partially because very large data sets require too much time when the data sets get too large. The software is not well-suited for projects that are not big data in size. The graphics and analytical output are subpar compared to other tools.
Thomas Young profile photo

Talend Data Integration

If your organisation or department works regularly with ETL jobs or sources data from multiple locations and needs to integrate them to provide for better data compatibility, then Talend Data Integration is a great tool and I would strongly recommend it. This mostly applies to established corporations, and not startups as the licensing fee is quite high.
No photo available

Feature Rating Comparison

Data Source Connection

Apache Spark
Talend Data Integration
8.1
Connect to traditional data sources
Apache Spark
Talend Data Integration
9.5
Connecto to Big Data and NoSQL
Apache Spark
Talend Data Integration
6.7

Data Transformations

Apache Spark
Talend Data Integration
9.2
Simple transformations
Apache Spark
Talend Data Integration
9.5
Complex transformations
Apache Spark
Talend Data Integration
9.0

Data Modeling

Apache Spark
Talend Data Integration
7.6
Data model creation
Apache Spark
Talend Data Integration
8.6
Metadata management
Apache Spark
Talend Data Integration
7.6
Business rules and workflow
Apache Spark
Talend Data Integration
9.0
Collaboration
Apache Spark
Talend Data Integration
5.6
Testing and debugging
Apache Spark
Talend Data Integration
7.1

Data Governance

Apache Spark
Talend Data Integration
8.8
Integration with data quality tools
Apache Spark
Talend Data Integration
9.5
Integration with MDM tools
Apache Spark
Talend Data Integration
8.1

Pros

  • in memory data engine and hence faster processing
  • does well to lay on top of hadoop file system for big data analytics
  • very good tool for streaming data
Shiv Shivakumar profile photo
  • Easy to use interface
  • Designing jobs is very straightforward
  • Allows the use of custom components to suit specific needs
No photo available

Cons

  • could do a better job for analytics dashboards to provide insights on a data stream and hence not have to rely on data visualization tools along with spark
  • also there is room for improvement in the area of data discovery
Shiv Shivakumar profile photo
  • Syncing with Git should be made easier
  • Can face issues with certain big data spaces
  • Support find it difficult to resolve complex issues
No photo available

Support

No score
No answers yet
No answers on this topic
Talend Data Integration9.0
Based on 1 answer
Good support, specially when it relates to PROD environment. The support team has access to the product development team. Things are internally escalated to development team if there is a bug encountered. This helps the customer to get quick fix or patch designed for problem exceptions. I have also seen support showing their willingness to help develop custom connector for a newly available cloud based big data solution
No photo available

Alternatives Considered

Even with Python, MapReduce is lengthy coding. Combination of Python with Apache Spark will not only shorten the code, but it will effectively increase the speed of algorithms. Occasionally, I use MapReduce, but Apache Spark will replace MapReduce very soon. It has many built-in and faster features.
Kartik Chavan profile photo
Most other tools of similar nature work well for small and medium sized data warehouses, but fail to maintain performance for very large data warehouses. However, Talend works decently well on large data as well. On the other hand, there are software tools like Oracle Data Integrator that are especially made for large orgs, and Talend cannot match up to those benchmarks.
No photo available

Return on Investment

  • Apache Spark has faster performance compared to MapReduce.
  • Combination of Python & Spark is the best. Shorter code, faster and efficient performance.
  • Can replace RDBMS
Kartik Chavan profile photo
  • Talend Data Integration has streamlined data warehouse management over the entire organisation.
  • A more organised DWH means efficient data usage.
  • This cuts down on both cost as well as time.
No photo available

Pricing Details

Apache Spark

General
Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No
Additional Pricing Details

Talend Data Integration

General
Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No
Additional Pricing Details