What users are saying about
4 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow noopener noreferrer'>trScore algorithm: Learn more.</a>
Score 8.8 out of 100
75 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow noopener noreferrer'>trScore algorithm: Learn more.</a>
Score 8.1 out of 100

Likelihood to Recommend

Apache Sqoop

Sqoop is great for sending data between a JDBC compliant database and a Hadoop environment. Sqoop is built for those who need a few simple CLI options to import a selection of database tables into Hadoop, do large dataset analysis that could not commonly be done with that database system due to resource constraints, then export the results back into that database (or another). Sqoop falls short when there needs to be some extra, customized processing between database extract, and Hadoop loading, in which case Apache Spark's JDBC utilities might be preferred
Jordan Moore | TrustRadius Reviewer

Informatica PowerCenter

PowerCenter is well equipped to handle large amounts of data movement in an organization with many disparate sources and a structured development team. It excels in enforcing enterprise development standards through things like metadata manager and the monitoring capabilities (as well as being able to design monitoring rules for everything from naming standards to design practices). It is especially well suited at handling flat-file data in addition to its many connectors and native support for just about any ANSI standard database. For large development teams or the desire to remain flexible at an enterprise scale, Powercenter is a top-tier solution.For small projects or even smaller development teams with mostly a single data source, expect frustration with being able to quickly test a solution as the design flow is very structured. It is also designed in a way that segregation of duties at a very high level can also cause small development teams to be counter-productive. Each step in the design process is a separate application, and although stitched together, is not without its problems. In order to design a simple mapping for example, you would first need a connection established to the source (example, ODBC) and keep in mind that it will automatically name the container according to how you named your connection. You would then open the designer tool, import a connection as a source, optionally check it in, create a target, optionally check it in as well, and design a transformation mapping. In order to test or run it, you will need to open a separate application (Workflow Manager) and create a workflow from your mapping, then create a session for that workflow and a workflow for those one or more sessions at which point you can test it. After running it, in order to observe, you then need to open a separate application (Monitor) to see what it is doing and how well. For a developer coming from something like SSIS, this can be daunting and cumbersome for building a simple POC and trying to test it (although from the inverse, building an enterprise scalable ETL solution from SSIS is its own challenge).
Jody Gitchel | TrustRadius Reviewer

Feature Rating Comparison

Data Source Connection

Apache Sqoop
Informatica PowerCenter
8.9
Connect to traditional data sources
Apache Sqoop
Informatica PowerCenter
9.5
Connecto to Big Data and NoSQL
Apache Sqoop
Informatica PowerCenter
8.3

Data Transformations

Apache Sqoop
Informatica PowerCenter
8.6
Simple transformations
Apache Sqoop
Informatica PowerCenter
8.7
Complex transformations
Apache Sqoop
Informatica PowerCenter
8.6

Data Modeling

Apache Sqoop
Informatica PowerCenter
8.3
Data model creation
Apache Sqoop
Informatica PowerCenter
7.4
Metadata management
Apache Sqoop
Informatica PowerCenter
8.9
Business rules and workflow
Apache Sqoop
Informatica PowerCenter
8.5
Collaboration
Apache Sqoop
Informatica PowerCenter
8.7
Testing and debugging
Apache Sqoop
Informatica PowerCenter
7.8

Data Governance

Apache Sqoop
Informatica PowerCenter
8.1
Integration with data quality tools
Apache Sqoop
Informatica PowerCenter
8.1
Integration with MDM tools
Apache Sqoop
Informatica PowerCenter
8.1

Pros

Apache Sqoop

  • Provides generalized JDBC extensions to migrate data between most database systems
  • Generates Java classes upon reading database records for use in other code utilizing Hadoop's client libraries
  • Allows for both import and export features
Jordan Moore | TrustRadius Reviewer

Informatica PowerCenter

  • Informatica has a wide range of support for databases. Pretty much every mainstream DBMS is compatible here.
  • Designing ETL mappings and workflows is a very intuitive process, and takes minimal learning time and effort even for a beginner.
  • Informatica's biggest strength is its sheer performance. It is unmatched in terms of handling large volumes of data.
Anonymous | TrustRadius Reviewer

Cons

Apache Sqoop

  • Sqoop2 development seems to have stalled. I have set it up outside of a Cloudera CDH installation, and I actually prefer it's "Sqoop Server" model better than just the CLI client version that is Sqoop1. This works especially well in a microservices environment, where there would be only one place to maintain the JDBC drivers to use for Sqoop.
Jordan Moore | TrustRadius Reviewer

Informatica PowerCenter

  • One of the challenges of PowerCenter is the lack of integration between the components and functionality provided by PowerCenter. PowerCenter consists of multiple components such has the repository service, integration service, metadata service. Considerable time and resources were required to install and configure these components before PowerCenter was available for use.
  • In order to connect to various data sources such as Netezza database or SAS datasets, PowerCenter requires the installation and configuration of separate plug-ins. We spent considerable time trouble-shooting and debugging problems while trying to get the various plug-ins integrated with PowerCenter and get them up and running as described in the documentation.
  • PowerCenter works well with structured data. That is, it is easy to work with input and output data that is pre-defined, fixed, and unchanging. It is much more difficult to work with dynamic data in which new fields are added or removed ad-hoc or if data format changes during the data ingest process. We have not been as successful in using PowerCenter for dynamic data.
  • One of the challenges of learning PowerCenter is that it is difficult to find documentation or publications that help you learn the various details about PowerCenter software. Unlike SAS Institute, Informatica does not publish books about PowerCenter. The documentation available with PowerCenter is sparse; we have learned many aspects of this technology through trial and error.
Anonymous | TrustRadius Reviewer

Likelihood to Renew

Apache Sqoop

No score
No answers yet
No answers on this topic

Informatica PowerCenter

Informatica PowerCenter 10.0
Based on 4 answers
Our team enjoys using Informatica and feels that it is one of the best ETL tools on the market.
Robert Goodman | TrustRadius Reviewer

Usability

Apache Sqoop

No score
No answers yet
No answers on this topic

Informatica PowerCenter

Informatica PowerCenter 9.5
Based on 2 answers
Positives;- Multi User Development Environment- Speed of transformation- Seamless integration between other Informatica products.Negatives;- There should be less windows to maintain developers' focus while using. You probably need 2 big monitors when you start development with Informatica Power Center.- Oracle Analytical functions should be natively used.- E-LT support as well as ETL support.
Gurcan Orhan | TrustRadius Reviewer

Performance

Apache Sqoop

No score
No answers yet
No answers on this topic

Informatica PowerCenter

Informatica PowerCenter 9.5
Based on 2 answers
PowerCenter is robust and fast, and it does a great job meeting all the needs, not just the most commercially vocal needs. In the hands of an expert power user, you can accomplish almost anything with your data. It is not for new users or intermittent users-- for that the Cloud version is a better fit. Be prepared for costly connectors (priced differently for each source or destination you are working with), and just be planful of your projects so you are not paying for connectors you no longer need or want
Anonymous | TrustRadius Reviewer

Support

Apache Sqoop

No score
No answers yet
No answers on this topic

Informatica PowerCenter

Informatica PowerCenter 8.0
Based on 1 answer
Informatica power center is a leader of the pack of ETL tools and has some great abilities that make it stand out from other ETL tools. It has been a great partner to its clients over a long time so it's definitely dependable. With all the great things about Informatica, it has a bit of tech burden that should be addressed to make it more nimble, reduce the learning curve for new developers, provide better connectivity with visualization tools.
Anonymous | TrustRadius Reviewer

Alternatives Considered

Apache Sqoop

  • Sqoop comes preinstalled on the major Hadoop vendor distributions as the recommended product to import data from relational databases. The ability to extend it with additional JDBC drivers makes it very flexible for the environment it is installed within.
  • Spark also has a useful JDBC reader, and can manipulate data in more ways than Sqoop, and also upload to many other systems than just Hadoop.
  • Kafka Connect JDBC is more for streaming database updates using tools such as Oracle GoldenGate or Debezium.
  • Streamsets and Apache NiFi both provide a more "flow based programming" approach to graphically laying out connectors between various systems, including JDBC and Hadoop.
Jordan Moore | TrustRadius Reviewer

Informatica PowerCenter

PowerCenter is the industry leader when it comes to interfacing with multiple source and target systems. The graphical interface increases employee productivity while reducing human resource expenditures and training requirements. These other tools offer some similar capabilities, but lack the range and depth when compared with the PowerCenter platform.
Brian Randolph | TrustRadius Reviewer

Return on Investment

Apache Sqoop

  • When combined with Cloudera's HUE, it can enable non-technical users to easily import relational data into Hadoop.
  • Being able to manipulate large datasets in Hadoop, and them load them into a type of "materialized view" in an external database system has yielded great insights into the Hadoop datalake without continuously running large batch jobs.
  • Sqoop isn't very user-friendly for those uncomfortable with a CLI.
Jordan Moore | TrustRadius Reviewer

Informatica PowerCenter

  • Positive - Easy to maintain processes built in Informatica Power Center.
  • Positive - Rapidly build and deploy ETL data mappings.
  • Positive - Develop the overall workflow process to run all ETL processes for the project.
  • Negative - Informatica Power Center can be a bit expensive, so your application needs to warrant the enterprise support.
Anonymous | TrustRadius Reviewer

Pricing Details

Apache Sqoop

General

Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No

Informatica PowerCenter

General

Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No

Rating Summary

Likelihood to Recommend

Apache Sqoop
9.0
Informatica PowerCenter
8.6

Likelihood to Renew

Apache Sqoop
Informatica PowerCenter
10.0

Usability

Apache Sqoop
Informatica PowerCenter
9.5

Performance

Apache Sqoop
Informatica PowerCenter
9.5

Support

Apache Sqoop
Informatica PowerCenter
8.0

Add comparison