Apache Sqoop

4 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow'>trScore algorithm: Learn more.</a>
Score 8.7 out of 101

Informatica Enterprise Data Integration

46 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow'>trScore algorithm: Learn more.</a>
Score 8.3 out of 101

Add comparison

Likelihood to Recommend

Apache Sqoop

Sqoop is great for sending data between a JDBC compliant database and a Hadoop environment. Sqoop is built for those who need a few simple CLI options to import a selection of database tables into Hadoop, do large dataset analysis that could not commonly be done with that database system due to resource constraints, then export the results back into that database (or another). Sqoop falls short when there needs to be some extra, customized processing between database extract, and Hadoop loading, in which case Apache Spark's JDBC utilities might be preferred
Jordan Moore profile photo

Informatica Enterprise Data Integration

Great for ETL especially when sourcing from various sources. Also handy when field by field or row by row transformations are needed. It may be less suited where quick bulk changes can be done
Brandon Fitzpatrick profile photo

Feature Rating Comparison

Data Source Connection

Apache Sqoop
Informatica Enterprise Data Integration
9.5
Connect to traditional data sources
Apache Sqoop
Informatica Enterprise Data Integration
10.0
Connecto to Big Data and NoSQL
Apache Sqoop
Informatica Enterprise Data Integration
9.0

Data Transformations

Apache Sqoop
Informatica Enterprise Data Integration
9.5
Simple transformations
Apache Sqoop
Informatica Enterprise Data Integration
10.0
Complex transformations
Apache Sqoop
Informatica Enterprise Data Integration
9.0

Data Modeling

Apache Sqoop
Informatica Enterprise Data Integration
8.0
Data model creation
Apache Sqoop
Informatica Enterprise Data Integration
8.2
Metadata management
Apache Sqoop
Informatica Enterprise Data Integration
8.8
Business rules and workflow
Apache Sqoop
Informatica Enterprise Data Integration
7.6
Collaboration
Apache Sqoop
Informatica Enterprise Data Integration
7.6
Testing and debugging
Apache Sqoop
Informatica Enterprise Data Integration
8.1

Data Governance

Apache Sqoop
Informatica Enterprise Data Integration
7.9
Integration with data quality tools
Apache Sqoop
Informatica Enterprise Data Integration
8.1
Integration with MDM tools
Apache Sqoop
Informatica Enterprise Data Integration
7.6

Pros

  • Provides generalized JDBC extensions to migrate data between most database systems
  • Generates Java classes upon reading database records for use in other code utilizing Hadoop's client libraries
  • Allows for both import and export features
Jordan Moore profile photo
  • Tracking Changes in slowly changing dimensions
  • Fact & multi-dimensional loading
  • Integration of data from SAP and Salesforce is better with this ETL tool compared to other tools in the market
  • Modularity
hai Mani profile photo

Cons

  • Sqoop2 development seems to have stalled. I have set it up outside of a Cloudera CDH installation, and I actually prefer it's "Sqoop Server" model better than just the CLI client version that is Sqoop1. This works especially well in a microservices environment, where there would be only one place to maintain the JDBC drivers to use for Sqoop.
Jordan Moore profile photo
  • Several partnerships diminishing the value of technologies
  • Unable to get list of objects from Repository (like sources & targets) that don't have any dependency
  • Scheduling: The built-in scheduling tool has many constraints such as handling Unix/VB scripts etc. Most enterprises use third party tools for this.
hai Mani profile photo

Likelihood to Renew

No score
No answers yet
No answers on this topic
Informatica Enterprise Data Integration8.9
Based on 3 answers
Availability of options to integrate with application oriented databases like SAP, Salesforce etc.Product Support.Reliability.
hai Mani profile photo

Alternatives Considered

  • Sqoop comes preinstalled on the major Hadoop vendor distributions as the recommended product to import data from relational databases. The ability to extend it with additional JDBC drivers makes it very flexible for the environment it is installed within.
  • Spark also has a useful JDBC reader, and can manipulate data in more ways than Sqoop, and also upload to many other systems than just Hadoop.
  • Kafka Connect JDBC is more for streaming database updates using tools such as Oracle GoldenGate or Debezium.
  • Streamsets and Apache NiFi both provide a more "flow based programming" approach to graphically laying out connectors between various systems, including JDBC and Hadoop.
Jordan Moore profile photo
Because of these key features :
  • Industry-leading data integration technology
  • Proven best-practice implementation methodology
  • End-to-end data migration services
No photo available

Return on Investment

  • When combined with Cloudera's HUE, it can enable non-technical users to easily import relational data into Hadoop.
  • Being able to manipulate large datasets in Hadoop, and them load them into a type of "materialized view" in an external database system has yielded great insights into the Hadoop datalake without continuously running large batch jobs.
  • Sqoop isn't very user-friendly for those uncomfortable with a CLI.
Jordan Moore profile photo
  • Increased reporting strategy
  • Data discovery and profiling
  • Master data management
  • ata lineage and data archiving
No photo available

Pricing Details

Apache Sqoop

General
Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No
Additional Pricing Details

Informatica Enterprise Data Integration

General
Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
Additional Pricing Details