What users are saying about
9 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow noopener'>trScore algorithm: Learn more.</a>Score 7.4 out of 100
Based on 9 reviews and ratings
4 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow noopener'>trScore algorithm: Learn more.</a>Score 8.8 out of 100
Based on 4 reviews and ratings
Attribute Ratings
- Apache Sqoop is rated higher in 1 area: Likelihood to Recommend
Likelihood to Recommend

8.0
Apache Flume
80%
2 Ratings

9.0
Apache Sqoop
90%
1 Rating
Support Rating

5.0
Apache Flume
50%
2 Ratings

Apache Sqoop
N/A
0 Ratings
Likelihood to Recommend
Apache Flume
Apache Flume is well suited when the use case is log data ingestion and aggregate only, for example for compliance of configuration management. It is not well suited where you need a general-purpose real-time data ingestion pipeline that can receive log data and other forms of data streams (eg IoT, messages).

Verified User
Analyst in Information Technology
Airlines/Aviation Company, 51-200 employeesApache Sqoop
Sqoop is great for sending data between a JDBC compliant database and a Hadoop environment. Sqoop is built for those who need a few simple CLI options to import a selection of database tables into Hadoop, do large dataset analysis that could not commonly be done with that database system due to resource constraints, then export the results back into that database (or another). Sqoop falls short when there needs to be some extra, customized processing between database extract, and Hadoop loading, in which case Apache Spark's JDBC utilities might be preferred
Consultant
Avalon Consulting, LLCInformation Technology and Services, 51-200 employees
Pros
Apache Flume
- Multiple sources of data (sources) and destinations (sinks) that allows you to move data form and to any relevant data storage
- It is very easy to setup and run
- Very open to personalization, you can create filters, enrichment, new sources and destinations
Global Technology Centre - Middleware
ProdubanFinancial Services, 10,001+ employees
Apache Sqoop
- Provides generalized JDBC extensions to migrate data between most database systems
- Generates Java classes upon reading database records for use in other code utilizing Hadoop's client libraries
- Allows for both import and export features
Consultant
Avalon Consulting, LLCInformation Technology and Services, 51-200 employees
Cons
Apache Flume
- It is very specific for log data ingestion so it is pretty hard to use for anything else besides log data
- Data replication is not built in and needs to be added on top of Apache Flume (not a hard job to do though)

Verified User
Analyst in Information Technology
Airlines/Aviation Company, 51-200 employeesApache Sqoop
- Sqoop2 development seems to have stalled. I have set it up outside of a Cloudera CDH installation, and I actually prefer it's "Sqoop Server" model better than just the CLI client version that is Sqoop1. This works especially well in a microservices environment, where there would be only one place to maintain the JDBC drivers to use for Sqoop.
Consultant
Avalon Consulting, LLCInformation Technology and Services, 51-200 employees
Pricing Details
Apache Flume
General
Free Trial
—Free/Freemium Version
—Premium Consulting/Integration Services
—Entry-level set up fee?
No
Starting Price
—Apache Sqoop
General
Free Trial
—Free/Freemium Version
—Premium Consulting/Integration Services
—Entry-level set up fee?
No
Starting Price
—Support Rating
Apache Flume
Apache Flume 5.0
Based on 2 answers
Apache Flume is open-source so support is limited. Never the less, it has great documentation and best practices documents from their end-users so it is not hard to use, setup and configure.

Verified User
Analyst in Information Technology
Airlines/Aviation Company, 51-200 employeesApache Sqoop
No score
No answers yet
No answers on this topic
Alternatives Considered
Apache Flume
Apache Flume is a very good solution when your project is not very complex at transformation and enrichment, and good if you have an external management suite like Cloudera, Hortonworks, etc. But it is not a real EAI or ETL like AB Initio or Attunity so
you need to know exactly what you want.On the other hand being an opensource project give Apache a lot of room to personalize thanks to its plug-able architecture and has a very nice performance having a very low CPU and Memory footprint, a single server can do the job on many occasions, as opposed to the multi-server architecture of paid products.
you need to know exactly what you want.On the other hand being an opensource project give Apache a lot of room to personalize thanks to its plug-able architecture and has a very nice performance having a very low CPU and Memory footprint, a single server can do the job on many occasions, as opposed to the multi-server architecture of paid products.
Global Technology Centre - Middleware
ProdubanFinancial Services, 10,001+ employees
Apache Sqoop
- Sqoop comes preinstalled on the major Hadoop vendor distributions as the recommended product to import data from relational databases. The ability to extend it with additional JDBC drivers makes it very flexible for the environment it is installed within.
- Spark also has a useful JDBC reader, and can manipulate data in more ways than Sqoop, and also upload to many other systems than just Hadoop.
- Kafka Connect JDBC is more for streaming database updates using tools such as Oracle GoldenGate or Debezium.
- Streamsets and Apache NiFi both provide a more "flow based programming" approach to graphically laying out connectors between various systems, including JDBC and Hadoop.
Consultant
Avalon Consulting, LLCInformation Technology and Services, 51-200 employees
Return on Investment
Apache Flume
- Flume has simplified a lot many of our ingest procedures, easier to deploy and integrate than a classical EAI, reducing the time to market
- But opposed to EAIs if the project starts to grow in complexity Apache Flume project may not be as suitable
Global Technology Centre - Middleware
ProdubanFinancial Services, 10,001+ employees
Apache Sqoop
- When combined with Cloudera's HUE, it can enable non-technical users to easily import relational data into Hadoop.
- Being able to manipulate large datasets in Hadoop, and them load them into a type of "materialized view" in an external database system has yielded great insights into the Hadoop datalake without continuously running large batch jobs.
- Sqoop isn't very user-friendly for those uncomfortable with a CLI.
Consultant
Avalon Consulting, LLCInformation Technology and Services, 51-200 employees