4 Reviews and Ratings
18 Reviews and Ratings
No answers on this topic
Sqoop is great for sending data between a JDBC compliant database and a Hadoop environment. Sqoop is built for those who need a few simple CLI options to import a selection of database tables into Hadoop, do large dataset analysis that could not commonly be done with that database system due to resource constraints, then export the results back into that database (or another). Sqoop falls short when there needs to be some extra, customized processing between database extract, and Hadoop loading, in which case Apache Spark's JDBC utilities might be preferred Incentivized
Datameer is a great tool if someone is capable of keeping the most recent version of the tool up to date along with the most recent version of the distribution of Hadoop. The tool is easy to support but it must have someone who can run the back end processes
Provides generalized JDBC extensions to migrate data between most database systemsGenerates Java classes upon reading database records for use in other code utilizing Hadoop's client librariesAllows for both import and export featuresIncentivized
It leverages scalability, flexibility and cost-effectiveness of hadoop to deliver an end-user focused analytic platform for big data without involvement of IT.It overcomes Hadoop`s complexity by providing GUI interface with pre-built functions across integration, analytics and data visualization .Excel feature is awesome for business users which is already provided by Datameer.Using datameer now user can do smart analytic using Decision Trees, Column dependency and recommendation.Recently HTML5 inclusion is making application to available on a wider range of devices, including the iPad and other mobile devices which does not support Flash.It can be used in premise or in a cloud computing environment.Wizard-based data integration designed for IT and business users to schedule and do transformation of large sets of structured, semi-structured and unstructured data without any knowledge of Hadoop ecosystem.
Sqoop2 development seems to have stalled. I have set it up outside of a Cloudera CDH installation, and I actually prefer it's "Sqoop Server" model better than just the CLI client version that is Sqoop1. This works especially well in a microservices environment, where there would be only one place to maintain the JDBC drivers to use for Sqoop.Incentivized
Concentration issues are possible while using a lot of tabs at once.In most cases, the length of a tutorial video is excessive.A more condensed design is certainly a viable option.Incentivized
Employees with intermediate SQL and Hive knowledge can generate reports faster than using Datameer . It does have visualization tool but I don't think it is anything that cannot be accomplished by importing the data in Excel
Easy to use for most things, starts to require some planning as your projects get more complex.
Sqoop comes preinstalled on the major Hadoop vendor distributions as the recommended product to import data from relational databases. The ability to extend it with additional JDBC drivers makes it very flexible for the environment it is installed within.Spark also has a useful JDBC reader, and can manipulate data in more ways than Sqoop, and also upload to many other systems than just Hadoop.Kafka Connect JDBC is more for streaming database updates using tools such as Oracle GoldenGate or Debezium.Streamsets and Apache NiFi both provide a more "flow based programming" approach to graphically laying out connectors between various systems, including JDBC and Hadoop.Incentivized
Pricing, support, and ease of use. We plan to scale up our data over the net few years and Datameer gives us all the things we need in one tool. Handles large transformations quickly and works with all the cloud data warehouses. Datameer's per-user pricing sealed the deal for us as we plan to transfer much more data over the next few years. We looked at Fivetran but the usage pricing discourages growth. We also looked at Informatica but it was too expensive and didn't work as well with other BI tools like Datameer does.
When combined with Cloudera's HUE, it can enable non-technical users to easily import relational data into Hadoop.Being able to manipulate large datasets in Hadoop, and them load them into a type of "materialized view" in an external database system has yielded great insights into the Hadoop datalake without continuously running large batch jobs.Sqoop isn't very user-friendly for those uncomfortable with a CLI.Incentivized
We have not been able to reach our business objectives just yet.Hadoop its a hard sell in most companies still.Legacy skills are still highly on demand and as long as an easier path leverage SQL for example is available, it would be hard to gain more adoption.Incentivized