Likelihood to Recommend Datameer is a great tool if someone is capable of keeping the most recent version of the tool up to date along with the most recent version of the distribution of Hadoop. The tool is easy to support but it must have someone who can run the back end processes
Read full review If you can load your data first into your warehouse, dbt is excellent. It does the T(ransformation) part of ELT brilliantly but does not do the E(xtract) or L(oad) part. If you know SQL or your development team knows SQL, it's a framework and extension around that. So, it's easy to learn and easy to hire people with that technical skill (as opposed to specific Informatica,
SnapLogic , etc. experience). dbt uses plain text files and integrates with GitHub. You can easily see the changes made between versions. In GUI-based UIs it was always hard to tell what someone had changed. Each "model" is essentially a "SELECT" statement. You never need to do a "CREATE TABLE" or "CREATE VIEW" - it's all done for you, leaving you to work on the business logic. Instead of saying "FROM specific_db.schema.table" you indicate "FROM ref('my_other_model')". It creates an internal dependency diagram you can view in a DAG. When you deploy, the dependencies work like magic in your various environments. They also have great documentation, an active slack community, training, and support. I like the enhancements they have been making and I believe they are headed in a good direction.
Read full review Pros It leverages scalability, flexibility and cost-effectiveness of hadoop to deliver an end-user focused analytic platform for big data without involvement of IT. It overcomes Hadoop`s complexity by providing GUI interface with pre-built functions across integration, analytics and data visualization . Excel feature is awesome for business users which is already provided by Datameer. Using datameer now user can do smart analytic using Decision Trees, Column dependency and recommendation. Recently HTML5 inclusion is making application to available on a wider range of devices, including the iPad and other mobile devices which does not support Flash. It can be used in premise or in a cloud computing environment. Wizard-based data integration designed for IT and business users to schedule and do transformation of large sets of structured, semi-structured and unstructured data without any knowledge of Hadoop ecosystem. Read full review user experience makes it easy to work with SQL and version control customer success team and the dbt (data build tool) community help establish best practices thorough and clear documentation Read full review Cons Concentration issues are possible while using a lot of tabs at once. In most cases, the length of a tutorial video is excessive. A more condensed design is certainly a viable option. Read full review Slow load times of the dbt cloud environment (they're working on it via a new UI though) More out-of-the-box solutions for managing procedures, functions, etc would be nice to have, but honestly, it's pretty easy to figure out how to adapt dbt macros Read full review Likelihood to Renew Employees with intermediate SQL and Hive knowledge can generate reports faster than using Datameer . It does have visualization tool but I don't think it is anything that cannot be accomplished by importing the data in Excel
Read full review Usability Easy to use for most things, starts to require some planning as your projects get more complex.
Mike Blizman Administration of Hadoop cluster - Cloudera, Datameer
Read full review Alternatives Considered Pricing, support, and ease of use. We plan to scale up our data over the net few years and Datameer gives us all the things we need in one tool. Handles large transformations quickly and works with all the cloud data warehouses.
Datameer's per-user pricing sealed the deal for us as we plan to transfer much more data over the next few years. We looked at
Fivetran but the usage pricing discourages growth. We also looked at Informatica but it was too expensive and didn't work as well with other BI tools like Datameer does.
Read full review Most ETL pipeline products have a T layer, but dbt just does it better. The transformation is on steroids compared to the others. Also, just allows much more Adhoc solutions for very specific projects. Those ETL tools are probably better on the T part if you don't need too many transforms - also dbt is pretty much free dependent on how you work it, also extremely scalable.
Read full review Return on Investment We have not been able to reach our business objectives just yet. Hadoop its a hard sell in most companies still. Legacy skills are still highly on demand and as long as an easier path leverage SQL for example is available, it would be hard to gain more adoption. Read full review Simplified our BI layer for faster load times Increased the quality of data reaching our end users Makes complex transformations manageable Read full review ScreenShots