Apache Flume is well suited when the use case is log data ingestion and aggregate only, for example for compliance of configuration management. It is not well suited where you need a general-purpose real-time data ingestion pipeline that can receive log data and other forms of data streams (eg IoT, messages).
Datameer is a great tool if someone is capable of keeping the most recent version of the tool up to date along with the most recent version of the distribution of Hadoop. The tool is easy to support but it must have someone who can run the back end processes
It leverages scalability, flexibility and cost-effectiveness of hadoop to deliver an end-user focused analytic platform for big data without involvement of IT.
It overcomes Hadoop`s complexity by providing GUI interface with pre-built functions across integration, analytics and data visualization .
Excel feature is awesome for business users which is already provided by Datameer.
Using datameer now user can do smart analytic using Decision Trees, Column dependency and recommendation.
Recently HTML5 inclusion is making application to available on a wider range of devices, including the iPad and other mobile devices which does not support Flash.
It can be used in premise or in a cloud computing environment.
Wizard-based data integration designed for IT and business users to schedule and do transformation of large sets of structured, semi-structured and unstructured data without any knowledge of Hadoop ecosystem.
Employees with intermediate SQL and Hive knowledge can generate reports faster than using Datameer . It does have visualization tool but I don't think it is anything that cannot be accomplished by importing the data in Excel
Apache Flume is open-source so support is limited. Never the less, it has great documentation and best practices documents from their end-users so it is not hard to use, setup and configure.
Apache Flume is a very good solution when your project is not very complex at transformation and enrichment, and good if you have an external management suite like Cloudera, Hortonworks, etc. But it is not a real EAI or ETL like AB Initio or Attunity so you need to know exactly what you want. On the other hand being an opensource project give Apache a lot of room to personalize thanks to its plug-able architecture and has a very nice performance having a very low CPU and Memory footprint, a single server can do the job on many occasions, as opposed to the multi-server architecture of paid products.
Pricing, support, and ease of use. We plan to scale up our data over the net few years and Datameer gives us all the things we need in one tool. Handles large transformations quickly and works with all the cloud data warehouses.
Datameer's per-user pricing sealed the deal for us as we plan to transfer much more data over the next few years. We looked at Fivetran but the usage pricing discourages growth. We also looked at Informatica but it was too expensive and didn't work as well with other BI tools like Datameer does.