Apache Flume is a product enabling the flow of logs and other data into a Hadoop environment.
N/A
Datameer
Score 8.4 out of 10
N/A
Datameer helps businesses clean up, combine, and organize data to make sense of it and use it for reports and machine learning.
N/A
IBM InfoSphere Information Server
Score 8.0 out of 10
N/A
IBM InfoSphere Information Server is a data integration platform used to understand, cleanse, monitor and transform data. The offerings provide massively parallel processing (MPP) capabilities.
N/A
Pricing
Apache Flume
Datameer
IBM InfoSphere Information Server
Editions & Modules
No answers on this topic
Team/Enterprise
Contact for pricing
per month Team
No answers on this topic
Offerings
Pricing Offerings (Apache Flume / Datameer / IBM InfoSphere Information Server)
Free Trial: No / Yes / No
Free/Freemium Version: No / No / No
Premium Consulting/Integration Services: No / No / No
Entry-level Setup Fee: No setup fee / No setup fee / No setup fee
Additional Details: - / - / -
Features
Apache Flume
Datameer
IBM InfoSphere Information Server
Data Source Connection
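Apache Flume: 0 ratings
Datameer: 0 ratings
IBM InfoSphere Information Server: 8.7 (4 ratings), 5% above category average
Connect to traditional data sources: Apache Flume 0 ratings / Datameer 0 ratings / IBM InfoSphere Information Server 9.9 (4 ratings)
Connect to Big Data and NoSQL: Apache Flume 0 ratings / Datameer 0 ratings / IBM InfoSphere Information Server 7.5 (4 ratings)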
Data Transformations
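Apache Flume: 0 ratings
Datameer: 0 ratings
IBM InfoSphere Information Server: 9.6 (4 ratings), 16% above category average
Simple transformations: Apache Flume 0 ratings / Datameer 0 ratings / IBM InfoSphere Information Server 10.0 (4 ratings)
Complex transformations: Apache Flume 0 ratings / Datameer 0 ratings / IBM InfoSphere Information Server 9.2 (4 ratings)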
Data Modeling
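Apache Flume: 0 ratings
Datameer: 0 ratings
IBM InfoSphere Information Server: 8.0 (4 ratings), 2% above category average
Data model creation: Apache Flume 0 ratings / Datameer 0 ratings / IBM InfoSphere Information Server 8.7 (2 ratings)
Metadata management: Apache Flume 0 ratings / Datameer 0 ratings / IBM InfoSphere Information Server 7.7 (4 ratings)
Business rules and workflow: Apache Flume 0 ratings / Datameer 0 ratings / IBM InfoSphere Information Server 8.4 (4 ratings)
Collaboration: Apache Flume 0 ratings / Datameer 0 ratings / IBM InfoSphere Information Server 8.0 (4 ratings)
Testing and debugging: Apache Flume 0 ratings / Datameer 0 ratings / IBM InfoSphere Information Server 7.1 (4 ratings)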
Data Governance
Apache Flume is well suited when the use case is log data ingestion and aggregation only, for example for configuration management compliance. It is not well suited where you need a general-purpose real-time data ingestion pipeline that can receive log data and other forms of data streams (e.g., IoT, messages).
Datameer is a great tool if someone is able to keep it up to date alongside the most recent version of the Hadoop distribution. The tool is easy to support, but it needs someone who can run the back-end processes.
Information Server is extremely useful for replacing manual development work that requires a lot of coding effort. It significantly increases productivity in both the initial development and the future maintenance of processes, since it has a visual development environment with self-documentation.
It leverages the scalability, flexibility, and cost-effectiveness of Hadoop to deliver an end-user-focused analytics platform for big data without the involvement of IT.
It overcomes Hadoop's complexity by providing a GUI with pre-built functions across integration, analytics, and data visualization.
The Excel feature, which Datameer already provides, is awesome for business users.
With Datameer, users can now do smart analytics using decision trees, column dependencies, and recommendations.
The recent inclusion of HTML5 makes the application available on a wider range of devices, including the iPad and other mobile devices that do not support Flash.
It can be used on premises or in a cloud computing environment.
Wizard-based data integration is designed for IT and business users to schedule and transform large sets of structured, semi-structured, and unstructured data without any knowledge of the Hadoop ecosystem.
Employees with intermediate SQL and Hive knowledge can generate reports faster than they can using Datameer. It does have a visualization tool, but I don't think it offers anything that cannot be accomplished by importing the data into Excel.
Apache Flume is open source, so support is limited. Nevertheless, it has great documentation and best-practice documents from its end users, so it is not hard to use, set up, and configure.
Apache Flume is a very good solution when your project does not involve complex transformation and enrichment, and it works well if you have an external management suite such as Cloudera or Hortonworks. But it is not a real EAI or ETL tool like Ab Initio or Attunity, so you need to know exactly what you want. On the other hand, being an open-source project gives Apache Flume a lot of room for customization thanks to its pluggable architecture, and it performs very well with a very low CPU and memory footprint: a single server can do the job on many occasions, as opposed to the multi-server architecture of paid products.
Pricing, support, and ease of use. We plan to scale up our data over the next few years, and Datameer gives us all the things we need in one tool. It handles large transformations quickly and works with all the cloud data warehouses.
Datameer's per-user pricing sealed the deal for us, as we plan to transfer much more data over the next few years. We looked at Fivetran, but the usage pricing discourages growth. We also looked at Informatica, but it was too expensive and didn't work as well with other BI tools as Datameer does.