Likelihood to Recommend

Apache Flume is well suited when the use case is limited to log data ingestion and aggregation, for example for configuration management compliance. It is not well suited where you need a general-purpose real-time data ingestion pipeline that can receive log data and other forms of data streams (e.g. IoT, messages).

I spent more than a year working with SAP Vora, SAP Data Hub and SAP Leonardo with ML and IoT. I believe this product has potential, but it is not easy to adopt. SAP has to keep in mind how quickly open-source big data technologies are able to deliver results. I know SAP is stabilizing and fighting hard against many open-source technologies, but it still has a long way to go there.

Pros

Multiple data sources and destinations (sinks), which let you move data from and to any relevant data storage.
It is very easy to set up and run.
Very open to customization: you can create filters, enrichment, new sources and destinations.

Modelling with SAP HANA and Hadoop.
Real-time analysis using Vora and HANA as a streaming engine.
Time series analysis on large chunks of datasets.
Machine learning capabilities on Hadoop tables and Spark contexts.

Cons

It is very specific to log data ingestion, so it is pretty hard to use for anything other than log data.
Data replication is not built in and needs to be added on top of Apache Flume (not a hard job, though).

Vora 2.0 in on-premise scenarios could be improved, as adoption of the cloud is not an easy sell.
Kubernetes and Docker integration needs to be more seamless and quick to understand. If this is simplified, it will be easy to adopt.
Data Hub orchestration and integrations could be simplified so that quick adoption within SAP BW, ECC and S/4HANA scenarios is possible.

Support Rating

Apache Flume is open source, so support is limited. Nevertheless, it has great documentation and best-practice documents from its end users, so it is not hard to use, set up and configure.

Alternatives Considered

Apache Flume is a very good solution when your project does not involve very complex transformation and enrichment, and it works well if you have an external management suite like Cloudera, Hortonworks, etc. But it is not a real EAI or ETL tool like Ab Initio or Attunity, so you need to know exactly what you want. On the other hand, being an open-source project gives Apache Flume a lot of room for customization thanks to its pluggable architecture, and it performs very well with a very low CPU and memory footprint: a single server can do the job on many occasions, as opposed to the multi-server architecture of paid products.

Return on Investment

Flume has greatly simplified many of our ingest procedures; it is easier to deploy and integrate than a classical EAI, reducing time to market.
But, unlike EAIs, if the project starts to grow in complexity, Apache Flume may no longer be as suitable.

Negative impact: PoCs and RFIs will need more time to adopt, and decision making gets delayed.
Positive impact: it is a great leap for SAP to adopt big data technologies and AI within the cloud stream. But selling is going to take time.