Druid gets the job done
Use Cases and Deployment Scope
We use Druid for rapid ingest of a variety of data sources, including traditional databases, Kafka topics, and data stored in Hadoop. Our users enjoy the easy creation of ingest specs, and the ability to ingest only the relevant columns/fields required for their programs and queries. Being able to translate and enrich data during ingest is a huge plus.
Pros
- Rapid ingest
- Limiting ingest to only the relevant fields/columns
- Easy ingest spec creation
Cons
- Security configuration is problematic
- Cluster management could have more features
- Troubleshooting incomplete tasks/jobs is a chore
Likelihood to Recommend
<div>It is extremely well suited to rapid ingest of data from large data sources, due to the fact that you can restrict what is ingested by column/field, so that you only pull in the data you actually want or need.
</div><div>
</div><div>As stated earlier, the open source version could use better cluster management tools, and troubleshooting tools for failing jobs/tasks.
</div>
