ClickHouse vs. Informatica Cloud Data Quality

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
ClickHouse
Score 7.7 out of 10
N/A
ClickHouse is an open-source, column-oriented OLAP database system enabling real-time analytical reports using SQL queries. With linear scalability, it handles trillions of rows and petabytes of data. ClickHouse Cloud offers a scalable serverless solution for real-time analytics.N/A
Informatica Cloud Data Quality
Score 6.6 out of 10
N/A
The vendor states that Informatica Data Quality empowers companies to take a holistic approach to managing data quality across the entire organization, and that with Informatica Data Quality, users are able to ensure the success of data-driven digital transformation initiatives and projects across users, types, and scale, while also automating mission-critical tasks.N/A
Pricing
ClickHouseInformatica Cloud Data Quality
Editions & Modules
No answers on this topic
No answers on this topic
Offerings
Pricing Offerings
ClickHouseInformatica Cloud Data Quality
Free Trial
YesNo
Free/Freemium Version
YesNo
Premium Consulting/Integration Services
YesNo
Entry-level Setup FeeOptionalNo setup fee
Additional DetailsPay for what is used: It automatically scales up and down compute resources based on the user's workload It scales storage and compute separately It automatically scales unused resources down to zero so that users don’t pay for idle services
More Pricing Information
Community Pulse
ClickHouseInformatica Cloud Data Quality
Features
ClickHouseInformatica Cloud Data Quality
Data Quality
Comparison of Data Quality features of Product A and Product B
ClickHouse
-
Ratings
Informatica Cloud Data Quality
8.2
4 Ratings
3% below category average
Data source connectivity00 Ratings8.94 Ratings
Data profiling00 Ratings8.74 Ratings
Master data management (MDM) integration00 Ratings8.24 Ratings
Data element standardization00 Ratings7.14 Ratings
Match and merge00 Ratings7.94 Ratings
Address verification00 Ratings8.44 Ratings
Best Alternatives
ClickHouseInformatica Cloud Data Quality
Small Businesses
InterSystems IRIS
InterSystems IRIS
Score 7.8 out of 10
HubSpot Data Hub
HubSpot Data Hub
Score 8.1 out of 10
Medium-sized Companies
InterSystems IRIS
InterSystems IRIS
Score 7.8 out of 10
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.0 out of 10
Enterprises
SAP IQ
SAP IQ
Score 10.0 out of 10
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.0 out of 10
All AlternativesView all alternativesView all alternatives
User Ratings
ClickHouseInformatica Cloud Data Quality
Likelihood to Recommend
10.0
(2 ratings)
9.0
(19 ratings)
Likelihood to Renew
-
(0 ratings)
6.6
(14 ratings)
Usability
-
(0 ratings)
8.0
(1 ratings)
Availability
-
(0 ratings)
9.0
(2 ratings)
Performance
-
(0 ratings)
9.0
(1 ratings)
Online Training
-
(0 ratings)
10.0
(1 ratings)
Implementation Rating
-
(0 ratings)
10.0
(1 ratings)
Product Scalability
-
(0 ratings)
9.0
(1 ratings)
User Testimonials
ClickHouseInformatica Cloud Data Quality
Likelihood to Recommend
ClickHouse, Inc.
The most important thing when using ClickHouse is to be clear that the scenarios in which you want to use it really are the right ones. Many users think that when a database is very fast for a specific use case, it can be extrapolated to other contexts (most of the time different) in which a previous analysis has not been carried out.
ClickHouse is an analytical database, as such, it should be used for such purposes, where the information is stored correctly, the data volumes are really large and the queries to be performed are not the typical traditional queries on several columns with multiple aggregations. ClickHouse is not the solution for this.
On the other hand, if your case is not one of the above, it is quite possible that ClickHouse can help you. Where ClickHouse shines is when you are looking for aggregation over a particular column in large volumes of data.
Read full review
Informatica
For effective data collaboration, systematic verification of customer information, and address, among others, Informatica Data Quality is a fruitful application to consider. Besides, Informatica Data Quality controls quality through a cleansing process, giving the company a professional outline of candid data profiling and reputable analytics. Finally, Informatica Data Quality allows the simplistic navigation of content, with a dashboard that supports predictability.
Read full review
Pros
ClickHouse, Inc.
  • Their MergeTree table engine provide impressive performance for data insert in bulk
  • Not only data insert but also the way MergeTree engine uses Primary Keys to sort the data and perform data skipping based on the granules its also their secret for ridiculous fast queries
  • Data compression its also great
  • They provide especial table engines that allow you to read data directly from other sources like S3
  • Since its written with C++ you have very granular data types and especial ones like enum, LowCardinality and etc, they save you a lot of storage since are stored as integer values
  • ClickHouse functions besides the ones that respect ANSI Standards are also awesome and useful
Read full review
Informatica
  • The matching algorithms in IDQ are very powerful if you understand the different types that they offer (e.g., Hamming Distance, Jaro, Bigram, etc..). We had to play around with it to see which best suit our own needs of identifying and eliminating duplicate customers. Setting up the whole process (e.g., creating the KeyGenerator Transformation, setting up the matching threshold, etc..) can be somewhat time consuming and a challenge if you don't first standardize your data.
  • The integration with PowerCenter is great if you have both. You can either import your mappings directly to PowerCenter or to an XML file. The only downside is that some of the transformations are unique to IDQ, so you are not really able to edit them once in PowerCenter.
  • The standardizer transformation was key in helping us standardize our customer data (e.g., names, addresses, etc..). It was helpful due to having create a reference table containing the standardized value and the associated unstandardized values. What was great was that if you used Informatica Analyst, a business analyst could login and correct any of the values.
Read full review
Cons
ClickHouse, Inc.
  • Avro data manipulation
  • Kafka consistency
  • DDL operations errors (by replica configuration)
Read full review
Informatica
  • Several partnerships diminishing the value of technologies
  • Unable to get list of objects from Repository (like sources & targets) that don't have any dependency
  • Scheduling: The built-in scheduling tool has many constraints such as handling Unix/VB scripts etc. Most enterprises use third party tools for this.
Read full review
Likelihood to Renew
ClickHouse, Inc.
No answers on this topic
Informatica
As pointed out earlier, due all the robust features IDQ has, our use f the product is successful and stable. IDQ is being used in multiple sources (from CRM application and in batch mode). As this is an iterative process, we are looking to improve our system efficiency using IDQ.
Read full review
Usability
ClickHouse, Inc.
No answers on this topic
Informatica
Easy to use not only for developers but also business users
Read full review
Reliability and Availability
ClickHouse, Inc.
No answers on this topic
Informatica
The application works well except an occasional error out while using the system. It usually gets fixed when restarting the Infa server
Read full review
Performance
ClickHouse, Inc.
No answers on this topic
Informatica
Performance works just fine. It was able to load 200+ business terms, 150+ DQ automation, etc. very well.
Read full review
Alternatives Considered
ClickHouse, Inc.
ClickHouse outperforms, especially in costs, since its compression/indexing engines are so smart, and even with very low computing power, you can already perform huge analyses of the data.
Read full review
Informatica
IDQ is used by a department at my organisation to ensure and enhance the data quality.
The usage was started with address standardization and now it had been brought to altogether a next level of quality check where it fixes duplicates, junk characters, standardize the names, streets, product descriptions.
In the past we had issues mainly with duplicate customers and products and this were affecting the sales projection and estimates.
Read full review
Scalability
ClickHouse, Inc.
No answers on this topic
Informatica
Scalability works as expected and it is truly an enterprise system.
Read full review
Return on Investment
ClickHouse, Inc.
  • Queries that used to take more than 2 minutes now take less than 1 second
  • Possibility to analyze use cases in real time (before was impossible)
  • The applications are more complete and the users decisions are better
Read full review
Informatica
  • Integration with tools like PowerCenter helped faster delivery of product, and at the same time conversion
  • Reduce overall project cost due to bad data , bad quality, exceptions identified nearing go-live and post production
  • Employee efficiency is increased exponentially due to more automated, customized tool
Read full review
ScreenShots