Data Cleansing Tools
Data Cleansing Tools Overview
What are Data Cleansing Tools?
Data cleansing, or data scrubbing, is the process of removing corrupt, inaccurate, or inconsistent data from a database. Regular data cleansing corrects records that contain incorrect formatting, typographical mistakes, or other errors. Cleansing can also mean harmonizing records so that they are consistent with each other.
Data cleansing software systematically searches for discrepancies or anomalies by using algorithms or lookup tables. It then corrects the issues. An automated process of this kind is much more efficient than trying to fix errors by hand.
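As a rough illustration of the lookup-table approach described above, the following Python sketch normalizes a state field against a small table of known variants and counts how many records were corrected. The table contents, field names, and function names are hypothetical and chosen only for illustration; commercial tools rely on much larger reference data and more sophisticated matching.

```python
# Hypothetical lookup table mapping known variants to a canonical value.
STATE_LOOKUP = {
    "calif.": "CA",
    "california": "CA",
    "ca": "CA",
    "n.y.": "NY",
    "new york": "NY",
    "ny": "NY",
}


def clean_state(value):
    """Return (cleaned_value, was_changed) for a single state field."""
    key = value.strip().lower()
    canonical = STATE_LOOKUP.get(key)
    if canonical is None:
        # Unknown value: leave it untouched so a person can review it.
        return value, False
    return canonical, canonical != value


def clean_records(records):
    """Apply the lookup to every record and count the corrections made."""
    cleaned = []
    corrections = 0
    for rec in records:
        new_state, changed = clean_state(rec["state"])
        cleaned.append({**rec, "state": new_state})
        corrections += int(changed)
    return cleaned, corrections


if __name__ == "__main__":
    sample = [
        {"name": "A", "state": " Calif. "},
        {"name": "B", "state": "NY"},
        {"name": "C", "state": "Texas"},
    ]
    cleaned, corrections = clean_records(sample)
    print(cleaned, corrections)
```

Values that are not in the table are passed through unchanged rather than guessed at, which keeps the automated step from introducing new errors.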
Regular data cleansing operations are critically important because erroneous data records can ultimately lead to erroneous conclusions and misguided investment decisions. Data cleansing tools often overlap in features with data deduplication, data preparation, and data quality tools.
Data Cleansing Tools Features & Capabilities
Data cleansing tools generally contain these and other similar feature sets:
Raw data ingestion
Support for a wide variety of data formats (e.g. .csv, .xml)
Phone & email validation
Address & zip code cleansing
Auto or manual data mapping
Data consolidation and ETL
Data validation, matching, reconciliation
Data analysis, charting
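To make a few of the listed features more concrete, here is a minimal Python sketch of email and zip code validation, phone normalization, and simple duplicate matching. The regular expressions are deliberately simplified, the phone and zip rules assume US formats, and all function and field names are hypothetical; none of this reflects how any particular product implements these checks.

```python
import re

# Simplified patterns for illustration only; real validators are stricter.
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")
ZIP_RE = re.compile(r"^\d{5}(-\d{4})?$")  # assumes US ZIP or ZIP+4


def is_plausible_email(value):
    """Check only the rough shape of an address, not whether it exists."""
    return bool(EMAIL_RE.match(value.strip()))


def is_valid_zip(value):
    """Accept 5-digit ZIP codes with an optional 4-digit extension."""
    return bool(ZIP_RE.match(value.strip()))


def normalize_us_phone(value):
    """Return a 10-digit string, or None if the number cannot be normalized."""
    digits = re.sub(r"\D", "", value)
    if len(digits) == 11 and digits.startswith("1"):
        digits = digits[1:]  # drop the US country code
    return digits if len(digits) == 10 else None


def deduplicate(records):
    """Keep the first record seen for each case-insensitive email address."""
    seen = set()
    unique = []
    for rec in records:
        key = rec["email"].strip().lower()
        if key not in seen:
            seen.add(key)
            unique.append(rec)
    return unique
```

Matching here uses an exact comparison on a normalized key. Products that advertise matching and reconciliation typically go further, for example with fuzzy comparison across several fields.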
Some simple data cleansing tools are open source and available for free. Vendors of business intelligence or data management tools also provide data cleansing tools, and these vendors may offer a free 30-day trial of their data cleansing products. After the trial, the tools are available on a subscription basis. Pricing typically depends on the volume of data stored or exported, and may also depend on the number of validations (e.g. email, address) performed.
Data Cleansing Products
DemandTools for AppExchange is a data quality toolset for Salesforce.com CRM-centric customers. The product comprises 11 individual modules to control, standardize, verify, deduplicate, import and manipulate Salesforce and/or Force.com data.
Dataloader.io delivers a cloud-based solution to import and export information from Salesforce.
The V12 Data Platform (formerly called the Launchpad Marketing Cloud) comprises a collection of online and offline marketing solutions designed to manage existing customer relationships and identify new prospective customers by granting users access to The V12 Group Data Cloud. It...
Talend Data Quality is an open source data management tool handling parsing, standardization, matching and data profiling.
Cloudingo, a cloud-based SaaS product, connects to Salesforce.com and allows system administrators to scan their entire database for similar or duplicate records. Cloudingo was launched in late 2011. It is well known for its ease of use and rich user experience.
Informatica MDM is an enterprise master data management solution that competes directly with IBM's InfoSphere and Oracle's Siebel UCM product. The product has about 200 licensed users.
Ataccama is a data quality platform handling data parsing, standardization, cleansing and matching, and data profiling.
Prospecta Master Data Online (MDO) is a web-based tool that manages the governance and standardization of all types of master data across the user’s business. It allows business users to create information with standardized business rules, workflows and approval processes. MDO is available both...
TIBCO Clarity is an automated data cleansing application supporting removal or merging of duplicate records, formatting and transformation, as well as trend or pattern detection in datasets.
Trillium Software System, supported by Syncsort since its 2016 acquisition, is a suite of products offering data quality improvement for business intelligence data, and data cleansing tools. It includes the components Trillium Quality for data cleansing and enrichment, Trillium Discovery...
Unique Entry is the first and only application on the Salesforce AppExchange to check for possible duplicates and alert users to them as Leads, Accounts, and Contacts are typed into Salesforce.
RedPoint Data Management & Quality handles the core requirements of data management including data quality, with data-profiling and general-purpose data-cleansing functionality, including parsing, standardization, matching and cleansing. The platform functions across NoSQL, Hadoop, and...
Clear Analytics is a business intelligence solution that enables nontechnical end users to perform analytics by leveraging existing knowledge of Excel coupled with a built-in query builder. Some key features include: Dynamic Data Refresh, Data Share and In-Excel Collaboration.
VeriAS is a secure enterprise-level platform that is designed to analyze, verify and score email lists in order to flag hard-bouncing as well as malicious email addresses. The vendor’s value proposition is that their solution safeguards an organization’s email resources and improves sender...
MailBoxValidator is a solution for verifying and cleaning email lists. This solution connects to a mail server and checks whether the mailbox exists. The vendor’s value proposition is that their solution reduces bounce rates, increases conversion rates, and improves sender reputation.
StarDQ is a real-time solution for cleansing, de-duping, and enriching enterprise data. By integrating the StarDQ solution, organizations can cleanse, match and unify data across multiple data sources and data domains. According to the vendor, the goal is to ensure that data is a...
Organizations need to make data-driven decisions. But how do they obtain sound data and scale and operationalize those data-driven decisions? Self-service data preparation solutions from SAP can help. The SAP® Agile Data Preparation application provides self-service access to trusted data,...