Skip to main content
TrustRadius
OpenRefine

OpenRefine

Overview

What is OpenRefine?

OpenRefine is a free, open-source tool designed to clean and transform messy data. It is positioned as a solution suitable for companies of all sizes, from small startups to large enterprises. According to the project, OpenRefine is utilized by data analysts, data scientists, researchers, librarians,...

Read more
Recent Reviews

TrustRadius Insights

OpenRefine is a powerful tool that simplifies data cleaning and processing for businesses. It has been highly recommended by many users, …
Continue reading
TrustRadius

Leaving a review helps other professionals like you evaluate Data Cleansing Tools

Be the first one in your network to review OpenRefine, and make your voice heard!

Return to navigation

Pricing

View all pricing
N/A
Unavailable

What is OpenRefine?

OpenRefine (previously Google Refine) is a tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data. OpenRefine always keeps data private on the user's own computer until the user…

Entry-level set up fee?

  • No setup fee

Offerings

  • Free Trial
  • Free/Freemium Version
  • Premium Consulting/Integration Services

Would you like us to let the vendor know that you want pricing?

5 people also want pricing

Alternatives Pricing

What is Alteryx Designer Cloud?

Trifacta is a "data wrangling" (or data preparation) platform particularly of use with Hadoop, developed by the company Trifacta headquartered in San Francisco, California. Alteryx announced their acquisition of Trifacta in January of 2022.

N/A
Unavailable
What is Datameer?

Datameer helps businesses clean up, combine, and organize data to make sense of it and use it for reports and machine learning.

Return to navigation

Product Demos

OpenRefine Demo

YouTube

OpenRefine demo

YouTube

OpenRefine demo: a tool to clean up and enrich datasets - Arthur P Smith at WikidataCon 2017

YouTube
Return to navigation

Product Details

What is OpenRefine?

OpenRefine is a free, open-source tool designed to clean and transform messy data. It is positioned as a solution suitable for companies of all sizes, from small startups to large enterprises. According to the project, OpenRefine is utilized by data analysts, data scientists, researchers, librarians, and journalists across various industries to efficiently manage and enhance their datasets.

Key Features

Faceting: Drill through large datasets using facets and apply operations on filtered views of your dataset. Faceting allows you to explore your data by creating filters based on specific values or ranges within a column. You can apply operations on the filtered views to clean and transform the data.

Clustering: Fix inconsistencies by merging similar values thanks to powerful heuristics. Clustering helps identify similar values within a column and allows you to merge them into a single value. This feature is useful for cleaning up data with variations or misspellings.

Reconciliation: Match your dataset to external databases via reconciliation services. Reconciliation allows you to match your data against external databases to ensure accuracy and consistency. It helps in identifying and linking entities in your dataset to existing entities in external databases.

Infinite undo/redo: Rewind to any previous state of your dataset and replay your operation history on a new version of it. OpenRefine keeps track of all the operations performed on your dataset, allowing you to easily undo or redo any changes. This feature provides flexibility and ensures that you can revert to previous versions of your data.

Privacy: Your data is cleaned on your machine, not in some dubious data laundering cloud. OpenRefine operates locally on your machine, ensuring that your data remains private and secure. There is no need to worry about your data being processed on external servers.

Wikibase: Contribute to Wikidata, the free knowledge base anyone can edit, and other Wikibase instances. OpenRefine integrates with Wikibase, allowing you to contribute to Wikidata and other Wikibase instances. You can add and edit data in Wikibase directly from OpenRefine.

Data Cleaning: Easily clean and standardize your data by applying various transformations, such as removing duplicates, changing case, and formatting dates. OpenRefine provides a wide range of data cleaning functions to ensure the quality and consistency of your datasets.

Data Transformation: Transform your data from one format to another using OpenRefine's powerful transformation capabilities. You can split columns, merge cells, transpose data, and perform other transformations to reshape your data according to your specific requirements.

Data Enrichment: Extend your dataset with web services and external data sources to enrich your analysis. OpenRefine allows you to fetch data from APIs, web scraping, and linking to external databases, enabling you to enhance your dataset with additional information.

Data Exploration: Gain insights into your data through visualizations and statistical summaries. OpenRefine provides visualizations and statistical analysis tools that help you understand the distribution, patterns, and relationships within your data, enabling you to make informed decisions and discoveries.

OpenRefine Technical Details

Operating SystemsUnspecified
Mobile ApplicationNo
Return to navigation

Comparisons

View all alternatives
Return to navigation

Reviews

Community Insights

TrustRadius Insights are summaries of user sentiment data from TrustRadius reviews and, when necessary, 3rd-party data sources. Have feedback on this content? Let us know!

OpenRefine is a powerful tool that simplifies data cleaning and processing for businesses. It has been highly recommended by many users, reviewers, and customers due to its ease of use, quick file conversion capability, and efficient data manipulation features. With OpenRefine, large datasets can be cleaned and transformed into other applications with just a few clicks, significantly reducing the time needed for data preparation and cleaning.

Data Analysts find OpenRefine to be especially useful in identifying errors in the data set effortlessly. The software's suggested changes streamline the process of cleaning dirty data while providing a quick way to modify data tables and metadata. Data wrangling becomes an effortless task as OpenRefine can match cells within a column even if they are formatted differently, perform filtering on empty rows, split columns, run GREL to bin rows easily. Additionally, OpenRefine's small learning curve makes it accessible to all users who need to work with data.

Overall, OpenRefine solves several business problems for users such as saving time for data cleaning and preparation; helping with the identification of errors in the dataset; enabling the modification of metadata quickly and easily; making data wrangling more effortless; and allowing non-programmers access to efficient data manipulation tools.

Users commonly recommend the following actions for OpenRefine:

  • Explore OpenRefine and utilize online tutorials for learning the tool. This can greatly enhance understanding of its capabilities.
  • Use OpenRefine for cleaning metadata and other forms of data. It is a useful tool for quickly cleaning different types of data.
  • Learn the General Refine Expression Language (GREL) for more complex data manipulations. This enables greater flexibility in analyses.

By exploring OpenRefine through tutorials, using it for cleaning various types of data, and mastering GREL, users can maximize the benefits of this tool.

Sorry, no reviews are available for this product yet

Return to navigation