Data Warehouse Software
Data Warehouse Software Overview
What is Data Warehouse Software?
A data warehouse is a database designed for data analysis instead of standard transactional processing. A data warehouse acts as a conduit between operational data stores and supports analytics on the composite data. Slices of data from the warehouse (e.g. summary data for a single department, such as sales or finance) are stored in a “data mart” for quick access.
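As a rough illustration of the data mart idea, the sketch below uses Python's built-in sqlite3 module as a stand-in for a warehouse. The table names (warehouse_facts, sales_mart), the columns, and the figures are all hypothetical; a real warehouse would use its own engine and schema.

```python
import sqlite3

# In-memory SQLite database standing in for a warehouse (illustrative assumption).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE warehouse_facts (department TEXT, month TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO warehouse_facts VALUES (?, ?, ?)",
    [
        ("sales", "2024-01", 1200.0),
        ("sales", "2024-01", 800.0),
        ("sales", "2024-02", 500.0),
        ("finance", "2024-01", 300.0),
    ],
)

# The "data mart": a pre-aggregated slice holding only one department's summary data.
conn.execute(
    """
    CREATE TABLE sales_mart AS
    SELECT month, SUM(amount) AS total_amount
    FROM warehouse_facts
    WHERE department = 'sales'
    GROUP BY month
    """
)

mart_rows = conn.execute(
    "SELECT month, total_amount FROM sales_mart ORDER BY month"
).fetchall()
print(mart_rows)
```

The sales team can then query the small sales_mart table directly instead of scanning every department's rows in the full warehouse table.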
In order for a data warehouse to support decision-making effectively, data extracted from various data sources and loaded into the warehouse is normalized: it is organized into tables, cleaned of redundancy, and transformed for consistency. The process by which this happens is called Extract, Transform, and Load (ETL). Once appropriately structured, the data is made available for querying and analysis.
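The three ETL stages can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not any vendor's pipeline: the inline CSV text, the sales table, and the function names extract, transform, and load are assumptions made for the example.

```python
import csv
import io
import sqlite3

# Hypothetical raw export from an operational system (inline to keep the sketch self-contained).
RAW_CSV = """customer,region,amount
 Acme Corp ,north,100.50
acme corp,North,100.50
Globex,SOUTH,75
"""

def extract(text):
    """Extract: read rows from a flat-file source."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: normalize whitespace and casing, convert types, drop duplicates."""
    seen = set()
    cleaned = []
    for row in rows:
        record = (
            row["customer"].strip().title(),
            row["region"].strip().lower(),
            float(row["amount"]),
        )
        if record not in seen:
            seen.add(record)
            cleaned.append(record)
    return cleaned

def load(conn, records):
    """Load: write the cleaned records into a warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales (customer TEXT, region TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", records)
    conn.commit()

conn = sqlite3.connect(":memory:")
load(conn, transform(extract(RAW_CSV)))
rows = conn.execute(
    "SELECT customer, region, amount FROM sales ORDER BY customer"
).fetchall()
print(rows)
```

The two "Acme Corp" rows differ only in spacing and capitalization, so after the transform step they collapse into a single record before loading.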
Data Warehouse Features & Capabilities
To support analyses, data warehouses provide the following capabilities:
Associated input, extract, and data management tools for preparation
Extract from a multitude of source file types (flat files, Excel, application data, etc.)
May load & normalize structured, semi-structured, or unstructured data
Data transformation (cleansing, deduplication, consistency)
Data reconciliation for various naming conventions
Native & autonomous storage and processing optimization
Provide a 360-degree view of all enterprise data
Multiple deployment options (private or public cloud, on-premises, hybrid cloud)
Available as-a-service (automated infrastructure management)
Integrated machine-learning algorithms, AI
Access-controlled data sharing, data marts
Deploy a virtualized data warehouse for extra security and access control
Built-in data encryption for high-security needs
Data warehouses from full-stack vendors are often sold as standalone products that must be integrated with other tools. Many data warehouses can be deployed and tested with ease under a free trial for 30 or 60 days. Vendors compete on performance but also on pricing. Many popular data warehouses feature on-demand pricing, based on (for instance) compute per second. Alternatively, some vendors offer a reduction in on-demand pricing for annual or multi-year commitments.
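To make the pricing comparison concrete, here is a small worked example in Python. The per-second rate, the usage, and the 25% commitment discount are made-up numbers chosen for easy arithmetic; they do not reflect any vendor's actual pricing.

```python
def on_demand_cost(seconds_used, rate_per_second):
    """Cost when billed purely per second of compute."""
    return seconds_used * rate_per_second

def committed_cost(seconds_used, rate_per_second, discount):
    """Cost when a commitment discount (fraction between 0 and 1) applies to the on-demand rate."""
    return on_demand_cost(seconds_used, rate_per_second) * (1 - discount)

# Hypothetical figures: 100 hours of compute at $0.001 per second.
seconds_used = 100 * 3600
rate_per_second = 0.001

on_demand = round(on_demand_cost(seconds_used, rate_per_second), 2)
committed = round(committed_cost(seconds_used, rate_per_second, 0.25), 2)

print(f"On-demand: ${on_demand:.2f}")
print(f"With 25% commitment discount: ${committed:.2f}")
```

Under these assumptions the same workload costs $360.00 on demand and $270.00 with the commitment discount. The discount only pays off if the committed capacity is actually used.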
Data Warehouse Products
Arm Treasure Data manages customer data for global brands and Fortune 500 enterprises like Mattel, Subaru, Canon, LG and disruptive startups like Wish.com, Fivestars, and Zoom.
SAP Business Warehouse (or SAP NetWeaver Business Warehouse) is a data warehouse for businesses.
Amazon Redshift is a hosted data warehouse solution, from Amazon Web Services.
Oracle Exadata is software and hardware engineered to support high-performance running of Oracle databases.
Databricks in San Francisco offers the Databricks Unified Analytics Platform, a data science platform and Apache Spark cluster manager.
The Vertica Analytics Platform supplies enterprise data warehouses with big data analytics capabilities and modernization. Vertica is owned and supported by Micro Focus.
Snowflake is the eponymous data warehouse with an emphasis on analysis, from the company in San Mateo, California.
ParAccel is a data warehouse appliance (DWA) option now offered by Actian, since the acquisition of ParAccel (April 2013).
SAP HANA, Express Edition is a scaled down version of the HANA product that can run on laptops and other resource-constrained hosts, such as cloud-hosted virtual machines. The product is free to use for in-memory databases up to 32GB of RAM.
Part of IBM PureSystems, IBM Netezza Data Warehouse Appliances is that company's big data-focused DWA offering.
SQL Data Warehouse from Microsoft is a data warehouse-as-a-service for enterprises.
MemSQL is a distributed, in-memory SQL database offering from the company of the same name in San Francisco.
Stitch in Philadelphia offers its flagship ETL tool to developers; the company was spun off from RJMetrics after that company's acquisition by Magento, and is an independent entity.
Analyza is a business intelligence solution that is provided by PIT Business. The vendor provides support services in Luxembourg, Belgium, France, and other parts of Europe. Analyza allows users to: build several indicators and dashboards; build ad-hoc analysis without IT knowledge; navigate throu...
IBM dashDB is a fully managed cloud data warehouse, ideal for business intelligence reporting and analytics. Use dashDB to store relational data, including special types such as geospatial data. Then analyze that data with SQL or advanced built-in analytics like Netezza predictive analytics, anal...
Teradata data warehouse appliance is that company's DWA offering.
Merrill Corporation in St Paul offers DatasiteOne, the company's virtual data room (VDR) for due diligence, investment banking, IPOs, private equity or venture capital, and other goals, featuring easy-to-use tools (e.g. drag-and-drop files, in-document text search, etc.), five-minute setup, and as we...
IBM InfoSphere Warehouse, since late 2013, is a legacy data warehouse brand.
The IBM Smart Analytics System provides an integrated data warehouse and analytics solution.
Pivotal Greenplum (formerly from EMC) is a massively parallel processing (MPP) data platform, based on the open source Greenplum Database. The data warehouse application is supported by Pivotal Software.