Data Virtuality, headquartered in Leipzig, offers two products to solve data integration and management problems in a tailored way that best suits data teams in the fast-paced world of data. The Data Virtuality Platform combines data virtualization and data replication, Data Virtuality Platform provides data teams the flexibility to always choose the right method for the specific requirement. It is an enabler for Data Fabric and Data Mesh by providing the self-service capabilities and data…
N/A
Presto
Score 10.0 out of 10
N/A
Presto is an open source SQL query engine designed to run queries on data stored in Hadoop or in traditional databases.
Teradata supported development of Presto followed the acquisition of Hadapt and Revelytix.
Data Virtuality Platform's best and most unique feature is that it is SQL-based, giving us flexibility when working with our data that other marketing integration pipeline tools couldn't provide. Our main benefits are the short time it takes to connect to our data sources and the flexibility of the virtual SQL layer to meet our end users' data needs. It paves the way for us to tap into various data repositories, extract the data contained within, and examine it. It allows us to access information, generate actionable reports, and make data-driven decisions. However, inadequate data governance rules and a complex configuration process make data connectors challenging.
Presto is for interactive simple queries, where Hive is for reliable processing. If you have a fact-dim join, presto is great..however for fact-fact joins presto is not the solution.. Presto is a great replacement for proprietary technology like Vertica
Linking, embedding links and adding images is easy enough.
Once you have become familiar with the interface, Presto becomes very quick & easy to use (but, you have to practice & repeat to know what you are doing - it is not as intuitive as one would hope).
Organizing & design is fairly simple with click & drag parameters.
Presto was not designed for large fact fact joins. This is by design as presto does not leverage disk and used memory for processing which in turn makes it fast.. However, this is a tradeoff..in an ideal world, people would like to use one system for all their use cases, and presto should get exhaustive by solving this problem.
Resource allocation is not similar to YARN and presto has a priority queue based query resource allocation..so a query that takes long takes longer...this might be alleviated by giving some more control back to the user to define priority/override.
UDF Support is not available in presto. You will have to write your own functions..while this is good for performance, it comes at a huge overhead of building exclusively for presto and not being interoperable with other systems like Hive, SparkSQL etc.
Data Virtuality Platform has powerful transformation and scheduling capabilities combined with the best support team. A virtual data layer that supports SQL procedures is a game changer. Data Virtuality has made data management more efficient. It lets us access various data sources, extract, and analyze data. The short time it takes to connect to our data sources and the flexibility of the virtual SQL layer allows us to meet our end users' data needs.
Presto is good for a templated design appeal. You cannot be too creative via this interface - but, the layout and options make the finalized visual product appealing to customers. The other design products I use are for different purposes and not really comparable to Presto.