Item: SingleStore
Rating: 7
Author: Tianwen Chu

Overall Satisfaction with MemSQL

Use Cases and Deployment Scope

SingleStore DB (formerly MemSQL) is used as a persistent storage solution for Spark. We use SingleStore DB (formerly MemSQL) spark connector (Scala code) to bridge two techs. I am leading projects of using spark and SingleStore DB (formerly MemSQL) to process life science data. It solved the spark storage issue.

Pros and Cons

Pros

Faster query speed than traditional SQL database.
It con server in the pipeline to deal with streaming data with Kafka, spark streaming and SingleStore DB (formerly MemSQL).
It is very scalable.

Cons

Better tuning of SingleStore DB (formerly MemSQL) performance on Scale-up server
SingleStore DB (formerly MemSQL) connection between spark failed when more than around 48 partitions data processing
Provide faster python API for invoking SingleStore DB (formerly MemSQL)

Return on Investment

It offers me solution to solve spark storage problem.
It adds more complexity of my application since multiple tech softwares are involved.
More types of bugs will be encountered when doing streamliner, including hardware connection.

Alternatives Considered

I have tried using CSV as a back-end storage, yet I/O is very heavy, direct transit from spark to SingleStore DB (formerly MemSQL) in memory really beats.

Likelihood to Recommend

If data has too many joins necessities, then think of using graphdatabase technology rather than relational table.

Using MemSQL

Likelihood to Renew

I still want to see the performance about using latest version of spark and memsql. About renewal, if there is a new and better version of spark-memsql connector, then maybe.

Other Software Used

Apache Spark, Docker, Sparksee

Comments

Please log in to join the conversation

SingleStore DB (formerly MemSQL) backup Spark