Apache Spark vs. Microsoft SQL Server

Apache Spark

Apache Spark

165 Reviews and Ratings

Microsoft SQL Server

Microsoft SQL Server

1673 Reviews and Ratings

Overview
Product	Rating	Most Used By	Product Summary	Starting Price
Apache Spark	Score 8.9 out of 10	N/A	Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.	N/A
Microsoft SQL Server	Score 8.7 out of 10	N/A	Microsoft SQL Server is a relational database.	$1,418 Per License

Pricing

Apache Spark

Microsoft SQL Server

Editions & Modules

No answers on this topic

Subscription: $1,418.00
Per License
Enterprise: $13,748.00
Per License

Offerings

Pricing Offerings
Apache Spark	Microsoft SQL Server
Free Trial
No	No
Free/Freemium Version
No	No
Premium Consulting/Integration Services
No	No

Entry-level Setup Fee

No setup fee

No setup fee

Additional Details

—

—

More Pricing Information

Community Pulse
	Apache Spark	Microsoft SQL Server
Considered Both Products	Apache Spark Ananth Gouri Assistant Professor Chose Apache Spark We used Surprise Kit for one of the other research works. It is more fine-tuned to Recommendation systems and their algorithms. Apache Spark has MLlib for majority of ML problems. Where as software like Surprse Kit - it suitable for a specific task of Recommendations only. Incentivized Helpful? Riyaz Khan Staff Engineer Chose Apache Spark Apache Spark is a fast-processing in-memory computing framework. It is 10 times faster than Apache Hadoop. Earlier we were using Apache Hadoop for processing data on the disk but now we are shifted to Apache Spark because of its in-memory computation capability. Also in SAP … Incentivized Helpful? Steven Li Senior Software Developer (Consultant) Chose Apache Spark Other teams used to work on Apache Hadoop but our team started with Apache Spark directly. Incentivized Helpful? Verified User Anonymous Chose Apache Spark There are a few alternatives that can do the same transformation and aggregation like Apache Spark can do but most of them are not able to perform parallel computation. For example, pandas is a really good tool to do that but not parallelized; However, there are some tools that … Incentivized Helpful? Surendranatha Reddy Chappidi Senior Data Engineer Chose Apache Spark Apache Spark works in distributed mode using cluster Informatica and Datastage cannot scale horizontally We can write custom code in spark, whereas in Datastage and Informatica we can only choose the different features proivided already. Incentivized Helpful? Verified User Anonymous Chose Apache Spark Apache Spark has much more better performance and features if we compare with Hive or map/reduce kind of solutions. Spark has many other features for machine learning, streaming. Incentivized Helpful? Chetan Munegowda Software Engineer Chose Apache Spark Spark is simply awesome to work on with any data sets and also has an in-memory database which makes it very flexible. Incentivized Helpful? YM Yogesh Mhasde Technical Manager Chose Apache Spark 1. Apache Spark is almost 100 % faster than Hadoop. 2. Apache Spark is more stable than Amazon EMR. 3. The end to end distributed machine library is more robust in Apache Spark. Incentivized Helpful? Verified User Anonymous Chose Apache Spark Databricks uses Spark as a foundation, and is also a great platform. It does bring several add-ons, which we did not feel needed by the time we evaluated - and haven't needed since then. One interesting plus in our opinion was the engineering support, which is great depending … Incentivized Helpful? Verified User Anonymous Chose Apache Spark It is easy to learn, read and to maintain. It brings the best of the Ruby on Rails framework from Java that helps to create a web service so easily. Communication is one of the most distinctive features of Apache Spark compared to alternative products. You are able to … Incentivized Helpful? SS Shiv Shivakumar Acquisitions Leader Chose Apache Spark We evaluated SAS alongside with Apache Spark but during the course of proof of concept found that Apache Spark was able to support the hadoop eco-system and hadoop file system much better. It was much faster at that time while having the ability to process data quickly for the … Incentivized Helpful? Carla Borges Consultor Tecnico - Java Developer and Php Developer. Chose Apache Spark I prefer Apache Spark compared to Hadoop, since in my experience Spark has more usability and comes equipped with simple APIs for Scala, Python, Java and Spark SQL, as well as provides feedback in REPL format on the commands. At the same time, Apache Spark seems to have the … Incentivized Helpful? Nitin Pasumarthy Software Engineer Chose Apache Spark All the above systems work quite well on big data transformations whereas Spark really shines with its bigger API support and its ability to read from and write to multiple data sources. Using Spark one can easily switch between declarative versus imperative versus functional … Incentivized Helpful? Kartik Chavan Data Analyst Chose Apache Spark Even with Python, MapReduce is lengthy coding. Combination of Python with Apache Spark will not only shorten the code, but it will effectively increase the speed of algorithms. Occasionally, I use MapReduce, but Apache Spark will replace MapReduce very soon. It has many … Incentivized Helpful? Anson Abraham Data Czar Chose Apache Spark vs MapRedce, it was faster and easier to manage. Especially for Machine Learning, where MapReduce is lacking. Also Apache Storm was slower and didn't scale as much as Spark does. Spark elasticity was easier to apply compared to storm and MapReduce. managing resources for … Incentivized Helpful? Verified User Anonymous Chose Apache Spark We specifically choose Spark over MapReduce to make the cluster processing faster Incentivized Helpful? Verified User Anonymous Chose Apache Spark Spark in comparison to similar technologies ends up being a one stop shop. You can achieve so much with this one framework instead of having to stitch and weave multiple technologies from the Hadoop stack, all while getting incredibility performance, minimal boilerplate, and … Incentivized Helpful? Kamesh Emani Software Developer Intern Chose Apache Spark Apache Pig and Apache Hive provide most of the things spark provide but apache spark has more features like actions and transformations which are easy to code. Spark uses optimization technique as we can select driver program and manipulate DAG (Directed Acyclic Graph) Python … Incentivized Helpful? Verified User Anonymous Chose Apache Spark There are a few newer frameworks for general processing like Flink, Beam, frameworks for streaming like Samza and Storm, and traditional Map-Reduce. I think Spark is at a sweet spot where its clearly better than Map-Reduce for many workflows yet has gotten a good amount of … Incentivized Helpful? Jordan Moore Staff Consultant Chose Apache Spark Spark has primarily replaced my use of writing pure Hadoop MapReduce or Apache Pig jobs for processing data. I like the fact that I can alternate between the main programming languages that I know - Java and Python - and use those to learn the Scala API. Spark also can be … Incentivized Helpful?	Microsoft SQL Server Verified User Anonymous Chose Microsoft SQL Server You could consider i did use Mysql since i worked with some websites that were using a mysql database. I could not give a side by side comparision since i don't use those like i use the Microsoft SQL , but so far it worked well. I prefer Microsoft SQL due to support and info … Incentivized Helpful? Verified User Anonymous Chose Microsoft SQL Server UI of the Microsoft SQL Server makes it easy to use and learn. The better technical support and documentation give it an extra edge over other databases. The Microsoft ecosystem provides additional advantages, as we can seamlessly use other Microsoft products, such as Power … Incentivized Helpful? VK varun kohli Senior Executive Chose Microsoft SQL Server Oracle Database, MySQL, Db2 and Snowflake Incentivized Helpful? Verified User Anonymous Chose Microsoft SQL Server Microsoft SQL Server is faster and more compatible, but it does cost more, so you're paying for those features. I use the others in many other places where critical transaction processing time and compatibility aren't of great concern. Incentivized Helpful? Verified User Anonymous Chose Microsoft SQL Server Microsoft SQL is slower than MySQL and Access but far more feature-rich and reliable. Access is almost obsolete nowadays, so not too many people are considering it, but unless budget or an open-source ethos is a factor, Microsoft SQL is superior in every way. Many commonly used … Incentivized Helpful? Verified User Anonymous Chose Microsoft SQL Server Microsoft SQL Server providers a more user friendly experience when it comes to Microsoft SQL Server components management via its unique SQL Server management Studio. It is also a production ready, resilient, highly available and tested database management system (DBMS). The … Incentivized Helpful? Verified User Anonymous Chose Microsoft SQL Server The first database application taught when I was in school was Microsoft SQL Server. Microsoft SQL Server was used where I first started, so I had the opportunity to improve myself in MySQL. SQL is also used in my current workplace. It is widely used in very large projects due … Incentivized Helpful? Gordon Lo Digital Solutions Architect Chose Microsoft SQL Server We have a few different DB's in the organization, including: Pervasive, Oracle, Db2, MySQL. Many of them are of limited use for one specific application. These don't really compare to MS SQL server. Oracle is heavy and cumbersome and overkill for smaller apps. Pervasive - … Incentivized Helpful? Pritesh Patel Product Manager, Partnerships & Data Services Chose Microsoft SQL Server Microsoft SQL Server is a comprehensive solution as transactional database, data warehouse, analytics, reporting, and ETL. It also integrates with the cloud well (Azure). The ease of use and setup makes this better than Oracle Database because the query syntax is also different … Incentivized Helpful? Verified User Anonymous Chose Microsoft SQL Server I think both tools are really powerful and close to each other but since I moved to Europe I realized that most of the companies have been using SQL Server which in my opinion means something. The support from Microsoft I also consider a bit better and you can also find more … Incentivized Helpful? Blake Baron Senior Financial Analyst - Cost Analytics Chose Microsoft SQL Server Microsoft was the original creator of the SQL database, and thus, they still rule the market and drive innovation when it comes to data warehousing systems. It's comparable in price and allows you to retain the structured datasets that you lose when you change to a NoSQL … Incentivized Helpful? Verified User Anonymous Chose Microsoft SQL Server [Microsoft] SQL Server has a much better community and professional support and is overall just a more reliable system with Microsoft behind it. I've used MySQL in the past and SQL Server has just become more comfortable for me and is my go to RDBMS. Incentivized Helpful? Verified User Anonymous Chose Microsoft SQL Server Microsoft SQL Server and Oracle are both extremely powerful and scalable enterprise relational database platforms. Microsoft SQL sets itself apart with its ease of use and licensing and support model. Microsoft is good company to work with and they provide clear and … Incentivized Helpful? RH Rick Hahn Consultant \ Support Chose Microsoft SQL Server It just boils down to why learn anther product when you are going to run across it so seldom. Developers determine what database engine I am going to need so I just tend to pick products for implementation that use a well know product that has lots of support resources … Incentivized Helpful? Verified User Anonymous Chose Microsoft SQL Server The most known and widely used competitor of Microsoft SQL is most probably the open-source MySQL. If given the choice I would personally choose MySQL over Microsoft's SQL Server, mainly because it is totally free and open source, but also because it integrates better with … Incentivized Helpful? Verified User Anonymous Chose Microsoft SQL Server Microsoft SQL Server was our first choice as we are a Microsoft partner. Incentivized Helpful? Salvador Orochena Salesforce Developer Chose Microsoft SQL Server [Microsoft SQL Server] offers a full solution, Inhouse Applications and hosted application continue to use SQL as backend database. Allows easy creation of development environments and continuous feature release. Incentivized Helpful? KG Kimberly Graham Operations Manager Chose Microsoft SQL Server All of the platforms have their own benefit. I was not the decision maker in selecting Microsoft SQL Server, as it was already being utilized when I joined the company, 7 years ago. I can say that I feel more comfortable with utilizing this platform as opposed to the other ones. Incentivized Helpful? John Ramírez Database Administrator Chose Microsoft SQL Server The free version is very powerfull and easy to install and use for small companies. Going to Professional and Standard, gives you all the support and the flexibility needed. It is known within the Database Administrator crew, and you can get support very easily over the … Incentivized Helpful? Akhilesh Arya Assistant Professor Chose Microsoft SQL Server SOL Sever is less expensive than any edition of Oracle database. Incentivized Helpful? IS Ivan Sytchev Supervisor, Information Technology Chose Microsoft SQL Server Native to Windows and being required for other MS apps puts it above others in terms of usage. If we were not heavily dependent on Microsoft applications or OS, we might have considered other database solutions. It's an expensive solutions but it is a solid reliable solution. … Incentivized Helpful? John P. Maher Independent Consultant Chose Microsoft SQL Server I was not too impressed with Oracle. Following the manual prohibited installation. They did provide a phone number and explained the manual was wrong and provided me with the correct information with which I was able to install the product. This was awhile back and I do not … Incentivized Helpful? Verified User Anonymous Chose Microsoft SQL Server Microsoft SQL Server is one of the fastest RDBMS systems available in the market. Pricing is a bit on the higher side but all the features it provides pretty much justifies it. It can be integrated with a large number of frameworks thus enabling to work on multiple frameworks … Incentivized Helpful? JJ Jewon Ju Data Analyst Chose Microsoft SQL Server IBM Db2 on Cloud, MySQL and PostgreSQL Incentivized Helpful? Verified User Anonymous Chose Microsoft SQL Server Microsoft SQL Server is still the industry standard for the type of development we do, and the types of applications that we use. Almost every developer or analyst we hire has at least a reasonable grounding in the use of SQL servers, and it is almost universally compatible … Incentivized Helpful?

Best Alternatives
	Apache Spark	Microsoft SQL Server
Small Businesses	No answers on this topic	InterSystems IRIS Score 8.1 out of 10
Medium-sized Companies	Cloudera Manager Score 9.9 out of 10	InterSystems IRIS Score 8.1 out of 10
Enterprises	IBM Analytics Engine Score 7.1 out of 10	SAP IQ Score 6.0 out of 10
All Alternatives	View all alternatives	View all alternatives

User Ratings
	Apache Spark	Microsoft SQL Server
Likelihood to Recommend	9.0 (0 ratings)	7.9 (0 ratings)
Likelihood to Renew	10.0 (0 ratings)	9.0 (0 ratings)
Usability	8.0 (0 ratings)	7.6 (0 ratings)
Availability	- (0 ratings)	10.0 (0 ratings)
Performance	- (0 ratings)	9.0 (0 ratings)
Support Rating	8.7 (0 ratings)	7.9 (0 ratings)
In-Person Training	- (0 ratings)	9.0 (0 ratings)
Online Training	- (0 ratings)	9.0 (0 ratings)
Implementation Rating	- (0 ratings)	9.0 (0 ratings)
Configurability	- (0 ratings)	10.0 (0 ratings)
Ease of integration	- (0 ratings)	9.0 (0 ratings)
Product Scalability	- (0 ratings)	9.0 (0 ratings)
Vendor post-sale	- (0 ratings)	9.0 (0 ratings)
Vendor pre-sale	- (0 ratings)	9.0 (0 ratings)

User Testimonials
	Apache Spark	Microsoft SQL Server
Likelihood to Recommend	Apache Spark has rich APIs for regular data transformations or for ML workloads or for graph workloads, whereas other systems may not such a wide range of support. Choose it when you need to perform data transformations for big data as offline jobs, whereas use MongoDB-like distributed database systems for more realtime queries. Incentivized Nitin Pasumarthy Software Engineer Read full review	Microsoft SQL Server is ideal for highly available SQL workloads by using SQL Server Always On availability groups. Microsoft SQL Server might not be appropriate for solutions which require a very low resource footprint, since it requires significant CPU cores and RAM memory as well as high IOPS, always depending on the usage scenario. Incentivized Verified User Anonymous Read full review
Pros	It performs a conventional disk-based process when the data sets are too large to fit into memory, which is very useful because, regardless of the size of the data, it is always possible to store them. It has great speed and ability to join multiple types of databases and run different types of analysis applications. This functionality is super useful as it reduces work times Apache Spark uses the data storage model of Hadoop and can be integrated with other big data frameworks such as HBase, MongoDB, and Cassandra. This is very useful because it is compatible with multiple frameworks that the company has, and thus allows us to unify all the processes. Incentivized Carla Borges Consultor Tecnico - Java Developer and Php Developer. Read full review	Easy to configure and use with Visual Studio and Dot Net Easy integration with MSBI to perform data analysis Data Security Easy to understand and use Very easy to export database and tables in the form of SQL query or a script Incentivized SP Shivani Pandey Project Engineer Read full review
Cons	Memory management. Very weak on that. PySpark not as robust as scala with spark. spark master HA is needed. Not as HA as it should be. Locality should not be a necessity, but does help improvement. But would prefer no locality Incentivized Anson Abraham Data Czar Read full review	The import/export process can be tricky to follow with lots of steps and could be better for importing flat files Obtaining help from Microsoft is cumbersome and often other internet sources are better and quicker The documentation is not great and again it's generally better to obtain help elsewhere if needed Incentivized Paul Nockolds Data Analyst / Scuba Dive Instructor Read full review
Likelihood to Renew	Capacity of computing data in cluster and fast speed. Steven Li Senior Software Developer (Consultant) Read full review	I think it is unlikely that sql server has disappointed someone, it is likely that someone will come initially discouraged if the needs and problems that occur are very challenging, but definitely have a SQL oriented system means having a solid base to work and on which maintain the their data securely Incentivized Luca Campanelli Database Administrator Read full review
Usability	If the team looking to use Apache Spark is not used to debug and tweak settings for jobs to ensure maximum optimizations, it can be frustrating. However, the documentation and the support of the community on the internet can help resolve most issues. Moreover, it is highly configurable and it integrates with different tools (eg: it can be used by dbt core), which increase the scenarios where it can be used Incentivized Verified User Anonymous Read full review	SQL Server mostly 'just works' or generates error messages to help you sort out the trouble. You can usually count on the product to get the job done and keep an eye on your potential mistakes. Interaction with other Microsoft products makes operating as a Windows user pretty straight forward. Digging through the multitude of dialogs and wizards can be a pain, but the answer is usually there somewhere. Incentivized Andy Erickson Senior Consultant Read full review
Reliability and Availability	No answers on this topic	Its does not have outages. Incentivized SS Srikanth S Sr MS SQL BI Developer Read full review
Performance	No answers on this topic	SSAS data cubes may some time slow down your Excel reports. Incentivized SS Srikanth S Sr MS SQL BI Developer Read full review
Support Rating	1. It integrates very well with scala or python. 2. It's very easy to understand SQL interoperability. 3. Apache is way faster than the other competitive technologies. 4. The support from the Apache community is very huge for Spark. 5. Execution times are faster as compared to others. 6. There are a large number of forums available for Apache Spark. 7. The code availability for Apache Spark is simpler and easy to gain access to. 8. Many organizations use Apache Spark, so many solutions are available for existing applications. YM Yogesh Mhasde Technical Manager Read full review	We managed to handle most of our problems by looking into Microsoft's official documentation that has everything explained and almost every function has an example that illustrates in detail how a particular functionality works. Just like PowerShell has the ability to show you an example of how some cmdlet works, that is the case also here, and in my opinion, it is a very good practice and I like it. Incentivized LH luka horvat Software Engineer Read full review
In-Person Training	No answers on this topic	It was good Incentivized SS Srikanth S Sr MS SQL BI Developer Read full review
Online Training	No answers on this topic	very hands on and detailed training Incentivized SS Srikanth S Sr MS SQL BI Developer Read full review
Implementation Rating	No answers on this topic	Other than SQL taking quite a bit of time to actually install there are no problems with installation. Even on hardware that has good performance SQL can still take close to an hour to install a typical server with management and reporting services. Incentivized Bill Starling Database Administrator Read full review
Alternatives Considered	We used Surprise Kit for one of the other research works. It is more fine-tuned to Recommendation systems and their algorithms. Apache Spark has MLlib for majority of ML problems. Where as software like Surprse Kit - it suitable for a specific task of Recommendations only Incentivized Ananth Gouri Assistant Professor Read full review	Microsoft SQL is slower than MySQL and Access but far more feature-rich and reliable. Access is almost obsolete nowadays, so not too many people are considering it, but unless budget or an open-source ethos is a factor, Microsoft SQL is superior in every way. Many commonly used tools, like Crystal Reports, support it. Incentivized Verified User Anonymous Read full review
Scalability	No answers on this topic	SQL server does handle growing demands of a mid sized company. Incentivized SS Srikanth S Sr MS SQL BI Developer Read full review
Return on Investment	Faster turn around on feature development, we have seen a noticeable improvement in our agile development since using Spark. Easy adoption, having multiple departments use the same underlying technology even if the use cases are very different allows for more commonality amongst applications which definitely makes the operations team happy. Performance, we have been able to make some applications run over 20x faster since switching to Spark. This has saved us time, headaches, and operating costs. Incentivized Verified User Anonymous Read full review	Increased accuracy - We went from multiple users having different versions of an Excel spreadsheet to a single source of truth for our reporting. Increased Efficiency - We can now generate reports at any time from a single source rather than multiple users spending their time collating data and generating reports. Improved Security - Enterprise level security on a dedicated server rather than financial files on multiple laptop hard drives. Incentivized Verified User Anonymous Read full review
ScreenShots