Amazon EMR (Elastic MapReduce) vs. Db2

Overview
Amazon EMR
Rating: Score 8.9 out of 10
Most Used By: N/A
Product Summary: Amazon EMR is a cloud-native big data platform for processing vast amounts of data quickly, at scale. Using open source tools such as Apache Spark, Apache Hive, Apache HBase, Apache Flink, Apache Hudi (Incubating), and Presto, coupled with the scalability of Amazon EC2 and scalable storage of Amazon S3, EMR gives analytical teams the engines and elasticity to run petabyte-scale analysis.
Starting Price: N/A

Db2
Rating: Score 8.6 out of 10
Most Used By: N/A
Product Summary: Db2 is a family of relational database software solutions offered by IBM. It includes standard Db2 and Db2 Warehouse editions, deployable either in the cloud or on premises.
Starting Price: $0
Pricing
Editions & Modules

Amazon EMR (Elastic MapReduce): No answers on this topic

Db2:
Db2 on Cloud Lite: $0
Db2 on Cloud Standard: $99 per month
Db2 Warehouse on Cloud Flex One: $898 per month
Db2 on Cloud Enterprise: $946 per month
Db2 Warehouse on Cloud Flex for AWS: $2,957 per month
Db2 Warehouse on Cloud Flex: $3,451 per month
Db2 Warehouse on Cloud Flex Performance: $13,651 per month
Db2 Warehouse on Cloud Flex Performance for AWS: $13,651 per month
Db2 Standard Edition: Contact Sales
Db2 Advanced Edition: Contact Sales
Offerings
Pricing Offerings | Amazon EMR | Db2
Free Trial | No | Yes
Free/Freemium Version | No | Yes
Premium Consulting/Integration Services | No | Yes
Entry-level Setup Fee | No setup fee | Optional
Best Alternatives
Company Size | Amazon EMR (Elastic MapReduce) | Db2
Small Businesses | No answers on this topic | InterSystems IRIS (Score 8.0 out of 10)
Medium-sized Companies | Cloudera Manager (Score 9.9 out of 10) | InterSystems IRIS (Score 8.0 out of 10)
Enterprises | IBM Analytics Engine (Score 7.2 out of 10) | SAP IQ (Score 10.0 out of 10)
User Ratings
Rating | Amazon EMR (Elastic MapReduce) | Db2
Likelihood to Recommend | 8.0 (19 ratings) | 8.9 (113 ratings)
Likelihood to Renew | - (0 ratings) | 7.9 (12 ratings)
Usability | 7.0 (4 ratings) | 9.2 (9 ratings)
Availability | - (0 ratings) | 9.1 (64 ratings)
Performance | - (0 ratings) | 9.1 (12 ratings)
Support Rating | 9.0 (3 ratings) | 8.9 (6 ratings)
In-Person Training | - (0 ratings) | 8.2 (1 rating)
Implementation Rating | - (0 ratings) | 5.8 (3 ratings)
Configurability | - (0 ratings) | 9.1 (2 ratings)
Ease of integration | - (0 ratings) | 8.0 (4 ratings)
Product Scalability | - (0 ratings) | 8.5 (66 ratings)
Vendor post-sale | - (0 ratings) | 8.9 (2 ratings)
Vendor pre-sale | - (0 ratings) | 8.9 (2 ratings)
User Testimonials
Amazon EMR (Elastic MapReduce) | Db2
Likelihood to Recommend
Amazon AWS
We use it to run data preparation that takes a few hours on EC2 on a Spark-based EMR cluster instead, which completes the preparation in minutes rather than hours. It is easy to use and lets us choose between Hadoop and Spark. Processing time drops from 5-8 hours to 25-30 minutes compared with the EC2 instance, and by more in a few cases.
IBM
I have primarily used it as the basis for a SIS, but I have also migrated more than a few systems from their database systems to DB2 (Filemaker, MySQL, etc.). DB2 does have a better structural approach than Filemaker, which allows for more data consistency, but this can also lead to an inflexibility that can sometimes be counterintuitive when attempting to compensate for the flexibility of the work environment, as schools tend to have an all-in-one approach.
Pros
Amazon AWS
  • EMR does well in managing cost, as it uses the task node cores to process the data and these instances are cheaper when the data is stored on S3. It is really cost efficient. No need to maintain any libraries to connect to AWS resources.
  • EMR is highly available, secure and easy to launch. Not much hassle in launching the cluster (simple and easy).
  • EMR manages the big data frameworks, so the developer need not worry about memory and framework settings. It's all set up at launch time. The bootstrapping feature is great.
IBM
  • While we query a large set of data, the results are generally available within a minute or so.
  • Always reliable - I have never experienced an application going down.
  • It is easy to write queries and find tables and columns.
  • We can log in smoothly without any headaches.
Cons
Amazon AWS
  • It would have been better if packages like HBase and Flume were available with Amazon EMR. This would make the product even more helpful in some cases.
  • Products like Cloudera provide the options to move the whole deployment into a dedicated server and use it at our discretion. This would have been a good option if available with EMR.
  • If EMR gave the option to be used with any choice of cloud provider, it would have helped instead of having to move the data from another cloud service to S3.
IBM
  • Learning curve for DB resources - Improvements to UI or native command line built-ins can help with increasing efficiencies for DB resources
  • Better resource utilization monitoring and recommendations
  • Continue to adopt support for modern frameworks and languages, making it easier for organizations to see Db2 as the easy first choice
Likelihood to Renew
Amazon AWS
No answers on this topic
IBM
The DB2 database is a solid option for our school. We have been on this journey now for 3-4 years, so we are still adapting to what it can do. We will renew our use of DB2 because we don’t see a major need to change. Also, changing a main database in a school environment is a major project, so we’ll avoid that if possible.
Usability
Amazon AWS
Documentation is quite good and the product is regularly updated, so new features regularly come out. The setup is straightforward enough, especially once you have already established the overall platform infrastructure, and the aws-cli APIs are easy enough to use. It would be nice to have some out-of-the-box integrations for checking logs and the Spark UI, rather than relying on know-how and digging through multiple levels to find the information.
IBM
You have to be well versed in using the technology, not only from a GUI interface but from a command line interface to successfully use this software to its fullest.
Reliability and Availability
Amazon AWS
No answers on this topic
IBM
I have never had DB2 go down unexpectedly. It just works solidly every day. When I look at the logs, sometimes DB2 has figured out there was a need to build an index. Instead of waiting for me to do it, the database automatically created the index for me. At my current company, we have had zero issues for the past 8 years. We have upgraded the server 3 times and upgraded the OS each time, and the only thing we saw was that DB2 got better and faster. It is simply amazing.
Performance
Amazon AWS
No answers on this topic
IBM
The performance is exceptional if you take care to maintain the database. It is a very powerful tool and at the same time very easy to use. In our installation, we expect a DB machine on the mainframe with access to the database through ODBC connectors directly from branch servers, with a fabulous end-user experience.
Support Rating
Amazon AWS
I give the overall support for Amazon EMR this rating because while the support technicians are very knowledgeable and always able to help, it sometimes takes a very long time to get in contact with one of the support technicians. So overall the support is pretty good for Amazon EMR.
IBM
Easily the best product support team. :) Whenever we have questions, they have answered those in a timely manner and we like how they go above and beyond to help.
In-Person Training
Amazon AWS
No answers on this topic
IBM
The material was very clear and all subjects were covered.
Implementation Rating
Amazon AWS
No answers on this topic
IBM
Db2 works well with the application; also, the replication tool can keep it up.
Alternatives Considered
Amazon AWS
Snowflake is a lot easier to get started with than the other options. Snowflake's data lake building capabilities are far more powerful. Although Amazon EMR isn't our first pick, we've had an excellent experience with EC2 and S3. Because of our current API interfaces, it made more sense for us to continue with Hadoop rather than explore other options.
IBM
DB2 was more scalable and more easily configurable than the other products we evaluated and shortlisted in terms of functionality and pricing. IBM also gave a good on-premises demo and provided a sandbox so we could test and play with the product, and at that time DB2 came out better than other similar products.
Scalability
Amazon AWS
No answers on this topic
IBM
Because I use DB2 only to support my IzPCA activities, my knowledge here is somewhat limited.

Anyway, from what I was able to understand, DB2 is extremely scalable.

Maybe the information below can serve as an example of scalability. The customer has a huge mainframe environment: 13 z15 CECs, around 80 LPARs, and maybe more than 50 Sysplexes (I am not totally sure about this last figure).

Today we have 7 IzPCA databases, each one in a distinct Sysplex.

Plans are underway to end up with a small LPAR with only one DB2 subsystem and only one database, transmit the data from many other LPARs, and process all the data in that single database.

The IzPCA collect process (read the data received, manipulate it, and insert rows in the tables) is a huge process today, demanding many elapsed hours and lots of CPU.

Almost 100% of the tables are PBR type and the insert jobs run in parallel, but in 4 of the 7 databases it is a really huge and long process.

Combining the INSERT loads from the 7 databases into only one would be impossible.

But IzPCA recently introduced a new feature called "Continuous Collector". With that feature, small amounts of data will be transmitted to the central LPAR every 5 minutes (or even less) and processed immediately, in a short period of time and with little CPU use, instead of one or two transmissions a day of very large amounts of data, with the corresponding collect jobs running only once or twice a day, with long elapsed times and huge CPU consumption.

I suspect the total CPU seconds consumed will be more or less the same in both cases, but with the new method it will occur in small bursts many times a day!
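The reviewer's closing estimate can be made concrete with a little arithmetic. The Python sketch below is only an illustration: the daily CPU total is a made-up figure, the function name `collection_schedule` is hypothetical, and splitting the work evenly across runs is an assumption. Only the five-minute interval and the twice-a-day schedule come from the review.

```python
# Illustrative arithmetic only: the CPU figure is invented, not measured.
MINUTES_PER_DAY = 24 * 60


def collection_schedule(total_cpu_seconds: float, runs_per_day: int) -> dict:
    """Spread a fixed daily CPU budget evenly over a number of collect runs."""
    return {
        "runs_per_day": runs_per_day,
        "cpu_seconds_per_run": total_cpu_seconds / runs_per_day,
        "total_cpu_seconds": total_cpu_seconds,
    }


# Assumed daily total, deliberately identical in both cases.
DAILY_CPU_SECONDS = 7_200.0

# Bulk collection: two large transmissions per day.
bulk = collection_schedule(DAILY_CPU_SECONDS, runs_per_day=2)

# Continuous collection: one small transmission every 5 minutes.
continuous = collection_schedule(
    DAILY_CPU_SECONDS, runs_per_day=MINUTES_PER_DAY // 5
)

for name, plan in (("bulk", bulk), ("continuous", continuous)):
    print(
        f"{name}: {plan['runs_per_day']} runs/day, "
        f"{plan['cpu_seconds_per_run']:.0f} CPU s per run, "
        f"{plan['total_cpu_seconds']:.0f} CPU s total"
    )
```

Under these assumptions the daily total is unchanged, while the work per run shrinks from two long jobs to 288 short ones, which is the "small bursts" pattern the reviewer describes.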
Return on Investment
Amazon AWS
  • It was obviously cheaper and more convenient to use, as most of our data processing and pipelines are on AWS. It was fast and readily available with a click, and that saved a ton of time compared with having to figure out the downtime of the cluster if it's on premises.
  • It saved time on processing chunks of big data which had to be processed in a short period with minimal costs. EMR solved this, as the cluster setup time and processing were simple, easy, cheap and fast.
  • It had a negative impact, as it was very difficult to submit test jobs because it lacks a UI for submitting Spark code snippets.
IBM
  • Negative: Difficult and manual deployment
  • Negative: Missing assistants from common monitoring metrics
  • Positive: Stability
  • Positive: Performance
  • Positive: Resiliency and high availability (HADR)
  • Positive: Data Replication (Q-Rep)
  • Positive: Interaction with storage subsystems for backups (TSM, SVC)
  • Positive: Gigantic monitoring features in the form of table functions
Screenshots

Db2 Screenshots

Screenshot of Db2 - Data sharing
Screenshot of Db2 - Machine Learning
Screenshot of Db2 - Real time insights