Likelihood to Recommend Altogether, I want to say that Apache Hadoop is well-suited to a larger and unstructured data flow like an aggregation of web traffic or even advertising. I think Apache Hadoop is great when you literally have petabytes of data that need to be stored and processed on an ongoing basis. Also, I would recommend that the software should be supplemented with a faster and interactive database for a better querying service. Lastly, it's very cost-effective so it is good to give it a shot before coming to any conclusion.
Read full review Blob storage is well suited to hosting/sharing zipped files rather than several smaller files, as folder enumeration/listing is not supported. Files uploaded are case-sensitive, so users need to be educated on the correct naming convention format if they are delegating the file-sharing process outside of IT.
Read full review Pros Handles large amounts of unstructured data well, for business level purposes Is a good catchall because of this design, i.e. what does not fit into our vertical tables fits here. Decent for large ETL pipelines and logging free-for-alls because of this, also. Read full review Easy to work with - drag and drop Capable of long term retention Relatively low cost Good for different data formats Allows archival storage Read full review Cons Less organizational support system. Bugs need to be fixed and outside help take a long time to push updates Not for small data sets Data security needs to be ramped up Failure in NameNode has no replication which takes a lot of time to recover Read full review If we are transferring huge amount of data (outbound), it can get quite expensive. With new features being added constantly, although a good thing, at times it becomes difficult to keep up with the changes. Documentation needs to keep UpToDate and should include best practices. Performance can be improved especially when it comes to cold storage. Read full review Likelihood to Renew Hadoop is organization-independent and can be used for various purposes ranging from archiving to reporting and can make use of economic, commodity hardware. There is also a lot of saving in terms of licensing costs - since most of the Hadoop ecosystem is available as open-source and is free
Read full review Usability As Hadoop enterprise licensed version is quite fine tuned and easy to use makes it good choice for Hadoop administrators. It’s scalability and integration with Kerberos is good option for authentication and authorisation. installation can be improved. logging can be improved so that it become easier for debugging purposes. parallel processing of data is achieved easily.
Read full review Blob storage is fairly simple, with several different options/settings that can be configured. The file explorer has enhanced its usability. Some areas could be improved, such as providing more details or stats on how many times a file has been accessed. It is an obvious choice if you're already using Azure/Entra.
Read full review Support Rating It's a great value for what you pay, and most Data Base Administrators (DBAs) can walk in and use it without substantial training. I tend to dabble on the analyst side, so querying the data I need feels like it can take forever, especially on higher traffic days like Monday.
Read full review Microsoft has improved its customer service standpoint over the years. The ability to chat with an issue, get a callback, schedule a call or work with an architecture team(for free) is a huge plus. I can get mentorship and guidance on where to go with my environment without pushy sales tactics. This is very refreshing. Typically support can get me to where I need to be on the first contact, which is also nice.
Read full review Online Training Hadoop is a complex topic and best suited for classrom training. Online training are a waste of time and money.
Read full review Alternatives Considered Not used any other product than Hadoop and I don't think our company will switch to any other product, as Hadoop is providing excellent results. Our company is growing rapidly, Hadoop helps to keep up our performance and meet customer expectations. We also use HDFS which provides very high bandwidth to support MapReduce workloads.
Read full review Azure Blob Storage is the best choice to store files when the app runs in Azure. It also has some advantages over S3, like Shared Access Signatures, that make it easy to control access to files directly via a URL. Azure Blob Storage is very fast and we have not had any major issues with it after using it for several years.
Read full review Return on Investment There are many advantages of Hadoop as first it has made the management and processing of extremely colossal data very easy and has simplified the lives of so many people including me. Hadoop is quite interesting due to its new and improved features plus innovative functions. Read full review Azure has increased the flexibility of where we place data within our organization. It has proven to be very reliable and always accessible. Read full review ScreenShots