The DataRobot AI Platform is presented as a solution that accelerates and democratizes data science by automating the end-to-end journey from data to value and allows users to deploy AI applications at scale. DataRobot provides a centrally governed platform that gives users AI to drive business outcomes, that is available on the user's cloud platform-of-choice, on-premise, or as a fully-managed service. The solutions include tools providing data preparation enabling users to explore and…
$0
Db2
Score 8.6 out of 10
N/A
DB2 is a family of relational database software solutions offered by IBM. It includes standard Db2 and Db2 Warehouse editions, either deployable on-cloud, or on-premise.
DataRobot can be used for risk assessment, such as predicting the likelihood of loan default. It can handle both classification and regression tasks effectively. It relies on historical data for model training. If you have limited historical data or the data quality is poor, it may not be the best choice as it requires a sufficient amount of high-quality data for accurate model building.
I have primarily used it as the basis for a SIS, but I have also migrated more than a few systems from their database systems to DB2 (Filemaker, MySQL, etc.). DB2 has a more structured approach than Filemaker, which allows for more data consistency. That structure can also lead to an inflexibility that sometimes feels counterintuitive when you are trying to accommodate the flexibility of the work environment, as schools tend to have an all-in-one approach.
DataRobot uses algorithms to analyze and compare numerous machine-learning techniques, providing models that assist in company-wide decision making.
Our DataRobot program puts the strength of automated machine learning on an "even playing field" and allows us to make decisions in an extremely timely manner. The speed is consistent without being offset by errors or false negatives.
It encompasses many desired techniques that help companies in general reconfigure into artificial-intelligence-driven firms with little to no inconvenience.
The platform itself is very complicated. It probably can't function well without being complicated, but there is a big training curve to get over before you can effectively use it. Even I'm not sure if I'm effectively using it now.
The model DataRobot suggests for deployment is often not the best model for our purposes. We've had to do a lot of testing to determine which model is best. For regression models, DataRobot does give you a MASE score but, for some reason, often doesn't suggest the model with the best MASE score.
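For readers who want to double-check a MASE ranking by hand, here is a minimal sketch in plain Python. It assumes the common definition of MASE (forecast mean absolute error divided by the in-sample mean absolute error of a naive forecast) and is not DataRobot's API or its internal calculation. The function names `mase` and `best_by_mase` and the `season` parameter are hypothetical, introduced only for illustration.

```python
from typing import Dict, Sequence


def mase(actual: Sequence[float], forecast: Sequence[float],
         train: Sequence[float], season: int = 1) -> float:
    """Mean Absolute Scaled Error (assumed standard definition).

    Numerator: mean absolute error of the forecast on the holdout data.
    Denominator: in-sample mean absolute error of a naive forecast that
    repeats the value from `season` steps earlier in the training data.
    """
    if len(actual) != len(forecast) or len(actual) == 0:
        raise ValueError("actual and forecast must be non-empty and equal length")
    if len(train) <= season:
        raise ValueError("train must be longer than the seasonal lag")

    mae = sum(abs(a - f) for a, f in zip(actual, forecast)) / len(actual)
    naive_errors = [abs(train[i] - train[i - season])
                    for i in range(season, len(train))]
    scale = sum(naive_errors) / len(naive_errors)
    if scale == 0:
        raise ValueError("naive forecast error is zero; MASE is undefined")
    return mae / scale


def best_by_mase(scores: Dict[str, float]) -> str:
    """Return the model name with the lowest (best) MASE."""
    return min(scores, key=lambda name: scores[name])
```

A value below 1 means the model beats the naive forecast on average. Comparing the name returned by `best_by_mase` against the platform's recommendation is one way to spot the mismatch described above.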
The software will give you errors if output files are not entered correctly but will not exactly tell you how to fix them. Perhaps that is complicated, but being able to download a template with your data for an output file in the correct format would be nice.
DataRobot presents a machine-learning platform, designed by data scientists from an array of backgrounds, for constructing and developing precise predictive models in a fraction of the time previously taken. The technology involved addresses the critical shortage of data scientists by changing the speed and economics of predictive analytics. DataRobot utilizes parallel processing to evaluate models in R, Python, Spark MLlib, H2O and other open source libraries. It searches through possible combinations of algorithms, features, transformations, processing steps and tuning to yield the best models for the dataset and predictive goal.
The DB2 database is a solid option for our school. We have been on this journey now for 3-4 years, so we are still adapting to what it can do. We will renew our use of DB2 because we don’t see a major need to change. Also, changing a main database in a school environment is a major project, so we’ll avoid that if possible.
You have to be well versed in using the technology, not only from a GUI but also from a command-line interface, to use this software to its fullest.
I have never had DB2 go down unexpectedly. It just works solidly every day. When I look at the logs, sometimes DB2 has figured out there was a need to build an index. Instead of waiting for me to do it, the database automatically created the index for me. At my current company, we have had zero issues for the past 8 years. We have upgraded the server 3 times and upgraded the OS each time, and the only thing we saw was that DB2 got better and faster. It is simply amazing.
Performance is exceptional if you take care to maintain the database. It is a very powerful tool and at the same time very easy to use. In our installation, we have a DB machine on the mainframe with access to the database through ODBC connectors directly from branch servers, with a fabulous end-user experience.
As I write this report, I am working with DataRobot engineers in a complex environment, and we have their full support. We are in Mexico, and it is not common to get this level of commitment from companies without expensive service contracts. The installation is on-premise, the client does not want us to take control, and the client is also limited by internal IT regulations, so we are just doing magic and everybody is committed.
Easily the best product support team. :) Whenever we have questions, they have answered those in a timely manner and we like how they go above and beyond to help.
I've done machine learning in Python before; however, having to code and test each model individually was very time consuming and required a lot of expertise. The DataRobot approach is an excellent way of getting to a well-placed starting point. You can then pick up the model from there and fine-tune it further if you need to.
DB2 was more scalable and more easily configurable than the other products we evaluated and shortlisted in terms of functionality and pricing. IBM also gave a good on-premise demo and provided a sandbox where we could test and play with the product; at that time, DB2 came out ahead of other similar products.
Because I use DB2 only to support my IzPCA activities, my knowledge here is somewhat limited.
Anyway, from what I was able to understand, DB2 is extremely scalable.
Maybe the information below could serve as an example of scalability.
The customer has a huge mainframe environment: 13 z15 CECs, around 80 LPARs, and maybe more than 50 Sysplexes (I am not totally sure about this last figure).
Today we have 7 IzPCA databases, each one in a distinct Sysplex.
Plans are underway to end up with a small LPAR, with only one DB2 subsystem and only one database, then transmit the data from many other LPARs and process all of it in this single database.
The IzPCA collect process (read the data received, manipulate it, and insert rows in the tables) today is a huge process, demanding many elapsed hours, and lots of CPU.
Almost 100% of the tables are PBR type and the insert jobs run in parallel, but in 4 of the 7 databases it is still a really huge and long process.
Combining the INSERT loads from the 7 databases into only one would be impossible.
But, IzPCA recently introduced a new feature, called "Continuous Collector".
By using that feature, small amounts of data will be transmitted to the central LPAR every 5 minutes (or even less) and processed immediately, in a short period of time and with small use of CPU. This replaces one or two transmissions per day of very large amounts of data, with the corresponding collect jobs running only once or twice a day, with long elapsed times and huge consumption of CPU.
I suspect the total CPU seconds consumed will be more or less the same in both cases, but with the new method it will occur in small bursts many times a day!
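The arithmetic behind that suspicion can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the daily CPU figure is a made-up placeholder, not a measurement from this environment, and the hypothetical helper `cpu_bursts` simply splits a fixed daily total evenly across the collect runs, which a real workload would not do exactly.

```python
from typing import List


def cpu_bursts(total_cpu_seconds: float, runs_per_day: int) -> List[float]:
    """Split a fixed daily CPU total evenly across a number of collect runs."""
    if runs_per_day <= 0:
        raise ValueError("runs_per_day must be positive")
    return [total_cpu_seconds / runs_per_day] * runs_per_day


# Hypothetical daily total, chosen only to make the comparison concrete.
DAILY_CPU_SECONDS = 7200.0

# Old method: two large collect jobs per day.
batch = cpu_bursts(DAILY_CPU_SECONDS, 2)

# Continuous collection every 5 minutes: 24 * 60 / 5 = 288 runs per day.
continuous = cpu_bursts(DAILY_CPU_SECONDS, 24 * 60 // 5)

print("total CPU, batch:     ", sum(batch))
print("total CPU, continuous:", sum(continuous))
print("largest burst, batch:     ", max(batch))
print("largest burst, continuous:", max(continuous))
```

Under these assumptions the daily total is identical in both cases; only the size of each burst changes, which is the point made above.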