H2O was used as an analytical tool, with easy to access machine learning functionalities. The data science team comprises different people with different backgrounds and abilities to code. We used H2O as an easily trained on, highly accessible tool for beginners in the AI area. As an open source version, it is good for small projects and trials in data analysis, scoring, clustering, and predictive modeling. It is a really fast tool and also runs on older hardware.
- Excellent analytical and prediction tool
- In the beginning, usage of H20 Flow in Web UI enables quick development and sharing of the analytical model
- Readily available algorithms, easy to use in your analytical projects
- Faster than Python scikit learn (in machine learning supervised learning area)
- It can be accessed (run) from Python, not only JAVA etc.
- Well documented and suitable for fast training or self studying
- In the beginning, one can use the clickable Flow interface (WEB UI) and later move to a Python console. There is then no need to click in H20 Flow
- It can be used as open source
- No weaknesses found yet
- This is not really a drawback, but rather a warning - the Drivereless AI is not a replacement for a data scientist yet, and will not replace data scientists in the next decade neither. The Driverless AI feature delivers reliable results only if the analyst is sure about the meaning of input data. The data quality is usually a major issue and no tool can detect the meaning of data in the input. Data scientists are also required for business interpretation of the findings. So be careful, and do not rely on this feature without a good understanding of what it really does in each step.
Use H2O.ai whenever you need easy to use tool, when you must be cost efficient (you can not charge the client extra money for software licenses used), need a tool with lots of algorithms that are normally used in data analytics, or need to work on one machine (it is either not allowed to move data to cloud storage or simply not necessary to connect to Hadoop, etc.). Also, you can call H2O directly from Python which makes analysis more efficient.