Exciting tool from Cloudera
Pros
- One single IDE (browser based application) that makes Scala, R, Python integrated under one tool
- For larger organizations/teams, it lets you be self reliant
- As it sits on your cluster, it has very easy access of all the data on the HDFS
- Linking with Github is a very good way to keep the code versions intact
Cons
- Not as great as RStudio; lacks some features when compared with it
- It is quite simple still (because its very early in its initiative), and companies may want to wait until they see a more developed product
Return on Investment
- As the tool itself can access all the HDFS, Spark data easily, the wait time between teams has reduced
- Installation was a breeze, and ramp up time was fairly easy
Alternatives Considered
Microsoft Azure Machine Learning Workbench
Other Software Used
Hadoop, HBase, Apache Solr
