Data Science Experience (DSX) addresses the entire data science lifecycle. It provides a choice of notebooks, collaboration features, tutorials, and production deployment of machine learning models for Spark, R, Python and other ML languages.
The benefit of running DSX local on HDP is that a large number of problems today require big data for better predictions. For example, deep learning is more effective with big data. Combining big data with the compute available on big data platforms such as HDP makes data science more accessible and scalable, and lets it draw on all of the data present in the enterprise to make more accurate predictions.
By running DSX local on HDP, customers will be able to use the compute provided by YARN against all of the data in their enterprise data lake to make more accurate predictions.
DSX already supports RStudio and Jupyter, and both IBM and Hortonworks are working to integrate Zeppelin into DSX. This will give data scientists more choices.