I have RStudio installed and running on my edge node. I installed R on 6 datanodes running Spark2. I have several questions. I have R version 3.4.1 (2017-06-30) -- "Single Candle" installed on the datanodes, Do I need to set HOME directories on the datanode or do I need other programs installed. (Running the latest HDP). I have run sparklyr before and it creates a Yarn job when running, is SparkR different from sparklyr? What packages must be installed on the R server when running SparkR? Is the a good step by step on setting up and ruinning SparkR?
... View more