Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Can't connect to Spark Cluster from RStudio

Highlighted

Can't connect to Spark Cluster from RStudio

New Contributor

capture.jpg

Hi


Trying to connect to Spark Cluster from RStudio on local machine.

I go to set up the connection. From the drop down menu for "Master:", I choose "Cluster'' and then I receive this message attached.

Connecting with a remote spark cluster requires an RStudio server instance...

I am very new to this. Can anyone shed any light on how to overcome this problem?

Thanks!

1 REPLY 1

Re: Can't connect to Spark Cluster from RStudio

Expert Contributor

I havn't used R studio.

But if you are looking for a notebook to launch spark jobs, then you can give a try to Apache Zeppelin.

HDP comes bundled with zeppelin and you can install it as a service on the cluster.

Please use https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.0/bk_zeppelin-component-guide/content/ch_inst... to install zeppelin using ambari.

Once you have zeppelin, you can use 'R interpreter' and start interacting with spark.

Steps to configure R interpreter : https://zeppelin.apache.org/docs/0.6.2/interpreter/r.html

Don't have an account?
Coming from Hortonworks? Activate your account here