Thank you @Consult , our idea is to use sparklyr to connect to Spark in our cluster, but using RStudio Desktop or RStudio Server. In our case, RStudio Server is outside the cluster, which steps should we follow to connect to a remote spark cluster? Cloudera Datascience Workbench is an option we may evaluate in the future, regarding this, is it necessary a separte node(s) for CDS to run? Could it run in an existing edge node?
... View more