I am using a Spark standalone cluster, and below are my spark-env.sh properties.
export SPARK_EXECUTOR_INSTANCES=432
export SPARK_EXECUTOR_CORES=24
export SPARK_EXECUTOR_MEMORY=36G
export SPARK_DRIVER_MEMORY=24G
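In case the worker-side settings matter: as far as I understand, standalone mode also reads per-worker properties from spark-env.sh. A minimal sketch of those (the values below are illustrative assumptions, not something I have actually set):

export SPARK_WORKER_CORES=24     # cores this worker offers to executors (assumed value)
export SPARK_WORKER_MEMORY=40g   # total memory this worker can hand to executors (assumed value)
export SPARK_WORKER_INSTANCES=1  # number of worker processes per node (assumed value)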
I have 6 worker nodes. When I try to run a job that reads very large files and performs joins, it gets stuck and then fails. I can see 6 executors for the job, each showing 24GB.
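For context, this is roughly the shape of my submit command (the master URL, class name, and jar path are placeholders; I am not passing any per-job --conf overrides):

./bin/spark-submit \
  --master spark://master-host:7077 \
  --class com.example.MyJoinJob \
  /path/to/my-job.jar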
Could you please point me to any links or details that would help me tune this and understand the concepts of worker nodes and executors? I went through a Cloudera blog post, but it is mostly about YARN; I need this for a Spark standalone cluster.