Created on 11-15-2020 05:19 PM - edited on 11-22-2020 10:03 PM by VidyaSargur
export SPARK_RAPIDS_DIR=/opt/rapids
export SPARK_CUDF_JAR=${SPARK_RAPIDS_DIR}/cudf-0.15-cuda10-2.jar
export SPARK_RAPIDS_PLUGIN_JAR=${SPARK_RAPIDS_DIR}/rapids-4-spark_2.12-0.2.0.jar
spark3-shell \
--master yarn \
--deploy-mode client \
--driver-cores 6 \
--driver-memory 15G \
--executor-cores 8 \
--conf spark.executor.memory=15G \
--conf spark.rapids.sql.concurrentGpuTasks=4 \
--conf spark.executor.resource.gpu.amount=1 \
--conf spark.rapids.sql.enabled=true \
--conf spark.rapids.sql.explain=ALL \
--conf spark.rapids.memory.pinnedPool.size=2G \
--conf spark.kryo.registrator=com.nvidia.spark.rapids.GpuKryoRegistrator \
--conf spark.plugins=com.nvidia.spark.SQLPlugin \
--conf spark.rapids.shims-provider-override=com.nvidia.spark.rapids.shims.spark301.SparkShimServiceProvider \
--conf spark.executor.resource.gpu.discoveryScript=${SPARK_RAPIDS_DIR}/getGpusResources.sh \
--jars ${SPARK_CUDF_JAR},${SPARK_RAPIDS_PLUGIN_JAR}
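The command above depends on three files under ${SPARK_RAPIDS_DIR}: the two jars and the GPU discovery script, which Spark needs to be able to execute. A minimal pre-flight sketch follows. The function name check_rapids_files is made up for illustration, and the check only looks at the local host; it assumes the same layout exists on the YARN nodes.

```shell
# Pre-flight sketch: confirm the files referenced by the launch command exist.
# check_rapids_files is a hypothetical helper, not part of RAPIDS or Spark.
SPARK_RAPIDS_DIR="${SPARK_RAPIDS_DIR:-/opt/rapids}"

check_rapids_files() {
  dir="$1"
  missing=0
  # The two jars and the discovery script named in the post.
  for f in cudf-0.15-cuda10-2.jar rapids-4-spark_2.12-0.2.0.jar getGpusResources.sh; do
    if [ ! -r "$dir/$f" ]; then
      echo "missing or unreadable: $dir/$f" >&2
      missing=1
    fi
  done
  # The discovery script must be executable for Spark to run it.
  if [ -e "$dir/getGpusResources.sh" ] && [ ! -x "$dir/getGpusResources.sh" ]; then
    echo "not executable: $dir/getGpusResources.sh" >&2
    missing=1
  fi
  return "$missing"
}

if check_rapids_files "$SPARK_RAPIDS_DIR"; then
  echo "RAPIDS files look OK in $SPARK_RAPIDS_DIR"
else
  echo "fix the paths above before launching spark3-shell" >&2
fi
```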
spark submit
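For a batch job, the same options can be passed to the submit command instead of the shell. The sketch below is not definitive: it assumes the Spark 3 parcel provides spark3-submit alongside spark3-shell, and APP_JAR, APP_CLASS, and RUN are hypothetical placeholders introduced here. By default it only prints the assembled command; set RUN=1 to execute it.

```shell
# Sketch: reuse the spark3-shell options from the post for a batch submission.
SPARK_RAPIDS_DIR="${SPARK_RAPIDS_DIR:-/opt/rapids}"
SPARK_CUDF_JAR="${SPARK_RAPIDS_DIR}/cudf-0.15-cuda10-2.jar"
SPARK_RAPIDS_PLUGIN_JAR="${SPARK_RAPIDS_DIR}/rapids-4-spark_2.12-0.2.0.jar"

# Hypothetical application jar and main class; replace with your own.
APP_JAR="${APP_JAR:-/path/to/my-app.jar}"
APP_CLASS="${APP_CLASS:-com.example.MyApp}"

# Assemble the command as positional parameters so quoting is preserved.
set -- spark3-submit \
  --master yarn \
  --deploy-mode client \
  --driver-cores 6 \
  --driver-memory 15G \
  --executor-cores 8 \
  --conf spark.executor.memory=15G \
  --conf spark.rapids.sql.concurrentGpuTasks=4 \
  --conf spark.executor.resource.gpu.amount=1 \
  --conf spark.rapids.sql.enabled=true \
  --conf spark.rapids.sql.explain=ALL \
  --conf spark.rapids.memory.pinnedPool.size=2G \
  --conf spark.kryo.registrator=com.nvidia.spark.rapids.GpuKryoRegistrator \
  --conf spark.plugins=com.nvidia.spark.SQLPlugin \
  --conf spark.rapids.shims-provider-override=com.nvidia.spark.rapids.shims.spark301.SparkShimServiceProvider \
  --conf "spark.executor.resource.gpu.discoveryScript=${SPARK_RAPIDS_DIR}/getGpusResources.sh" \
  --jars "${SPARK_CUDF_JAR},${SPARK_RAPIDS_PLUGIN_JAR}" \
  --class "$APP_CLASS" \
  "$APP_JAR"

if [ "${RUN:-0}" = "1" ]; then
  "$@"                  # actually submit the job
else
  printf '%s\n' "$*"    # dry run: print the command only
fi
```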
Note: This configuration is not definitive; consult the RAPIDS.ai and Spark 3 documentation for further details.
Created on 01-27-2021 08:56 PM
Is it supported?
Created on 01-27-2021 09:11 PM
Spark 3 on CDP Private Cloud Base with GPU support is fully supported. RAPIDS.ai, however, is an NVIDIA product and not a Cloudera product offering.