Created on 11-18-2014 05:25 PM - edited 09-16-2022 02:13 AM
Hi
I have installed CDH 5.2.0 on a single node
When launch spark python and after successfully executing simple python sc.parallelize(range(1000)).count()
I don’t see the this application in spark history server http://localhost:18088, when open I can only see
Event Log Location : hdfs//localhost:8020/user/spark/applicationHistory
No Completed Application Found
Can you please advise me if I have check any configurations or
I appreciate any help on this
thanks
Pal
Created 11-24-2014 02:37 PM
This is an option to spark-submit or pyspark. Look at the Spark docs.
Created 11-19-2014 12:13 AM
Are you running Spark on YARN, or using Spark standalone? if the latter, you won't see any YARN history since it's not using YARN.
Created 11-20-2014 04:55 PM
Hi Sowen,
Thanks for your reply,
Are you running Spark on YARN, or using Spark standalone? if the latter, you won't see any YARN history since it's not using YARN.
Yes Spark is running on YARN (MR2 Included) I checked this in Cloudera Manager Web console Spark à Configuration
Is that mean I have to configure Spark to user YARN (MR2 Included) according in Cloudera Manager its already OR am I missing something
I did a default installation and followed wizard.
Can you please advice
thanks
Pal
Created 11-21-2014 01:07 AM
Yes but did you also submit your Spark app to YARN? what is your master for the app?
Created 11-24-2014 01:58 AM
Hi Sowen,
Below are the steps
Since its single node installation master and worker are in same node
Please advice if I am running this simple python program incorrectly
thanks
Pal
Created 11-24-2014 02:01 AM
Spark defaults to run with a local master IIRC. You should set "--master yarn-client" to actually use YARN. I assume it's not different for pyspark vs spark-shell.
Created 11-24-2014 02:22 PM
Hi Srowen,
Thanks for the update, I am new to spark, can you please guide where should I set "--master yarn-client" in configuration file or ?
thanks
Pal
Created 11-24-2014 02:37 PM
This is an option to spark-submit or pyspark. Look at the Spark docs.
Created 07-26-2019 09:59 AM
Hi Pal,
Can you grep for the particular application ID in the folder /user/spark/applicationHistory to make sure whether the job has been successfully completed or still in .inprogress state?
Thanks
AKR