Created 12-14-2015 11:56 AM
I have a problem while try to run spark-submit to yarn-cluster
below is my spark-submit code
spark-submit --master yarn-cluster --name spark_ml user_recommendation.py
getting below error :
java.io.FileNotFoundException: File does not exist: hdfs://name-node:8020/user/spark/.sparkStaging/application_1450092198211_0007/pyspark.zip
Is it configuration issue?
Thanks,
Coktra
Created 12-14-2015 11:59 AM
Created 12-14-2015 11:59 AM
Created 12-14-2015 12:06 PM
Hi Neeraj,
thanks for the link, so there is no solution yet for this problem?
Created 12-14-2015 12:07 PM
@cokorda putra susila No. You can update that jira and vote for it
Created 12-14-2015 02:07 PM
@Neeraj Sabharwal thank you, i will vote to jira
Created 12-14-2015 02:13 PM
@cokorda putra susila I guess we can close this question for now. You can do it by accepting the jira response if you like.
Created 02-02-2016 02:16 AM
@cokorda putra susila can you accept the best answer to close this thread or provide your own solution?
Created 02-02-2016 07:50 PM
I am submitting spark-submit jobs with python code and they are running fine in YARN Cluster mode. I would like to understand the question a bit further. Is your Cluster running Spark? Have you setup YARN_CONF_DIR variable?
Created 02-04-2016 08:08 AM
What's your version of HDP and spark ?
Created 09-27-2018 12:06 PM
may be this will help - try your luck,
https://stackoverflow.com/questions/44231261/spark-yarn-file-does-not-exist-on-hdfs