<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question spark-submit fails giving FileNotFoundException in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/spark-submit-fails-giving-FileNotFoundException/m-p/87654#M21515</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am running a spark-submit job on a YARN cluster. During submission, the dependent jars are uploaded to the default HDFS staging directory, /user/&amp;lt;user id&amp;gt;/.sparkStaging/&amp;lt;yarn applicationId&amp;gt;/*.jar. While the job is being submitted I can see that the jar is uploaded, and the file's owner and group are the same user ID that runs spark-submit, but spark-submit still fails with the error below. I also tried setting the configuration parameter spark.yarn.StagingDir, but that did not help either.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any input on resolving this issue would be appreciated.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Error stack trace:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Diagnostics: File does not exist: hdfs://user/&amp;lt;user id&amp;gt;/.sparkStaging/&amp;lt;yarn application_id&amp;gt;/chill-java-0.5.0.jar&lt;BR /&gt;java.io.FileNotFoundException: File does not exist:&lt;/P&gt;&lt;P&gt;hdfs://user/&amp;lt;user id&amp;gt;/.sparkStaging/&amp;lt;yarn application_id&amp;gt;/chill-java-0.5.0.jar&lt;BR /&gt;at org.apache.hadoop.hdfs.DistributedFileSystem$20.doCall(DistributedFileSystem.java:1257)&lt;BR /&gt;at org.apache.hadoop.hdfs.DistributedFileSystem$20.doCall(DistributedFileSystem.java:1249)&lt;BR /&gt;at org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)&lt;BR /&gt;at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:1249)&lt;BR /&gt;at org.apache.hadoop.yarn.util.FSDownload.copy(FSDownload.java:251)&lt;BR /&gt;at org.apache.hadoop.yarn.util.FSDownload.access$000(FSDownload.java:61)&lt;BR /&gt;at org.apache.hadoop.yarn.util.FSDownload$2.run(FSDownload.java:359)&lt;BR /&gt;at org.apache.hadoop.yarn.util.FSDownload$2.run(FSDownload.java:357)&lt;BR /&gt;at 
java.security.AccessController.doPrivileged(Native Method)&lt;BR /&gt;at javax.security.auth.Subject.doAs(Subject.java:422)&lt;BR /&gt;at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1920)&lt;BR /&gt;at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:356)&lt;BR /&gt;at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:60)&lt;BR /&gt;at java.util.concurrent.FutureTask.run(FutureTask.java:266)&lt;BR /&gt;at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)&lt;BR /&gt;at java.util.concurrent.FutureTask.run(FutureTask.java:266)&lt;BR /&gt;at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)&lt;BR /&gt;at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)&lt;BR /&gt;at java.lang.Thread.run(Thread.java:748)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;Hemil&lt;/P&gt;</description>
    <pubDate>Fri, 16 Sep 2022 14:13:33 GMT</pubDate>
    <dc:creator>techsoln</dc:creator>
    <dc:date>2022-09-16T14:13:33Z</dc:date>
  </channel>
</rss>

