<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Use Nifi PutHDFS and SparkSQL report file not found in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Use-Nifi-PutHDFS-and-SparkSQL-report-file-not-found/m-p/412670#M253608</link>
    <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/94989"&gt;@Meepoljd&lt;/a&gt;&amp;nbsp;Spark will read the metadata of the table and get the list of files to be read and if there is continuous changes ( delete/overwrite)&amp;nbsp; on the table and on the execution of read operation, the spark job can fail with FNF exception.&lt;BR /&gt;&lt;BR /&gt;The option here would be to minimise the change duration and run the job when there are no changes&amp;nbsp;&lt;BR /&gt;OR catch the exception within the spark code and rebuild the dataframe.&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 15 Oct 2025 09:37:08 GMT</pubDate>
    <dc:creator>haridjh</dc:creator>
    <dc:date>2025-10-15T09:37:08Z</dc:date>
    <item>
      <title>Use Nifi PutHDFS and SparkSQL report file not found</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Use-Nifi-PutHDFS-and-SparkSQL-report-file-not-found/m-p/412669#M253607</link>
      <description>&lt;P&gt;I am using NIFI to read data from FTP and push it to HDFS. My business process is to schedule workflows for subsequent computing tasks through dophinschedule. One detail here is that my workflow will execute Alter for corresponding analysis before running.&lt;BR /&gt;I have noticed that the Spark task occasionally encounters a FileNotFound error during runtime, which may cause the task to fail. It is speculated that this is because partition information has already been added. At this time, when the task is running, NIFI is still writing data to the corresponding partition, and the file being written will cause this error. How to optimize this problem?&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Meepoljd_0-1760511333028.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/46382i2017B1B53658F8E5/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Meepoljd_0-1760511333028.png" alt="Meepoljd_0-1760511333028.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 15 Oct 2025 06:58:30 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Use-Nifi-PutHDFS-and-SparkSQL-report-file-not-found/m-p/412669#M253607</guid>
      <dc:creator>Meepoljd</dc:creator>
      <dc:date>2025-10-15T06:58:30Z</dc:date>
    </item>
    <item>
      <title>Re: Use Nifi PutHDFS and SparkSQL report file not found</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Use-Nifi-PutHDFS-and-SparkSQL-report-file-not-found/m-p/412670#M253608</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/94989"&gt;@Meepoljd&lt;/a&gt;&amp;nbsp;Spark will read the metadata of the table and get the list of files to be read and if there is continuous changes ( delete/overwrite)&amp;nbsp; on the table and on the execution of read operation, the spark job can fail with FNF exception.&lt;BR /&gt;&lt;BR /&gt;The option here would be to minimise the change duration and run the job when there are no changes&amp;nbsp;&lt;BR /&gt;OR catch the exception within the spark code and rebuild the dataframe.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 15 Oct 2025 09:37:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Use-Nifi-PutHDFS-and-SparkSQL-report-file-not-found/m-p/412670#M253608</guid>
      <dc:creator>haridjh</dc:creator>
      <dc:date>2025-10-15T09:37:08Z</dc:date>
    </item>
    <item>
      <title>Re: Use Nifi PutHDFS and SparkSQL report file not found</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Use-Nifi-PutHDFS-and-SparkSQL-report-file-not-found/m-p/412709#M253639</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/94989"&gt;@Meepoljd&lt;/a&gt;,&amp;nbsp;Did the response assist in resolving your query? If it did, kindly mark the relevant reply as the solution, as it will aid others in locating the answer more easily in the future.&lt;/P&gt;</description>
      <pubDate>Tue, 21 Oct 2025 08:16:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Use-Nifi-PutHDFS-and-SparkSQL-report-file-not-found/m-p/412709#M253639</guid>
      <dc:creator>VidyaSargur</dc:creator>
      <dc:date>2025-10-21T08:16:12Z</dc:date>
    </item>
  </channel>
</rss>

